Regularity conditions
|
|
- Gladys Hill
- 5 years ago
- Views:
Transcription
1 Statistica Siica: Supplemet MEASUREMENT ERROR IN LASSO: IMPACT AND LIKELIHOOD BIAS CORRECTION Øystei Sørese, Aroldo Frigessi ad Mage Thorese Uiversity of Oslo Supplemetary Material This ote cotais proofs of theorems ad propositios stated i the paper S Regularity coditios We assume xed true covariates which satisfy /X X = C xx Σ xx, as, / max i x ix i, as, S S where Σ xx is a positive deite matrix The radom measuremet errors are assumed ormally distributed with mea zero ad covariace Σ uu It follows Aderso 3, Th 344 that the limitig distributio of / C uu Σ uu is ormal with mea ad covariaces Σ uu ik Σ uu jl + Σ uu il Σ uu jk, where Σ uu ik is the i, kth elemet of Σ uu ad i, j, k, l {,, p} Now, C uu Σ uu, as, / max i u iu i, as, S3 S4 hold with probability It follows from S-S4 that with probability, C ww Σ ww ad / max i w i w i, as The limitig distributio of / C ww Σ ww has mea zero ad ite covariaces Regularity coditios like these have also bee assumed by, eg, Kight ad Fu ad Zhao ad Yu 6
2 S S Ø Sørese, A Frigessi, M Thorese Karush-Kuh-Tucker Coditios We itroduce the ew coeciet γ = β β, which yields the aive lasso o the form ˆγ = arg mi ɛ Wγ + γ C ww γ + γ C wu β + λ γ + β γ, S where we have removed all terms which are costat i γ Takig derivatives, we arrive at the Karush-Kuh-Tucker KKT coditios for the aive Lasso Lemma ˆγ = ˆβ β is a solutio to S if ad oly if /ɛ W + C ww ˆγ + C wu = λˆτ, where ˆτ R p satises ˆτ ad ˆτ j = sig ˆβj for j such that ˆβ j The same chage of variables for the corrected lasso yields { ˆγ = arg mi γ : γ+β R ɛ Wγ + γ C ww Σ uu γ S + γ C wu Σ uu β + λ γ + β } Due to the additioal costrait γ + β R added because of o-covexity, the KKT coditios ca oly characterize critical poits i the iterior of this domai A critical poit o the boudary may ot have a zero subgradiet Uder the assumptios of Loh ad Waiwright, for sucietly large, all local optima lie i a small l -ball aroud β We assume that R is chose large eough such that γ + β < R for all these optima, while R is small eough to avoid the trivial solutios for which oe or more compoet of ˆγ is ± Lemma Assume ˆγ = ˆβ β is a critical poit of S If ˆγ + β < R, the /ɛ W + C ww Σ uu ˆγ + C wu Σ uu = λˆτ, where ˆτ R p is as deed i Lemma S3 Proof of Propositio By deitio Bühlma ad va de Geer, / y Wˆβ + λ ˆβ / y Wβ + λ β, ad after reorgaizig terms, / W ˆβ β + λ ˆβ / ɛ Uβ W ˆβ β + λ β S3 Uder S3, / ɛ Uβ W ˆβ β / ɛ Uβ W ˆβ β λ ˆβ β,
3 Measuremet Error i Lasso S3 which iserted ito S3 yields / W ˆβ β + λ ˆβ λ ˆβ β + λ β Now use, ˆβ β S ˆβ S β S + ˆβ S c, ˆβ β = ˆβ S β S + ˆβ S c, ad λ λ, to obtai / W ˆβ β + λ ˆβ S c 3λ ˆβ S β S S3 S33 Iequality S33 shows that ˆβ S c 3 ˆβ S β S That is, the vector ˆβ β is amog the vectors to which the compatibility coditio applies, for the idex set S Next, use S3 agai i S33 to obtai / W ˆβ β + λ ˆβ β 4λ ˆβ S β S S34 Uder the compatibility coditio o S, ˆβ S β S s / φ W ˆβ / β Usig this ad the iequality 4uv 4u + v i S34, we arrive at W ˆβ β + λ ˆβ β S 4λ s /φ S4 Proof of Propositio This proof goes alog the lies of the proof of Theorem i Kight ad Fu, but with the additio of measuremet error We start with the aive Lasso after reparametrizatio, ad let L γ = γ W ɛ + γ C ww γ + γ C wu β + λ γ + β S45 Note that / W/ ɛ d N, 4/σ Σ ww ; the rst term i S45 coverges i distributio to a ormally distributed quatity whose variace goes to zero as /, which is equivalet to covergece i probability to zero Combiig this result with the assumptio that λ as, yields L γ p Lγ = γ Σ ww γ + γ Σ uu β Sice L γ is covex, it follows that argmi{l γ} p argmi{lγ} Kight ad Fu γ γ The miimum of Lγ is easily foud, ad accordigly, ˆγ p Σ wwσ uu β The result follows immediately
4 S4 Ø Sørese, A Frigessi, M Thorese S5 Proof of Theorem We follow the structure of the proof by Zhao ad Yu 6, who proved the correspodig result i the absece of measuremet error Cosider the aive lasso, ad ote that { sig β S ˆγS > β } S {sig = sig ˆβS β } S ad ˆγ S c = ˆβ S c = Thus, by the KKT coditios for the aive lasso Lemma, if a solutio ˆγ exists, ad W S ɛ + C ww S, S ˆγ S + C wu S, S β S = λ sig β S, S5 ˆγ S < β S, S5 Sc W ɛ + C ww S c, S ˆγ S + C wu S, c S β S λ, S53 the sigˆβ S = sigβ S ad sigˆβ S c = Evet A implies the existece of ˆγ S < β S such that Z ɛ Z β S = ˆγ S λ C ww S, S sig β S Buth the there must also exist ˆγ S < β S such that Z ɛ Z β S = ˆγ S λ C ww S, S sig β S, which essetially meas choosig the appropriate sigs of the elemets of ˆγ S Multiplyig through by C ww S, S ad reorgaizig terms, we get S5 Thus, A esures that S5 ad S5 are satised Next, addig ad subtractig C ww S, c S ˆγ S to the left-had side of evet B ad the usig the triagle iequaltity, yields W S c ɛ + + C ww S c, S ˆγ S + C wu S, c S β S C wws, c S C ww S, S W S ɛ + C ww S c, S C ww S, S C wu S, S β S + C ww S c, S ˆγ S λ θ The secod term o the left-had side of this expressio is the left-had side of S5 multiplied by C ww S, c S C ww S, S It ca thus be replaced by the right-had side of S5 multiplied by this factor This yields W S c ɛ + + C ww S c, S ˆγ S + C wu S, c S β S
5 Measuremet Error i Lasso S5 λ C wws, c S C ww S, S sig β S λ θ, which implies, due to the IC-ME, W S c ɛ + C ww S c, S ˆγ S + C wu S, c S β S λ This is ideed S53 Altogether, A implies S5 ad S5, while B A implies S53 For the asymptotic result, dee the vectors We have z = C ww S, S W S ɛ, a = β S Cww S, S C wu S, S β S, b = C ww S, S sig β S, ζ = C ww S, c S C ww S, S W S W S c ɛ, f = C ww S c, S C ww S, S C wu S, S C wu S c, S β S P A B P A c + P B c s P z j a j λ p s b j + P ζ j f j λ θ It is clear that z d N, σ C ww S, S, as Hece, there exists a ite costat k such that Ez j < k for j =,, s Next, we have by assumptio a β S Σ ww S, S Σ uu S, S β S, as, b Σ ww S, S sig β S, as Now usig the assumptio λ = o, we get P A c s + o zj P k < k a j + o s = o exp c, Φ s a j + o where we used the boud for the Gaussia tail probability Φt < t exp /t S54
6 S6 Ø Sørese, A Frigessi, M Thorese Next, we ote that ζ d N, σ Σ ww S c, S c Σ ww S c, S Σ ww S, S Σ ww S, S c, as Next, we cosider f, ad ote that the limitig distributio of C wu = C uu +C xu as is ormal with mea Σ wu = Σ uu ad ite variaces Aderso 3, Th 344 I additio, C ww Σ ww as Thus, applyig Slutsky's theorem to the product of the matrices, the limitig distributio of Cww S c, S C ww S, S C wu S, S C wu S c, S as has mea Σww S c, S Σ ww S, S Σ wu S, S Σ wu S c, S = ad ite variaces The latter term equals zero by the MEC Now f = Cww S c, S C ww S, S C wu S, S C wu S c, S β S is a vector i R p s whose elemets are liear combiatios of variables whose limitig distributios as are ormal with mea zero ad ite variaces Accordigly, the limitig distributio of f as is ormal with mea zero ad ite variaces So agai there exists a ite costat k such that Eζ j f j < k for j =,, p s Thus, whe λ c/ for c [,, we have P B c p s + o ζj f j P < λ θ k k p s = o exp c It follows that P A B = oexp c λ Φ θ k S6 Proof of Propositio 3 We cosider { ow the Lasso with ɛ = I this case, y = Xβ, ad the Lasso becomes ˆβ = argmi Wβ Xβ } + λ β We follow the proof of Bühlma ad va de β Geer, Th 7, but also take measuremet error ito accout Part The KKT coditios take the form C ww S, S ˆβ S β S + C ww S, S c ˆβ S c + C wu S, S β S = λˆτ S S6
7 Measuremet Error i Lasso S7 C ww S c, S ˆβ S β S + C ww S c, S c ˆβ S c + C wu S c, S β S = λˆτ S c, S6 where ˆτ = ˆτ S, ˆτ S c has the properties ˆτ ad ˆτ j = sig ˆβ j if β j We multiply S6 by ˆβ S C wws, c S c C ww S, S ad S6 by ˆβ S, ad the subtract c the rst from the secod, to get ˆβ S c Cww S, c S c C ww S, c S C ww S, S C ww S, S c ˆβS + c ˆβ S c Cwu S, c S C ww S, c S C ww S, S C wu S, S β S = λ ˆβ S C wws, c S c C ww S, S ˆτ S ˆβ S ˆτ c S c S63 The matrix term withi the paratheses i the leftmost term is positive semideite, sice it is the Schur complemet of the positive semideite matrix C ww, i which the part C ww S, S is positive deite, sice s < Next, the term withi the paratheses o the right-had side is ˆβ S C ww S, c S C ww S, S ˆτ S ˆβ S C ww S, c S C ww S, S ˆτ S ˆβS c The last iequality follows from the IC-ME, ad is strict wheever ˆβ S c Fially, the secod term o the left-had side of S63 is zero by assumptio Thus, if ˆβ S c, the left-had side of S63 must be egative, which is a cotradictio We thus coclude that ˆβ S c =, ad the KKT coditios S6 ad S6 reduce to C ww S, S ˆβS β S + C wu S, S β S = λˆτ S S64 C ww S, c S ˆβS β S + C wu S, c S β S = λˆτ S c, S65 From S64 we get ˆβ S β λ S = C S S, S ˆτ S + C ww S, S C wu S, S β S λ sup Cww S, S τ S + Cww S, S C wu S, S β S τ S66 Now, if j S det ad ˆβ j =, the ˆβ j βj = β j λ > + sup τ Cww S, S τ S + v j, where v = v,, v p = C ww S, S C wu S, S β S, cotradictig S66 Thus, ˆβ j for j S det
8 S8 Ø Sørese, A Frigessi, M Thorese Part We start by assumig sigˆβ = sigβ Thus, the KKT coditios are S64 ad S65 From S64 we get ˆβ S β S = λ C wws, S ˆτ S C ww S, S C wu S, S β S Isertig this ito S65 yields C ww S, c S C ww S, S ˆτ S + Cww S c λ, S C ww S, S C wu S, S C wu S, c S β S = ˆτ S c, ad the ecessary coditio stated i Propositio 3 follows by deitio S7 Proof of Theorem Startig from the KKT coditios of Lemma, we will redo the steps of the proof of Theorem, but with the isertio of extra terms represetig the correctio for measuremet error The corrected lasso is ot i geeral covex, ad our aalysis will thus cocer ay critical poit ˆγ = ˆβ β i the iterior of the feasible set {γ : γ + β < R} If ˆγ exists, ad W S ɛ + C ww S, S Σ uu S, S ˆγ S + Cwu S, S Σ uu S, S β S = λ sig β S, ˆγ S < β S, W S c ɛ + C ww S c, S Σ uu S, c S ˆγ S + Cwu S c, S Σ uu S c, S β S λ, S7 S7 S73 the sigˆβ S = sigβ S ad sigˆβ S = Evet A i Theorem implies the existece of ˆγ S < β S such that Z6 Z 7 β S = ˆγ S λ C ww S, S Σ uu S, S sig β S But the there must exist ˆγ S < β S such that Z 6 Z 7 β S = ˆγ S λ C ww S, S Σ uu S, S sig β S
9 Measuremet Error i Lasso S9 Multiplyig through by C ww S, S Σ uu S, S ad reorgaizig terms, we get S7 Thus, A esures that S7 ad S7 are satised Next, addig ad subtractig Cww S, c S Σ uu S, c S ˆγ S to the left-had side of evet B ad the usig the triagle iequality, yields Sc W ɛ + C wu S c, S Σ uu S, c S ˆγ S + C wu S, c S Σ uu S, c S β S C ww S, c S Σ uu S, c S C ww S, S Σ uu S, S W S + Cww S, c S Σ uu S, c S S ww S, S Σ uu S, S C wu S, S Σ uu S, S β S + S ww S c, S Σ uu S c, S ˆγ S λ θ The secod term o the left-had side of this expressio is the left-had side of S7 multiplied by C ww S, c S Σ uu S, c S C ww S, S Σ uu S, S It ca thus be replaced by the right-had side of S7 multiplied by this factor This yields Sc W ɛ + C wu S c, S Σ uu S, c S ˆγ S + C wu S, c S Σ uu S, c S β S λ λ C ww S c, S Σ uu S c, S C ww S, S Σ uu S, S sig β S θ, which implies, due to the IC-CL, W S c ɛ + C wu S c, S Σ uu S, c S ˆγ S + C wu S c, S Σ uu S c, S β S λ This is ideed S73 Altogether, A implies S7 ad S7, while B A implies S73 For the asymptotic result, dee the vectors z = C ww S, S Σ uu S, S W S ɛ, a = β S C ww S, S Σ uu S, S C wu S, S Σ uu S, S β S, b = C ww S, S Σ uu S, S sig β S, ζ = C ww S, c S Σ uu S, c S C ww S, S Σ uu S, S W S W S c ɛ,
10 S Ø Sørese, A Frigessi, M Thorese f = We have C ww S c, S Σ uu S c, S C ww S, S Σ uu S, S C wu S, S Σ uu S, S C wu S c, S Σ uu S c, S P A B P A c + P B c s P z j a j λ p s b j + P It is clear that z d N β S ζ j f j λ θ, σ Σ xx S, S Σ ww S, S Σ xx S, S, as Hece, there exists a ite costat k such that Ez j < k for j =,, s Next, we have by assumptio a β S, as Now usig the assumptio λ = o, we get P A c s + o zj P k < k a j + o s = o exp c, where we used the boud S54 Next, we ote that Φ s a j + o ζ d N, σ Σ xx S c, S c Σ xx S c, S Σ xx S, S Σ xx S, S c, as Next, we cosider f, ad ote that the limitig distributio of C wu Σ uu, as, is ormal with mea ad ite variaces Aderso 3, Th 344 I additio, C ww Σ uu Σ xx, as Thus, applyig Slutsky's theorem to the product of the matrices, the limitig distributio of C ww S c, S Σ uu S c, S C ww S, S Σ uu S, S C wu S, S Σ uu S, S C wu S, c S Σ uu S, c S, as, is ormal with mea ad ite variaces Now, f = C ww S c, S Σ uu S c, S C ww S, S Σ uu S, S C wu S, S Σ uu S, S C wu S c, S Σ uu S c, S β S
11 Measuremet Error i Lasso S is a vector i R p s whose elemets are liear combiatios of variables whose limitig distributios as are ormal with mea zero ad ite variaces Accordigly, the limitig distributio of f as is ormal with mea zero ad ite variaces So agai there exists a ite costat k such that Eζ j f j < k for j =,, p s Thus, whe λ c/ for c [,, we have P B c p s + o ζj f j P < λ θ k k p s = o exp c It follows that P A B = oexp c λ Φ θ k Additioal Refereces Aderso, T W 3 A Itroductio to Multivariate Statistical Aalysis, Third Editio Joh Wiley ad Sos, Hoboke
Optimally Sparse SVMs
A. Proof of Lemma 3. We here prove a lower boud o the umber of support vectors to achieve geeralizatio bouds of the form which we cosider. Importatly, this result holds ot oly for liear classifiers, but
More informationSummary and Discussion on Simultaneous Analysis of Lasso and Dantzig Selector
Summary ad Discussio o Simultaeous Aalysis of Lasso ad Datzig Selector STAT732, Sprig 28 Duzhe Wag May 4, 28 Abstract This is a discussio o the work i Bickel, Ritov ad Tsybakov (29). We begi with a short
More informationMath 451: Euclidean and Non-Euclidean Geometry MWF 3pm, Gasson 204 Homework 3 Solutions
Math 451: Euclidea ad No-Euclidea Geometry MWF 3pm, Gasso 204 Homework 3 Solutios Exercises from 1.4 ad 1.5 of the otes: 4.3, 4.10, 4.12, 4.14, 4.15, 5.3, 5.4, 5.5 Exercise 4.3. Explai why Hp, q) = {x
More informationConvergence of random variables. (telegram style notes) P.J.C. Spreij
Covergece of radom variables (telegram style otes).j.c. Spreij this versio: September 6, 2005 Itroductio As we kow, radom variables are by defiitio measurable fuctios o some uderlyig measurable space
More informationSupplemental Material: Proofs
Proof to Theorem Supplemetal Material: Proofs Proof. Let be the miimal umber of traiig items to esure a uique solutio θ. First cosider the case. It happes if ad oly if θ ad Rak(A) d, which is a special
More informationLinear Regression Demystified
Liear Regressio Demystified Liear regressio is a importat subject i statistics. I elemetary statistics courses, formulae related to liear regressio are ofte stated without derivatio. This ote iteds to
More informationA 2nTH ORDER LINEAR DIFFERENCE EQUATION
A 2TH ORDER LINEAR DIFFERENCE EQUATION Doug Aderso Departmet of Mathematics ad Computer Sciece, Cocordia College Moorhead, MN 56562, USA ABSTRACT: We give a formulatio of geeralized zeros ad (, )-discojugacy
More informationSequences and Series of Functions
Chapter 6 Sequeces ad Series of Fuctios 6.1. Covergece of a Sequece of Fuctios Poitwise Covergece. Defiitio 6.1. Let, for each N, fuctio f : A R be defied. If, for each x A, the sequece (f (x)) coverges
More informationMASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS
MASSACHUSTTS INSTITUT OF TCHNOLOGY 6.436J/5.085J Fall 2008 Lecture 9 /7/2008 LAWS OF LARG NUMBRS II Cotets. The strog law of large umbers 2. The Cheroff boud TH STRONG LAW OF LARG NUMBRS While the weak
More informationTechnical Proofs for Homogeneity Pursuit
Techical Proofs for Homogeeity Pursuit bstract This is the supplemetal material for the article Homogeeity Pursuit, submitted for publicatio i Joural of the merica Statistical ssociatio. B Proofs B. Proof
More informationECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors
ECONOMETRIC THEORY MODULE XIII Lecture - 34 Asymptotic Theory ad Stochastic Regressors Dr. Shalabh Departmet of Mathematics ad Statistics Idia Istitute of Techology Kapur Asymptotic theory The asymptotic
More informationMASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 21 11/27/2013
MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 21 11/27/2013 Fuctioal Law of Large Numbers. Costructio of the Wieer Measure Cotet. 1. Additioal techical results o weak covergece
More informationMASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 3 9/11/2013. Large deviations Theory. Cramér s Theorem
MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/5.070J Fall 203 Lecture 3 9//203 Large deviatios Theory. Cramér s Theorem Cotet.. Cramér s Theorem. 2. Rate fuctio ad properties. 3. Chage of measure techique.
More informationChapter 3. Strong convergence. 3.1 Definition of almost sure convergence
Chapter 3 Strog covergece As poited out i the Chapter 2, there are multiple ways to defie the otio of covergece of a sequece of radom variables. That chapter defied covergece i probability, covergece i
More informationDefinitions and Theorems. where x are the decision variables. c, b, and a are constant coefficients.
Defiitios ad Theorems Remember the scalar form of the liear programmig problem, Miimize, Subject to, f(x) = c i x i a 1i x i = b 1 a mi x i = b m x i 0 i = 1,2,, where x are the decisio variables. c, b,
More informationNotes 19 : Martingale CLT
Notes 9 : Martigale CLT Math 733-734: Theory of Probability Lecturer: Sebastie Roch Refereces: [Bil95, Chapter 35], [Roc, Chapter 3]. Sice we have ot ecoutered weak covergece i some time, we first recall
More informationEconomics 241B Relation to Method of Moments and Maximum Likelihood OLSE as a Maximum Likelihood Estimator
Ecoomics 24B Relatio to Method of Momets ad Maximum Likelihood OLSE as a Maximum Likelihood Estimator Uder Assumptio 5 we have speci ed the distributio of the error, so we ca estimate the model parameters
More information7.1 Convergence of sequences of random variables
Chapter 7 Limit Theorems Throughout this sectio we will assume a probability space (, F, P), i which is defied a ifiite sequece of radom variables (X ) ad a radom variable X. The fact that for every ifiite
More informationAnalytic Continuation
Aalytic Cotiuatio The stadard example of this is give by Example Let h (z) = 1 + z + z 2 + z 3 +... kow to coverge oly for z < 1. I fact h (z) = 1/ (1 z) for such z. Yet H (z) = 1/ (1 z) is defied for
More informationLinear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d
Liear regressio Daiel Hsu (COMS 477) Maximum likelihood estimatio Oe of the simplest liear regressio models is the followig: (X, Y ),..., (X, Y ), (X, Y ) are iid radom pairs takig values i R d R, ad Y
More informationPrecise Rates in Complete Moment Convergence for Negatively Associated Sequences
Commuicatios of the Korea Statistical Society 29, Vol. 16, No. 5, 841 849 Precise Rates i Complete Momet Covergece for Negatively Associated Sequeces Dae-Hee Ryu 1,a a Departmet of Computer Sciece, ChugWoo
More informationAssignment 5: Solutions
McGill Uiversity Departmet of Mathematics ad Statistics MATH 54 Aalysis, Fall 05 Assigmet 5: Solutios. Let y be a ubouded sequece of positive umbers satisfyig y + > y for all N. Let x be aother sequece
More information( ) (( ) ) ANSWERS TO EXERCISES IN APPENDIX B. Section B.1 VECTORS AND SETS. Exercise B.1-1: Convex sets. are convex, , hence. and. (a) Let.
Joh Riley 8 Jue 03 ANSWERS TO EXERCISES IN APPENDIX B Sectio B VECTORS AND SETS Exercise B-: Covex sets (a) Let 0 x, x X, X, hece 0 x, x X ad 0 x, x X Sice X ad X are covex, x X ad x X The x X X, which
More informationMath 299 Supplement: Real Analysis Nov 2013
Math 299 Supplemet: Real Aalysis Nov 203 Algebra Axioms. I Real Aalysis, we work withi the axiomatic system of real umbers: the set R alog with the additio ad multiplicatio operatios +,, ad the iequality
More informationProblem Set 2 Solutions
CS271 Radomess & Computatio, Sprig 2018 Problem Set 2 Solutios Poit totals are i the margi; the maximum total umber of poits was 52. 1. Probabilistic method for domiatig sets 6pts Pick a radom subset S
More informationProduct measures, Tonelli s and Fubini s theorems For use in MAT3400/4400, autumn 2014 Nadia S. Larsen. Version of 13 October 2014.
Product measures, Toelli s ad Fubii s theorems For use i MAT3400/4400, autum 2014 Nadia S. Larse Versio of 13 October 2014. 1. Costructio of the product measure The purpose of these otes is to preset the
More informationOn Nonsingularity of Saddle Point Matrices. with Vectors of Ones
Iteratioal Joural of Algebra, Vol. 2, 2008, o. 4, 197-204 O Nosigularity of Saddle Poit Matrices with Vectors of Oes Tadeusz Ostrowski Istitute of Maagemet The State Vocatioal Uiversity -400 Gorzów, Polad
More informationAppendix to Quicksort Asymptotics
Appedix to Quicksort Asymptotics James Alle Fill Departmet of Mathematical Scieces The Johs Hopkis Uiversity jimfill@jhu.edu ad http://www.mts.jhu.edu/~fill/ ad Svate Jaso Departmet of Mathematics Uppsala
More informationSequences and Limits
Chapter Sequeces ad Limits Let { a } be a sequece of real or complex umbers A ecessary ad sufficiet coditio for the sequece to coverge is that for ay ɛ > 0 there exists a iteger N > 0 such that a p a q
More informationExercise 4.3 Use the Continuity Theorem to prove the Cramér-Wold Theorem, Theorem. (1) φ a X(1).
Assigmet 7 Exercise 4.3 Use the Cotiuity Theorem to prove the Cramér-Wold Theorem, Theorem 4.12. Hit: a X d a X implies that φ a X (1) φ a X(1). Sketch of solutio: As we poited out i class, the oly tricky
More informationHomework Set #3 - Solutions
EE 15 - Applicatios of Covex Optimizatio i Sigal Processig ad Commuicatios Dr. Adre Tkaceko JPL Third Term 11-1 Homework Set #3 - Solutios 1. a) Note that x is closer to x tha to x l i the Euclidea orm
More information7.1 Convergence of sequences of random variables
Chapter 7 Limit theorems Throughout this sectio we will assume a probability space (Ω, F, P), i which is defied a ifiite sequece of radom variables (X ) ad a radom variable X. The fact that for every ifiite
More informationLecture 19: Convergence
Lecture 19: Covergece Asymptotic approach I statistical aalysis or iferece, a key to the success of fidig a good procedure is beig able to fid some momets ad/or distributios of various statistics. I may
More informationMath Solutions to homework 6
Math 175 - Solutios to homework 6 Cédric De Groote November 16, 2017 Problem 1 (8.11 i the book): Let K be a compact Hermitia operator o a Hilbert space H ad let the kerel of K be {0}. Show that there
More informationLecture 10 October Minimaxity and least favorable prior sequences
STATS 300A: Theory of Statistics Fall 205 Lecture 0 October 22 Lecturer: Lester Mackey Scribe: Brya He, Rahul Makhijai Warig: These otes may cotai factual ad/or typographic errors. 0. Miimaxity ad least
More informationECE 901 Lecture 12: Complexity Regularization and the Squared Loss
ECE 90 Lecture : Complexity Regularizatio ad the Squared Loss R. Nowak 5/7/009 I the previous lectures we made use of the Cheroff/Hoeffdig bouds for our aalysis of classifier errors. Hoeffdig s iequality
More informationarxiv: v1 [math.pr] 13 Oct 2011
A tail iequality for quadratic forms of subgaussia radom vectors Daiel Hsu, Sham M. Kakade,, ad Tog Zhag 3 arxiv:0.84v math.pr] 3 Oct 0 Microsoft Research New Eglad Departmet of Statistics, Wharto School,
More informationNotes 27 : Brownian motion: path properties
Notes 27 : Browia motio: path properties Math 733-734: Theory of Probability Lecturer: Sebastie Roch Refereces:[Dur10, Sectio 8.1], [MP10, Sectio 1.1, 1.2, 1.3]. Recall: DEF 27.1 (Covariace) Let X = (X
More information6.3 Testing Series With Positive Terms
6.3. TESTING SERIES WITH POSITIVE TERMS 307 6.3 Testig Series With Positive Terms 6.3. Review of what is kow up to ow I theory, testig a series a i for covergece amouts to fidig the i= sequece of partial
More informationACO Comprehensive Exam 9 October 2007 Student code A. 1. Graph Theory
1. Graph Theory Prove that there exist o simple plaar triagulatio T ad two distict adjacet vertices x, y V (T ) such that x ad y are the oly vertices of T of odd degree. Do ot use the Four-Color Theorem.
More informationSTAT Homework 1 - Solutions
STAT-36700 Homework 1 - Solutios Fall 018 September 11, 018 This cotais solutios for Homework 1. Please ote that we have icluded several additioal commets ad approaches to the problems to give you better
More informationRademacher Complexity
EECS 598: Statistical Learig Theory, Witer 204 Topic 0 Rademacher Complexity Lecturer: Clayto Scott Scribe: Ya Deg, Kevi Moo Disclaimer: These otes have ot bee subjected to the usual scrutiy reserved for
More informationMcGill University Math 354: Honors Analysis 3 Fall 2012 Solutions to selected problems
McGill Uiversity Math 354: Hoors Aalysis 3 Fall 212 Assigmet 3 Solutios to selected problems Problem 1. Lipschitz fuctios. Let Lip K be the set of all fuctios cotiuous fuctios o [, 1] satisfyig a Lipschitz
More informationMachine Learning for Data Science (CS 4786)
Machie Learig for Data Sciece CS 4786) Lecture & 3: Pricipal Compoet Aalysis The text i black outlies high level ideas. The text i blue provides simple mathematical details to derive or get to the algorithm
More informationMASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 2 9/9/2013. Large Deviations for i.i.d. Random Variables
MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 2 9/9/2013 Large Deviatios for i.i.d. Radom Variables Cotet. Cheroff boud usig expoetial momet geeratig fuctios. Properties of a momet
More informationTime series models 2007
Norwegia Uiversity of Sciece ad Techology Departmet of Mathematical Scieces Solutios to problem sheet 1, 2007 Exercise 1.1 a Let Sc = E[Y c 2 ]. The This gives Sc = EY 2 2cEY + c 2 ds dc = 2EY + 2c = 0
More informationLecture 20: Multivariate convergence and the Central Limit Theorem
Lecture 20: Multivariate covergece ad the Cetral Limit Theorem Covergece i distributio for radom vectors Let Z,Z 1,Z 2,... be radom vectors o R k. If the cdf of Z is cotiuous, the we ca defie covergece
More information2.2. Central limit theorem.
36.. Cetral limit theorem. The most ideal case of the CLT is that the radom variables are iid with fiite variace. Although it is a special case of the more geeral Lideberg-Feller CLT, it is most stadard
More informationDetailed proofs of Propositions 3.1 and 3.2
Detailed proofs of Propositios 3. ad 3. Proof of Propositio 3. NB: itegratio sets are geerally omitted for itegrals defied over a uit hypercube [0, s with ay s d. We first give four lemmas. The proof of
More informationAPPENDIX A SMO ALGORITHM
AENDIX A SMO ALGORITHM Sequetial Miimal Optimizatio SMO) is a simple algorithm that ca quickly solve the SVM Q problem without ay extra matrix storage ad without usig time-cosumig umerical Q optimizatio
More information62. Power series Definition 16. (Power series) Given a sequence {c n }, the series. c n x n = c 0 + c 1 x + c 2 x 2 + c 3 x 3 +
62. Power series Defiitio 16. (Power series) Give a sequece {c }, the series c x = c 0 + c 1 x + c 2 x 2 + c 3 x 3 + is called a power series i the variable x. The umbers c are called the coefficiets of
More informationMINIMAX RATES OF CONVERGENCE AND OPTIMALITY OF BAYES FACTOR WAVELET REGRESSION ESTIMATORS UNDER POINTWISE RISKS
Statistica Siica 19 (2009) 1389 1406: Supplemet S1 S17 1 MINIMAX RATES OF ONVERGENE AND OPTIMALITY OF BAYES FATOR WAVELET REGRESSION ESTIMATORS UNDER POINTWISE RISKS Natalia Bochkia ad Theofais Sapatias
More informationMa 4121: Introduction to Lebesgue Integration Solutions to Homework Assignment 5
Ma 42: Itroductio to Lebesgue Itegratio Solutios to Homework Assigmet 5 Prof. Wickerhauser Due Thursday, April th, 23 Please retur your solutios to the istructor by the ed of class o the due date. You
More informationj=1 dz Res(f, z j ) = 1 d k 1 dz k 1 (z c)k f(z) Res(f, c) = lim z c (k 1)! Res g, c = f(c) g (c)
Problem. Compute the itegrals C r d for Z, where C r = ad r >. Recall that C r has the couter-clockwise orietatio. Solutio: We will use the idue Theorem to solve this oe. We could istead use other (perhaps
More informationChapter 7 Isoperimetric problem
Chapter 7 Isoperimetric problem Recall that the isoperimetric problem (see the itroductio its coectio with ido s proble) is oe of the most classical problem of a shape optimizatio. It ca be formulated
More informationMetric Space Properties
Metric Space Properties Math 40 Fial Project Preseted by: Michael Brow, Alex Cordova, ad Alyssa Sachez We have already poited out ad will recogize throughout this book the importace of compact sets. All
More informationLecture 3: August 31
36-705: Itermediate Statistics Fall 018 Lecturer: Siva Balakrisha Lecture 3: August 31 This lecture will be mostly a summary of other useful expoetial tail bouds We will ot prove ay of these i lecture,
More informationThe natural exponential function
The atural expoetial fuctio Attila Máté Brookly College of the City Uiversity of New York December, 205 Cotets The atural expoetial fuctio for real x. Beroulli s iequality.....................................2
More informationAsymptotic distribution of products of sums of independent random variables
Proc. Idia Acad. Sci. Math. Sci. Vol. 3, No., May 03, pp. 83 9. c Idia Academy of Scieces Asymptotic distributio of products of sums of idepedet radom variables YANLING WANG, SUXIA YAO ad HONGXIA DU ollege
More informationStatistical and Mathematical Methods DS-GA 1002 December 8, Sample Final Problems Solutions
Statistical ad Mathematical Methods DS-GA 00 December 8, 05. Short questios Sample Fial Problems Solutios a. Ax b has a solutio if b is i the rage of A. The dimesio of the rage of A is because A has liearly-idepedet
More informationSolution to Chapter 2 Analytical Exercises
Nov. 25, 23, Revised Dec. 27, 23 Hayashi Ecoometrics Solutio to Chapter 2 Aalytical Exercises. For ay ε >, So, plim z =. O the other had, which meas that lim E(z =. 2. As show i the hit, Prob( z > ε =
More informationWeb-based Supplementary Materials for A Modified Partial Likelihood Score Method for Cox Regression with Covariate Error Under the Internal
Web-based Supplemetary Materials for A Modified Partial Likelihood Score Method for Cox Regressio with Covariate Error Uder the Iteral Validatio Desig by David M. Zucker, Xi Zhou, Xiaomei Liao, Yi Li,
More informationRandom Matrices with Blocks of Intermediate Scale Strongly Correlated Band Matrices
Radom Matrices with Blocks of Itermediate Scale Strogly Correlated Bad Matrices Jiayi Tog Advisor: Dr. Todd Kemp May 30, 07 Departmet of Mathematics Uiversity of Califoria, Sa Diego Cotets Itroductio Notatio
More informationThe standard deviation of the mean
Physics 6C Fall 20 The stadard deviatio of the mea These otes provide some clarificatio o the distictio betwee the stadard deviatio ad the stadard deviatio of the mea.. The sample mea ad variace Cosider
More informationZ ß cos x + si x R du We start with the substitutio u = si(x), so du = cos(x). The itegral becomes but +u we should chage the limits to go with the ew
Problem ( poits) Evaluate the itegrals Z p x 9 x We ca draw a right triagle labeled this way x p x 9 From this we ca read off x = sec, so = sec ta, ad p x 9 = R ta. Puttig those pieces ito the itegralrwe
More informationMAT1026 Calculus II Basic Convergence Tests for Series
MAT026 Calculus II Basic Covergece Tests for Series Egi MERMUT 202.03.08 Dokuz Eylül Uiversity Faculty of Sciece Departmet of Mathematics İzmir/TURKEY Cotets Mootoe Covergece Theorem 2 2 Series of Real
More informationDefinition 4.2. (a) A sequence {x n } in a Banach space X is a basis for X if. unique scalars a n (x) such that x = n. a n (x) x n. (4.
4. BASES I BAACH SPACES 39 4. BASES I BAACH SPACES Sice a Baach space X is a vector space, it must possess a Hamel, or vector space, basis, i.e., a subset {x γ } γ Γ whose fiite liear spa is all of X ad
More informationAlgebra of Least Squares
October 19, 2018 Algebra of Least Squares Geometry of Least Squares Recall that out data is like a table [Y X] where Y collects observatios o the depedet variable Y ad X collects observatios o the k-dimesioal
More informationANSWERS TO MIDTERM EXAM # 2
MATH 03, FALL 003 ANSWERS TO MIDTERM EXAM # PENN STATE UNIVERSITY Problem 1 (18 pts). State ad prove the Itermediate Value Theorem. Solutio See class otes or Theorem 5.6.1 from our textbook. Problem (18
More informationPRELIM PROBLEM SOLUTIONS
PRELIM PROBLEM SOLUTIONS THE GRAD STUDENTS + KEN Cotets. Complex Aalysis Practice Problems 2. 2. Real Aalysis Practice Problems 2. 4 3. Algebra Practice Problems 2. 8. Complex Aalysis Practice Problems
More information1. By using truth tables prove that, for all statements P and Q, the statement
Author: Satiago Salazar Problems I: Mathematical Statemets ad Proofs. By usig truth tables prove that, for all statemets P ad Q, the statemet P Q ad its cotrapositive ot Q (ot P) are equivalet. I example.2.3
More information1. ARITHMETIC OPERATIONS IN OBSERVER'S MATHEMATICS
1. ARITHMETIC OPERATIONS IN OBSERVER'S MATHEMATICS We cosider a ite well-ordered system of observers, where each observer sees the real umbers as the set of all iite decimal fractios. The observers are
More informationStatistical Inference Based on Extremum Estimators
T. Rotheberg Fall, 2007 Statistical Iferece Based o Extremum Estimators Itroductio Suppose 0, the true value of a p-dimesioal parameter, is kow to lie i some subset S R p : Ofte we choose to estimate 0
More informationLecture 2 Long paths in random graphs
Lecture Log paths i radom graphs 1 Itroductio I this lecture we treat the appearace of log paths ad cycles i sparse radom graphs. will wor with the probability space G(, p) of biomial radom graphs, aalogous
More informationb i u x i U a i j u x i u x j
M ath 5 2 7 Fall 2 0 0 9 L ecture 1 9 N ov. 1 6, 2 0 0 9 ) S ecod- Order Elliptic Equatios: Weak S olutios 1. Defiitios. I this ad the followig two lectures we will study the boudary value problem Here
More informationBull. Korean Math. Soc. 36 (1999), No. 3, pp. 451{457 THE STRONG CONSISTENCY OF NONLINEAR REGRESSION QUANTILES ESTIMATORS Seung Hoe Choi and Hae Kyung
Bull. Korea Math. Soc. 36 (999), No. 3, pp. 45{457 THE STRONG CONSISTENCY OF NONLINEAR REGRESSION QUANTILES ESTIMATORS Abstract. This paper provides suciet coditios which esure the strog cosistecy of regressio
More informationCSE 527, Additional notes on MLE & EM
CSE 57 Lecture Notes: MLE & EM CSE 57, Additioal otes o MLE & EM Based o earlier otes by C. Grat & M. Narasimha Itroductio Last lecture we bega a examiatio of model based clusterig. This lecture will be
More informationSolutions to Tutorial 5 (Week 6)
The Uiversity of Sydey School of Mathematics ad Statistics Solutios to Tutorial 5 (Wee 6 MATH2962: Real ad Complex Aalysis (Advaced Semester, 207 Web Page: http://www.maths.usyd.edu.au/u/ug/im/math2962/
More informationAxioms of Measure Theory
MATH 532 Axioms of Measure Theory Dr. Neal, WKU I. The Space Throughout the course, we shall let X deote a geeric o-empty set. I geeral, we shall ot assume that ay algebraic structure exists o X so that
More informationComplex Analysis Spring 2001 Homework I Solution
Complex Aalysis Sprig 2001 Homework I Solutio 1. Coway, Chapter 1, sectio 3, problem 3. Describe the set of poits satisfyig the equatio z a z + a = 2c, where c > 0 ad a R. To begi, we see from the triagle
More informationProbability 2 - Notes 10. Lemma. If X is a random variable and g(x) 0 for all x in the support of f X, then P(g(X) 1) E[g(X)].
Probability 2 - Notes 0 Some Useful Iequalities. Lemma. If X is a radom variable ad g(x 0 for all x i the support of f X, the P(g(X E[g(X]. Proof. (cotiuous case P(g(X Corollaries x:g(x f X (xdx x:g(x
More informationEfficient GMM LECTURE 12 GMM II
DECEMBER 1 010 LECTURE 1 II Efficiet The estimator depeds o the choice of the weight matrix A. The efficiet estimator is the oe that has the smallest asymptotic variace amog all estimators defied by differet
More informationDimension-free PAC-Bayesian bounds for the estimation of the mean of a random vector
Dimesio-free PAC-Bayesia bouds for the estimatio of the mea of a radom vector Olivier Catoi CREST CNRS UMR 9194 Uiversité Paris Saclay olivier.catoi@esae.fr Ilaria Giulii Laboratoire de Probabilités et
More informationGeometry of LS. LECTURE 3 GEOMETRY OF LS, PROPERTIES OF σ 2, PARTITIONED REGRESSION, GOODNESS OF FIT
OCTOBER 7, 2016 LECTURE 3 GEOMETRY OF LS, PROPERTIES OF σ 2, PARTITIONED REGRESSION, GOODNESS OF FIT Geometry of LS We ca thik of y ad the colums of X as members of the -dimesioal Euclidea space R Oe ca
More informationEntropy Rates and Asymptotic Equipartition
Chapter 29 Etropy Rates ad Asymptotic Equipartitio Sectio 29. itroduces the etropy rate the asymptotic etropy per time-step of a stochastic process ad shows that it is well-defied; ad similarly for iformatio,
More informationki, X(n) lj (n) = (ϱ (n) ij ) 1 i,j d.
APPLICATIONES MATHEMATICAE 22,2 (1994), pp. 193 200 M. WIŚNIEWSKI (Kielce) EXTREME ORDER STATISTICS IN AN EQUALLY CORRELATED GAUSSIAN ARRAY Abstract. This paper cotais the results cocerig the wea covergece
More informationAdvanced Analysis. Min Yan Department of Mathematics Hong Kong University of Science and Technology
Advaced Aalysis Mi Ya Departmet of Mathematics Hog Kog Uiversity of Sciece ad Techology September 3, 009 Cotets Limit ad Cotiuity 7 Limit of Sequece 8 Defiitio 8 Property 3 3 Ifiity ad Ifiitesimal 8 4
More information17. Joint distributions of extreme order statistics Lehmann 5.1; Ferguson 15
17. Joit distributios of extreme order statistics Lehma 5.1; Ferguso 15 I Example 10., we derived the asymptotic distributio of the maximum from a radom sample from a uiform distributio. We did this usig
More informationRates of Convergence by Moduli of Continuity
Rates of Covergece by Moduli of Cotiuity Joh Duchi: Notes for Statistics 300b March, 017 1 Itroductio I this ote, we give a presetatio showig the importace, ad relatioship betwee, the modulis of cotiuity
More informationBeurling Integers: Part 2
Beurlig Itegers: Part 2 Isomorphisms Devi Platt July 11, 2015 1 Prime Factorizatio Sequeces I the last article we itroduced the Beurlig geeralized itegers, which ca be represeted as a sequece of real umbers
More informationPoincaré Problem for Nonlinear Elliptic Equations of Second Order in Unbounded Domains
Advaces i Pure Mathematics 23 3 72-77 http://dxdoiorg/4236/apm233a24 Published Olie Jauary 23 (http://wwwscirporg/oural/apm) Poicaré Problem for Noliear Elliptic Equatios of Secod Order i Ubouded Domais
More information(A sequence also can be thought of as the list of function values attained for a function f :ℵ X, where f (n) = x n for n 1.) x 1 x N +k x N +4 x 3
MATH 337 Sequeces Dr. Neal, WKU Let X be a metric space with distace fuctio d. We shall defie the geeral cocept of sequece ad limit i a metric space, the apply the results i particular to some special
More informationM17 MAT25-21 HOMEWORK 5 SOLUTIONS
M17 MAT5-1 HOMEWORK 5 SOLUTIONS 1. To Had I Cauchy Codesatio Test. Exercise 1: Applicatio of the Cauchy Codesatio Test Use the Cauchy Codesatio Test to prove that 1 diverges. Solutio 1. Give the series
More informationSolution. 1 Solutions of Homework 1. Sangchul Lee. October 27, Problem 1.1
Solutio Sagchul Lee October 7, 017 1 Solutios of Homework 1 Problem 1.1 Let Ω,F,P) be a probability space. Show that if {A : N} F such that A := lim A exists, the PA) = lim PA ). Proof. Usig the cotiuity
More informationRandomized Algorithms I, Spring 2018, Department of Computer Science, University of Helsinki Homework 1: Solutions (Discussed January 25, 2018)
Radomized Algorithms I, Sprig 08, Departmet of Computer Sciece, Uiversity of Helsiki Homework : Solutios Discussed Jauary 5, 08). Exercise.: Cosider the followig balls-ad-bi game. We start with oe black
More informationsin(n) + 2 cos(2n) n 3/2 3 sin(n) 2cos(2n) n 3/2 a n =
60. Ratio ad root tests 60.1. Absolutely coverget series. Defiitio 13. (Absolute covergece) A series a is called absolutely coverget if the series of absolute values a is coverget. The absolute covergece
More informationThis exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam.
Probability ad Statistics FS 07 Secod Sessio Exam 09.0.08 Time Limit: 80 Miutes Name: Studet ID: This exam cotais 9 pages (icludig this cover page) ad 0 questios. A Formulae sheet is provided with the
More informationStochastic Matrices in a Finite Field
Stochastic Matrices i a Fiite Field Abstract: I this project we will explore the properties of stochastic matrices i both the real ad the fiite fields. We first explore what properties 2 2 stochastic matrices
More information10-701/ Machine Learning Mid-term Exam Solution
0-70/5-78 Machie Learig Mid-term Exam Solutio Your Name: Your Adrew ID: True or False (Give oe setece explaatio) (20%). (F) For a cotiuous radom variable x ad its probability distributio fuctio p(x), it
More informationMath 61CM - Solutions to homework 3
Math 6CM - Solutios to homework 3 Cédric De Groote October 2 th, 208 Problem : Let F be a field, m 0 a fixed oegative iteger ad let V = {a 0 + a x + + a m x m a 0,, a m F} be the vector space cosistig
More information