Multivariate Least Weighted Squares (MLWS)
|
|
- Aleesha Hutchinson
- 5 years ago
- Views:
Transcription
1 () Stochastic Modelling in Economics and Finance 2 Supervisor : Prof. RNDr. Jan Ámos Víšek, CSc. Petr Jonáš 12 th March 2012
2 Contents
3 1 1 Introduction 2 3 Proof of consistency (80%) 4 Appendix (60%) 5 n consistency
4 2 Definition 1 Let us define the multivariate least weighted squares () estimator of the matrix of regression coefficients B as ˆB n,w = arg min (B,Σ) D n ( ) s 1 w d(s) 2 (B, Σ), (1) n s=1 where d (1) (B, Σ) d (2) (B, Σ) d (n) (B, Σ) is the ordered sequence of the residual Mahalanobis distances.
5 3 For any m N denote by Π m the set of all permutations of {1,..., m} and for any π Π n, π j stands for the j-th coordinate of the vector π. For any (B, Σ) D put π(b, Σ, i) = j {1,..., n} di 2 (B, Σ) = d(j) 2 (B, Σ) (2) and π(b, Σ) = (π(b, Σ, 1),..., π(b, Σ, n)). Now we can expess the equation (1) in the equivalent form ˆB n,w = arg min (B,Σ) D n ( ) π(b, Σ, i) 1 w di 2 (B, Σ), (3) n
6 4 For an arbitrary π Π n denote the function we want to minimize by MF (B, Σ, π) = n ( ) πi 1 w di 2 (B, Σ). (4) n To prove the existence of the solution of our minimization problem (1) we can follow the steps of the proof for LWS presented in Víšek (2008) and Víšek (2009). Remark For any ω 0 Ω the estimator is equal to the MWLS estimator with properly arranged weights.
7 5 The last remark gives us the way how to compute. We can minimize MF (ˆB n,π MWLS, Σ n,π MWLS, π) over the set of all permutations Π n. Asymptotic complexity of this algorithm is n!. Impossible to compute for n larger then ! = It is better to modify iterative algorithm for LWS presented in Mašíček (2004).
8 6 1 ˆB = 0, ˆΣ = I, ˆ MF = + 2 choose initial values B 1, Σ 1, j := 1 3 compute Mahalanobis distances d i (B j, Σ j ) 4 assign weights to observations (smaller MD = higher weight) 5 B j+1, Σ j+1 MWLS with weights computed in step 4 6 B j B j+1 ; j:=j+1 and step 3 7 if MF (B j, Σ j, w j ) < MF ˆ then ˆB := B j, ˆΣ = Σj, MF ˆ = MF (B j, Σ j, w j ) 8 repeat step 2 until terminal condition 9 return values ˆB, ˆΣ, ˆ MF
9 7 Previous algorithm is still too slow. Possible improvements: - choice of initial values - heuristic on inperspective
10 8 Choice of initial values Randomly choose m = 1000 subsets - H i of size h n/2, n/4 and compute ˆB = B LS (H i ) and ˆΣ = Σ LS (H i ). - H i of size h = p + q and compute the coefficients of the hyperplane through H i. If H i does not define a unique hyperplane extend H i by adding random observations until it does. ˆB =coefficients of this hyperplatne ˆΣ = i.
11
12 10 Heuristic on inperspective - perform 2 iteration steps with all m - choose a = 10 with lowest MFs - perform 30 iteration steps with a - choose one with lowest MF - iterate until convergence (or terminal condition)
13 We have to enlarge our notation ˆB n,w = arg min (B,Σ) D Let us denote n ( ) π(b, Σ, i) 1 w di 2 (B, Σ), n F B,Σ (r) = P ( ) Y 1 B X 1 < r (5) F (n) B,Σ (r) = 1 n n I (d i (B, Σ) < r) (6) and realize, that d i (B, Σ) and di 2 (B, Σ) are on the same position in the ordered sequences. 11
14 12 Therefore we can write ˆB n,w = arg min (B,Σ) D n w ( ) F (n) B,Σ (d i(b, Σ)) di 2 (B, Σ), (7) After some computation we can express the system of normal. NE Y,X,n (B) = n w ( ) F (n) B,Σ (d i(b, Σ)) X i (Y i B X i ) = 0 (8) In the one dimensional case, the main idea of proving consistency of LWS is to approximate empirical distribution funciton of absolute value of residuals by a continuous distribution function. We try to prove consistency in the simillar way. We shall need following assumptions
15 13 Condition C1 The weight function w : [0, 1] [0, 1] with w(0) = 1 is absolutely continuous and nonincreasing, with derivative w (α) bounded from below by L, (L > 0).
16 14 Condition C2 The sequence {(X i, U i ) } is a sequence of i.i.d. (p + q) dimensional random vectors distributed accordig to distribution function F X,U (x, u) = F X (x)f U (u). F U (u) is absolutely continuous with density f U (u) bounded by U e. Finally, there is a q > 1 so that E X 1 4q <. Symbol X = (tr(xx )) 1 2 denotes the Frobenius norm.
17 15 Now we want to prove, that all of normal are bounded. Denote ( ) B F B X (r) = P X i X < r (9) and the corresponding empirical distribution function F (n) B X (r) = 1 n n I i B ( B X i X i B ) < r (10) To prove the boundedness of, the following lemmas will be useful.
18 16 Lemma A1 (see Víšek (2009)) Under Condition C2 the distribution function F B X (r) is uniformly in r R, uniformly continuous in B, i.e for any δ > 0, there is ξ (0, 1) so that for any B (1) and B (2) such that B (1) B (2) < ξ sup F B (1),Σ (r) F B (2),Σ (r) δ r R sup F (B r R (1) ) X (r) F (B (2) ) X (r) δ
19 17 Lemma A2 (see Víšek (2009)) Let the Condition C2 hold. For any ɛ > 0 there is a constant K ɛ and n ɛ N so that for all n > n ɛ P([ω Ω : sup u R + (n) sup n F B,Σ (u) F B,Σ(u) < K ɛ ]) > 1 ɛ B R p q
20 18 Lemma A3 (see Víšek (2009)) Let the Condition C2 hold. For any ɛ > 0 there is a constant K ɛ and n ɛ N so that for all n > n ɛ P([ω Ω : sup u R + (n) sup n F (u) F B R p q B X B X (u) < K ɛ]) > 1 ɛ
21 19 For any λ R + and any a R put γ λ,a = sup F B X (a). (11) B =λ The surface of the ball {B R p q ; B = λ} is compact, there is B γ,a {B R p q ; B = λ} so that γ λ,a = F B γ,ax (a). (12) ( B For B ) = B. B 1 F B X (u) = P 2 X 1 < r = ) P ( B X1 2 B 2 < r B 2 = F B X ( r B ) It means, we can without loss of generality, consider γ 1,t.
22 20 Lemma 1 Under Conditions C1 a C2, there is a > 0 and b (0, 1) so that a.(b γ 1,a )w(b) > 0. (13)
23 21 Lemma 2 Let Conditions C1 and C2 be fulfilled. Then for any ɛ > 0 there is θ > 0, > 0 and n ɛ, N such that for any n > n ɛ, ( ) P ω Ω : inf 1 B θ n B NE Y,X,n (B) > > 1 ɛ
24 22 Proof 1 n B NE Y,X,n (B) = 1 n ( ) = n B w F (n) B,Σ (d i(b, Σ)) X i (Y i B X i ) 1 n ( ) w F (n) n B,Σ (d i(b, Σ)) B X i X i B 1 n ( ) w F (n) n B,Σ (d i(b, Σ)) B X i Y i
25 23 First of all, we shall pay attention to the quadratic part 1 n n w ( ) F (n) B,Σ (d i(b, Σ)) B X i X i B (14) and we will find a positive definite quadratic form which uniformly for B outside the ball of diameter 2 and with probability at least 1 ɛ is the lower bound for (14). This quadratic form is then for B with enough large norm (> θ), larger than the linear part in B.
26 24 Fix a > 0, b (0, 1) existenece of which was shown in Lemma 1 and Σ = 1. Further for any B denote the set of indices, for which F (n) B,Σ (d i(b, Σ)) < b by I b (B). We can easily verify that the edf overcomes b not later than at its [nb] + 1 jump. It means #I b (B) [nb] (15) i I b (B) implies w(f (n) B,Σ (d i(b, Σ))) w(b) (16) Lets denote I a (B) the set of indices, for which B X i X i B < a and estimate #I a (B).
27 25 Fix ɛ > 0, δ > 0 and put κ = a(b γ 1,a)w(b) 2 Then according to Lemma 1, κ > 0. Employing Lemma A3, find n 1 N so that for all n > n 1 we have ( P ω Ω : sup sup F (n) (u) F B R p q B u R X B X (u) and dentote the corresponding set by B (1) n (17) ) κ > 1 ɛ aw(b) 3 (18) F (n) B X (a) = #I a(b) n (19)
28 Equation (18) imply for any n > n 1 and ω B (1) n #I a (B) = nf (n) B X (a) n(f B X (a) + κ aw(b) ) n(γ 1,a + κ aw(b) ) (20) Equation (20) holds only for {B R p q ; B = 1} Consider ω B n (1), n 1 > n and put C n (B) = {i : F (n) B,Σ (d i(b, Σ)) < b and B X i X i B > a} Then (15) and (20) imply that the number of indices of C n (B) is at least #C n (B) #I b (B) #I a (B) = nb n(γ 1,a + κ aw(b) ) = (21) = n(b γ 1,a κ aw(b) ) 26
29 27 Now employing law of large numbers let us find n 2 N and B n (2), P(B n (2) ) > 1 ɛ 3 such that for any n > n 2, ω B n (2) and 1 B X i X i B #C n (B) a i #C n(b)
30 28 Now we have for any n > max(n 1, n 2 ) any ω B n (1) B n (2) and any B = 1 1 n n w ( ) F (n) B,Σ (d i(b, Σ)) B X i X i B 1 n > a(b γ 1,a )w(b) κ = κ i C n(b) w(b)b X i X Consider now any B R p q ; B = θ 1 and put B = θ 1 B i B B X i X i B = θ 2 B X i X i B (22) We have proved that for any n > max(n 1, n 2 ) any ω B n (1) B n (2) and any B R p q B = 1 #I a (B ) n(γ 1,a + κ aw(b) ) (23)
31 29 Recall that and for any i I b (B) #I b (B) > [nb] (24) w(f (n) B,Σ (d i(b, Σ))) w(b) (25) Now from (22), (23), (24) and (25) 1 n ( ) w F (n) n B,Σ (d i(b, Σ)) B X i X = θ 2 1 n n w i B = ( ) F (n) B,Σ (d i(b, Σ)) B X i X i B θ 2 (a(b γ 1,a )w(b) κ) = θ 2 κ (26)
32 30 So we have proved, that for any n = max(n 1, n 2 ) and ω B n (1) B n (2) and any B 1 1 n ( ) w F (n) n B,Σ (d i(b, Σ)) B X i X i B θ2 κ = B 2 κ (27) Now we shall consider the linear term 1 n ( ) w F (n) n B,Σ (d i(b, Σ)) B X i U i
33 31 Denote τ = E{ U 1 X 1 }. Then find n 3 N so that for any n > n 3, there is B n (3), P(B n (3) ) > 1 ɛ/3 and for any ω B n (3) 1 n ( ) w F (n) n B,Σ (d i(b, Σ)) B X i U i 1 n n B X i U i 2τ B (28)
34 32 Consider n = max(n 1, n 2, n 3 ) and ω B n (1) B n (2) B n (3). It follows P(B n ) > 1 ɛ and (26),(27) imply that for any B, B 1 and for κ we have defined 1 n B NE Y,X,n (B) > B 2.κ 2τ B Then for any > 0 there is a θ > > 0 such that for any B R p q, B θ with probability at least 1 ɛ 1 n B NE Y,X,n (B) >.
35 33 Let Conditions C1 and C2 be fulfilled. Then for any ɛ > 0, δ (0, 1) there is ρ > 0 and n ɛ,δ,ρ N such that for any n > n ɛ,δ,ρ we have P(ω Ω : sup B ρ 1 n n w ( ) F (n) B,Σ (d i(b, Σ)) B X i (Y i B X i ) ( B E w (F B,Σ (d 1 (B, Σ))) X 1 (Y 1 B X 1 ) ) < ) > 1 ɛ
36 34 Proof Let us fix ɛ, δ (0, 1), ρ > 0 and Σ > 1 and denote E X 1 = κ (1). Recalling that we have assumed B 0 = 0, we shall consider for B R p q, B ρ 1 n B NE Y,X,n (B) = 1 n = 1 n 1 n n w n w n w ( ) F (n) B,Σ (d i(b, Σ)) B X i (Y i B X i ) ( ) F (n) B,Σ (d i(b, Σ)) B X i X i B (29) ( ) F (n) B,Σ (d i(b, Σ)) B X i Y i
37 35 Let us again start with (29) and put τ (1) = δ/16κ (1) ρ 2 L. Due to Lemma A2 we can find n 1 N so that for any n > n 1, there is a set B n (1) such that P(B n (1) ) > 1 ɛ/8 and for any ω B n (1) sup sup F (n) B,Σ (r) F B,Σ(r) τ (1) (30) B R p q r R Employing the law of large numbers, find n 2 > n 1 so that for any n > n 2, there is a set B n (2) such that P(B n (2) ) > 1 ɛ/8 and for any ω B n (2) n X i 2 < 2κ (1) (31)
38 36 Since ten for any n > n 2 1 n sup B ρ n ( ( ) ) w F (n) B,Σ (d i(b, Σ)) w (F B,Σ (d i (B, Σ))) X i X i Lτ (1) n n X i X Lτ (1) i n 2κ(1) = δ 8ρ 2 We have for any n > n 2 and any ω B n (1) B n (2) 1 n sup B ρ n (w(f (n) B,Σ (d i(b, Σ))) w(f B,Σ (d i (B, Σ))))B X i X i B δ 8 (32)
39 37 δ Employing Lemma A1 and find for = such τ (2) > 0 16Lκ (1) ρ 2 that for T (ρ, τ (2) ) = { B (1) ρ, B (2) ρ, B (1) B (2) < τ (2)} we have sup (B (1),B (2) ) T (ρ,τ (2) ) r R sup F B (1),Σ (r) F B (2),Σ (r) < (33) then for any n > n 2 and any ω B n (1) B n (2) 1 sup (w(f n B (1),Σ (r i(b (2) ))) w(f B (2),Σ (r i(b (2) )))) (B (1),B (2) ) T (ρ,τ (2) ) B (1) X i X i B (1)
40 38 L ρ 2 n X i 2 δ 8 (34) The rest of the proof requires further notation and use of Hoelder s inequality and runs along similar line as the proof of in Víšek (2008).
41 39 I Mašíček, L. (2004). Diagnostika a senzitivita robustních modelů. MFF UK Prague: PhD. thesis. Víšek, J. Á. (2008). Consistency of the least weighted squares. Kybernetika(submitted). Víšek, J. Á. (2009). Consistency of the least weighted squares under heteroscedasticity. Kybernetika(submitted).
42 40 Thank You For Your Attention!
Multivariate Least Weighted Squares (MLWS)
() Stochastic Modelling in Economics and Finance 2 Supervisor : Prof. RNDr. Jan Ámos Víšek, CSc. Petr Jonáš 23 rd February 2015 Contents 1 2 3 4 5 6 7 1 1 Introduction 2 Algorithm 3 Proof of consistency
More informationodhady a jejich (ekonometrické)
modifikace Stochastic Modelling in Economics and Finance 2 Advisor : Prof. RNDr. Jan Ámos Víšek, CSc. Petr Jonáš 27 th April 2009 Contents 1 2 3 4 29 1 In classical approach we deal with stringent stochastic
More informationMachine Learning. Support Vector Machines. Fabio Vandin November 20, 2017
Machine Learning Support Vector Machines Fabio Vandin November 20, 2017 1 Classification and Margin Consider a classification problem with two classes: instance set X = R d label set Y = { 1, 1}. Training
More informationROBUST - September 10-14, 2012
Charles University in Prague ROBUST - September 10-14, 2012 Linear equations We observe couples (y 1, x 1 ), (y 2, x 2 ), (y 3, x 3 ),......, where y t R, x t R d t N. We suppose that members of couples
More informationAn introduction to Mathematical Theory of Control
An introduction to Mathematical Theory of Control Vasile Staicu University of Aveiro UNICA, May 2018 Vasile Staicu (University of Aveiro) An introduction to Mathematical Theory of Control UNICA, May 2018
More informationTopics in Theoretical Computer Science: An Algorithmist's Toolkit Fall 2007
MIT OpenCourseWare http://ocw.mit.edu 18.409 Topics in Theoretical Computer Science: An Algorithmist's Toolkit Fall 2007 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.
More informationStochastic Optimization with Inequality Constraints Using Simultaneous Perturbations and Penalty Functions
International Journal of Control Vol. 00, No. 00, January 2007, 1 10 Stochastic Optimization with Inequality Constraints Using Simultaneous Perturbations and Penalty Functions I-JENG WANG and JAMES C.
More informationChapter 4: Asymptotic Properties of the MLE
Chapter 4: Asymptotic Properties of the MLE Daniel O. Scharfstein 09/19/13 1 / 1 Maximum Likelihood Maximum likelihood is the most powerful tool for estimation. In this part of the course, we will consider
More informationlarge number of i.i.d. observations from P. For concreteness, suppose
1 Subsampling Suppose X i, i = 1,..., n is an i.i.d. sequence of random variables with distribution P. Let θ(p ) be some real-valued parameter of interest, and let ˆθ n = ˆθ n (X 1,..., X n ) be some estimate
More informationUniversity of Warwick, EC9A0 Maths for Economists Lecture Notes 10: Dynamic Programming
University of Warwick, EC9A0 Maths for Economists 1 of 63 University of Warwick, EC9A0 Maths for Economists Lecture Notes 10: Dynamic Programming Peter J. Hammond Autumn 2013, revised 2014 University of
More informationAssignment 1: From the Definition of Convexity to Helley Theorem
Assignment 1: From the Definition of Convexity to Helley Theorem Exercise 1 Mark in the following list the sets which are convex: 1. {x R 2 : x 1 + i 2 x 2 1, i = 1,..., 10} 2. {x R 2 : x 2 1 + 2ix 1x
More informationAsymptotic Distributions for the Nelson-Aalen and Kaplan-Meier estimators and for test statistics.
Asymptotic Distributions for the Nelson-Aalen and Kaplan-Meier estimators and for test statistics. Dragi Anevski Mathematical Sciences und University November 25, 21 1 Asymptotic distributions for statistical
More informationIntroduction to Empirical Processes and Semiparametric Inference Lecture 08: Stochastic Convergence
Introduction to Empirical Processes and Semiparametric Inference Lecture 08: Stochastic Convergence Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics and Operations
More informationA Resampling Method on Pivotal Estimating Functions
A Resampling Method on Pivotal Estimating Functions Kun Nie Biostat 277,Winter 2004 March 17, 2004 Outline Introduction A General Resampling Method Examples - Quantile Regression -Rank Regression -Simulation
More informationDynamic Systems and Applications 13 (2004) PARTIAL DIFFERENTIATION ON TIME SCALES
Dynamic Systems and Applications 13 (2004) 351-379 PARTIAL DIFFERENTIATION ON TIME SCALES MARTIN BOHNER AND GUSEIN SH GUSEINOV University of Missouri Rolla, Department of Mathematics and Statistics, Rolla,
More informationIntroduction to Empirical Processes and Semiparametric Inference Lecture 09: Stochastic Convergence, Continued
Introduction to Empirical Processes and Semiparametric Inference Lecture 09: Stochastic Convergence, Continued Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics and
More informationValue Iteration and Action ɛ-approximation of Optimal Policies in Discounted Markov Decision Processes
Value Iteration and Action ɛ-approximation of Optimal Policies in Discounted Markov Decision Processes RAÚL MONTES-DE-OCA Departamento de Matemáticas Universidad Autónoma Metropolitana-Iztapalapa San Rafael
More informationThe Generalized Viscosity Implicit Rules of Asymptotically Nonexpansive Mappings in Hilbert Spaces
Applied Mathematical Sciences, Vol. 11, 2017, no. 12, 549-560 HIKARI Ltd, www.m-hikari.com https://doi.org/10.12988/ams.2017.718 The Generalized Viscosity Implicit Rules of Asymptotically Nonexpansive
More informationUniversity of California San Diego and Stanford University and
First International Workshop on Functional and Operatorial Statistics. Toulouse, June 19-21, 2008 K-sample Subsampling Dimitris N. olitis andjoseph.romano University of California San Diego and Stanford
More informationOn the Local Convergence of an Iterative Approach for Inverse Singular Value Problems
On the Local Convergence of an Iterative Approach for Inverse Singular Value Problems Zheng-jian Bai Benedetta Morini Shu-fang Xu Abstract The purpose of this paper is to provide the convergence theory
More informationRandom Matrix Theory and its Applications to Econometrics
Random Matrix Theory and its Applications to Econometrics Hyungsik Roger Moon University of Southern California Conference to Celebrate Peter Phillips 40 Years at Yale, October 2018 Spectral Analysis of
More informationOn Derivative Estimation of the Mean Time to Failure in Simulations of Highly Reliable Markovian Systems
On Derivative Estimation of the Mean Time to Failure in Simulations of Highly Reliable Markovian Systems Marvin K. Nakayama Department of Computer and Information Science New Jersey Institute of Technology
More informationRoth s Theorem on Arithmetic Progressions
September 25, 2014 The Theorema of Szemerédi and Roth For Λ N the (upper asymptotic) density of Λ is the number σ(λ) := lim sup N Λ [1, N] N [0, 1] The Theorema of Szemerédi and Roth For Λ N the (upper
More informationEmpirical Risk Minimization
Empirical Risk Minimization Fabrice Rossi SAMM Université Paris 1 Panthéon Sorbonne 2018 Outline Introduction PAC learning ERM in practice 2 General setting Data X the input space and Y the output space
More informationON A HYBRID PROXIMAL POINT ALGORITHM IN BANACH SPACES
U.P.B. Sci. Bull., Series A, Vol. 80, Iss. 3, 2018 ISSN 1223-7027 ON A HYBRID PROXIMAL POINT ALGORITHM IN BANACH SPACES Vahid Dadashi 1 In this paper, we introduce a hybrid projection algorithm for a countable
More informationarxiv: v2 [math.ds] 9 Jun 2013
SHAPES OF POLYNOMIAL JULIA SETS KATHRYN A. LINDSEY arxiv:209.043v2 [math.ds] 9 Jun 203 Abstract. Any Jordan curve in the complex plane can be approximated arbitrarily well in the Hausdorff topology by
More informationESTIMATION OF NONLINEAR BERKSON-TYPE MEASUREMENT ERROR MODELS
Statistica Sinica 13(2003), 1201-1210 ESTIMATION OF NONLINEAR BERKSON-TYPE MEASUREMENT ERROR MODELS Liqun Wang University of Manitoba Abstract: This paper studies a minimum distance moment estimator for
More information4th Preparation Sheet - Solutions
Prof. Dr. Rainer Dahlhaus Probability Theory Summer term 017 4th Preparation Sheet - Solutions Remark: Throughout the exercise sheet we use the two equivalent definitions of separability of a metric space
More informationConvergence of Multivariate Quantile Surfaces
Convergence of Multivariate Quantile Surfaces Adil Ahidar Institut de Mathématiques de Toulouse - CERFACS August 30, 2013 Adil Ahidar (Institut de Mathématiques de Toulouse Convergence - CERFACS) of Multivariate
More informationThe annals of Statistics (2006)
High dimensional graphs and variable selection with the Lasso Nicolai Meinshausen and Peter Buhlmann The annals of Statistics (2006) presented by Jee Young Moon Feb. 19. 2010 High dimensional graphs and
More informationAn Accelerated Hybrid Proximal Extragradient Method for Convex Optimization and its Implications to Second-Order Methods
An Accelerated Hybrid Proximal Extragradient Method for Convex Optimization and its Implications to Second-Order Methods Renato D.C. Monteiro B. F. Svaiter May 10, 011 Revised: May 4, 01) Abstract This
More informationDesign of hybrid control systems for continuous-time plants: from the Clegg integrator to the hybrid H controller
Design of hybrid control systems for continuous-time plants: from the Clegg integrator to the hybrid H controller Luca Zaccarian LAAS-CNRS, Toulouse and University of Trento University of Oxford November
More informationTowards stability and optimality in stochastic gradient descent
Towards stability and optimality in stochastic gradient descent Panos Toulis, Dustin Tran and Edoardo M. Airoldi August 26, 2016 Discussion by Ikenna Odinaka Duke University Outline Introduction 1 Introduction
More informationNonconcave Penalized Likelihood with A Diverging Number of Parameters
Nonconcave Penalized Likelihood with A Diverging Number of Parameters Jianqing Fan and Heng Peng Presenter: Jiale Xu March 12, 2010 Jianqing Fan and Heng Peng Presenter: JialeNonconcave Xu () Penalized
More informationANALYSIS QUALIFYING EXAM FALL 2016: SOLUTIONS. = lim. F n
ANALYSIS QUALIFYING EXAM FALL 206: SOLUTIONS Problem. Let m be Lebesgue measure on R. For a subset E R and r (0, ), define E r = { x R: dist(x, E) < r}. Let E R be compact. Prove that m(e) = lim m(e /n).
More informationEXISTENCE OF SOLUTIONS TO ASYMPTOTICALLY PERIODIC SCHRÖDINGER EQUATIONS
Electronic Journal of Differential Equations, Vol. 017 (017), No. 15, pp. 1 7. ISSN: 107-6691. URL: http://ejde.math.txstate.edu or http://ejde.math.unt.edu EXISTENCE OF SOLUTIONS TO ASYMPTOTICALLY PERIODIC
More informationLong-Run Covariability
Long-Run Covariability Ulrich K. Müller and Mark W. Watson Princeton University October 2016 Motivation Study the long-run covariability/relationship between economic variables great ratios, long-run Phillips
More informationStatistical learning with Lipschitz and convex loss functions
Statistical learning with Lipschitz and convex loss functions Geoffrey Chinot, Guillaume Lecué and Matthieu Lerasle October 3, 08 Abstract We obtain risk bounds for Empirical Risk Minimizers ERM and minmax
More informationUnderstanding Regressions with Observations Collected at High Frequency over Long Span
Understanding Regressions with Observations Collected at High Frequency over Long Span Yoosoon Chang Department of Economics, Indiana University Joon Y. Park Department of Economics, Indiana University
More informationThreshold solutions and sharp transitions for nonautonomous parabolic equations on R N
Threshold solutions and sharp transitions for nonautonomous parabolic equations on R N P. Poláčik School of Mathematics, University of Minnesota Minneapolis, MN 55455 Abstract This paper is devoted to
More informationHigh Breakdown Point Estimation in Regression
WDS'08 Proceedings of Contributed Papers, Part I, 94 99, 2008. ISBN 978-80-7378-065-4 MATFYZPRESS High Breakdown Point Estimation in Regression T. Jurczyk Charles University, Faculty of Mathematics and
More informationA derivative-free nonmonotone line search and its application to the spectral residual method
IMA Journal of Numerical Analysis (2009) 29, 814 825 doi:10.1093/imanum/drn019 Advance Access publication on November 14, 2008 A derivative-free nonmonotone line search and its application to the spectral
More informationA NEW UPPER BOUND FOR FINITE ADDITIVE BASES
A NEW UPPER BOUND FOR FINITE ADDITIVE BASES C SİNAN GÜNTÜRK AND MELVYN B NATHANSON Abstract Let n, k denote the largest integer n for which there exists a set A of k nonnegative integers such that the
More informationSOME NOTES ON HOTELLING TUBES
SOME NOTES ON HOTELLING TUBES ROGER KOENKER 1. Introduction We begin with the two examples of Johansen and Johnstone. The first is Hotelling s (1939) original example, and the second is closely related
More informationOptimal Covariance Change Point Detection in High Dimension
Optimal Covariance Change Point Detection in High Dimension Joint work with Daren Wang and Alessandro Rinaldo, CMU Yi Yu School of Mathematics, University of Bristol Outline Review of change point detection
More informationfor all subintervals I J. If the same is true for the dyadic subintervals I D J only, we will write ϕ BMO d (J). In fact, the following is true
3 ohn Nirenberg inequality, Part I A function ϕ L () belongs to the space BMO() if sup ϕ(s) ϕ I I I < for all subintervals I If the same is true for the dyadic subintervals I D only, we will write ϕ BMO
More informationGENERALIZED RAY KNIGHT THEORY AND LIMIT THEOREMS FOR SELF-INTERACTING RANDOM WALKS ON Z 1. Hungarian Academy of Sciences
The Annals of Probability 1996, Vol. 24, No. 3, 1324 1367 GENERALIZED RAY KNIGHT THEORY AND LIMIT THEOREMS FOR SELF-INTERACTING RANDOM WALKS ON Z 1 By Bálint Tóth Hungarian Academy of Sciences We consider
More informationEquilibria with a nontrivial nodal set and the dynamics of parabolic equations on symmetric domains
Equilibria with a nontrivial nodal set and the dynamics of parabolic equations on symmetric domains J. Földes Department of Mathematics, Univerité Libre de Bruxelles 1050 Brussels, Belgium P. Poláčik School
More informationEstimation and Inference of Quantile Regression. for Survival Data under Biased Sampling
Estimation and Inference of Quantile Regression for Survival Data under Biased Sampling Supplementary Materials: Proofs of the Main Results S1 Verification of the weight function v i (t) for the lengthbiased
More informationNoncooperative continuous-time Markov games
Morfismos, Vol. 9, No. 1, 2005, pp. 39 54 Noncooperative continuous-time Markov games Héctor Jasso-Fuentes Abstract This work concerns noncooperative continuous-time Markov games with Polish state and
More informationConvex Geometry. Carsten Schütt
Convex Geometry Carsten Schütt November 25, 2006 2 Contents 0.1 Convex sets... 4 0.2 Separation.... 9 0.3 Extreme points..... 15 0.4 Blaschke selection principle... 18 0.5 Polytopes and polyhedra.... 23
More informationA NOTE ON STOCHASTIC INTEGRALS AS L 2 -CURVES
A NOTE ON STOCHASTIC INTEGRALS AS L 2 -CURVES STEFAN TAPPE Abstract. In a work of van Gaans (25a) stochastic integrals are regarded as L 2 -curves. In Filipović and Tappe (28) we have shown the connection
More informationThe Metamathematics of Randomness
The Metamathematics of Randomness Jan Reimann January 26, 2007 (Original) Motivation Effective extraction of randomness In my PhD-thesis I studied the computational power of reals effectively random for
More informationOutline. A Central Limit Theorem for Truncating Stochastic Algorithms
Outline A Central Limit Theorem for Truncating Stochastic Algorithms Jérôme Lelong http://cermics.enpc.fr/ lelong Tuesday September 5, 6 1 3 4 Jérôme Lelong (CERMICS) Tuesday September 5, 6 1 / 3 Jérôme
More informationNonparametric Regression
Adaptive Variance Function Estimation in Heteroscedastic Nonparametric Regression T. Tony Cai and Lie Wang Abstract We consider a wavelet thresholding approach to adaptive variance function estimation
More informationModel Selection and Geometry
Model Selection and Geometry Pascal Massart Université Paris-Sud, Orsay Leipzig, February Purpose of the talk! Concentration of measure plays a fundamental role in the theory of model selection! Model
More informationA GENERAL THEOREM ON APPROXIMATE MAXIMUM LIKELIHOOD ESTIMATION. Miljenko Huzak University of Zagreb,Croatia
GLASNIK MATEMATIČKI Vol. 36(56)(2001), 139 153 A GENERAL THEOREM ON APPROXIMATE MAXIMUM LIKELIHOOD ESTIMATION Miljenko Huzak University of Zagreb,Croatia Abstract. In this paper a version of the general
More informationSome definitions. Math 1080: Numerical Linear Algebra Chapter 5, Solving Ax = b by Optimization. A-inner product. Important facts
Some definitions Math 1080: Numerical Linear Algebra Chapter 5, Solving Ax = b by Optimization M. M. Sussman sussmanm@math.pitt.edu Office Hours: MW 1:45PM-2:45PM, Thack 622 A matrix A is SPD (Symmetric
More informationWeb Appendix for Hierarchical Adaptive Regression Kernels for Regression with Functional Predictors by D. B. Woodard, C. Crainiceanu, and D.
Web Appendix for Hierarchical Adaptive Regression Kernels for Regression with Functional Predictors by D. B. Woodard, C. Crainiceanu, and D. Ruppert A. EMPIRICAL ESTIMATE OF THE KERNEL MIXTURE Here we
More informationCompressibility of Infinite Sequences and its Interplay with Compressed Sensing Recovery
Compressibility of Infinite Sequences and its Interplay with Compressed Sensing Recovery Jorge F. Silva and Eduardo Pavez Department of Electrical Engineering Information and Decision Systems Group Universidad
More informationExercise Solutions to Functional Analysis
Exercise Solutions to Functional Analysis Note: References refer to M. Schechter, Principles of Functional Analysis Exersize that. Let φ,..., φ n be an orthonormal set in a Hilbert space H. Show n f n
More informationRectifiability of sets and measures
Rectifiability of sets and measures Tatiana Toro University of Washington IMPA February 7, 206 Tatiana Toro (University of Washington) Structure of n-uniform measure in R m February 7, 206 / 23 State of
More informationSection 8: Asymptotic Properties of the MLE
2 Section 8: Asymptotic Properties of the MLE In this part of the course, we will consider the asymptotic properties of the maximum likelihood estimator. In particular, we will study issues of consistency,
More informationBackward Stochastic Differential Equations with Infinite Time Horizon
Backward Stochastic Differential Equations with Infinite Time Horizon Holger Metzler PhD advisor: Prof. G. Tessitore Università di Milano-Bicocca Spring School Stochastic Control in Finance Roscoff, March
More informationIntroduction to Empirical Processes and Semiparametric Inference Lecture 13: Entropy Calculations
Introduction to Empirical Processes and Semiparametric Inference Lecture 13: Entropy Calculations Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics and Operations Research
More informationJUSTIN HARTMANN. F n Σ.
BROWNIAN MOTION JUSTIN HARTMANN Abstract. This paper begins to explore a rigorous introduction to probability theory using ideas from algebra, measure theory, and other areas. We start with a basic explanation
More informationThe quasi-periodic Frenkel-Kontorova model
The quasi-periodic Frenkel-Kontorova model Philippe Thieullen (Bordeaux) joint work with E. Garibaldi (Campinas) et S. Petite (Amiens) Korean-French Conference in Mathematics Pohang, August 24-28,2015
More informationSDS : Theoretical Statistics
SDS 384 11: Theoretical Statistics Lecture 1: Introduction Purnamrita Sarkar Department of Statistics and Data Science The University of Texas at Austin https://psarkar.github.io/teaching Manegerial Stuff
More informationApproximation of BSDEs using least-squares regression and Malliavin weights
Approximation of BSDEs using least-squares regression and Malliavin weights Plamen Turkedjiev (turkedji@math.hu-berlin.de) 3rd July, 2012 Joint work with Prof. Emmanuel Gobet (E cole Polytechnique) Plamen
More informationAsymptotics of minimax stochastic programs
Asymptotics of minimax stochastic programs Alexander Shapiro Abstract. We discuss in this paper asymptotics of the sample average approximation (SAA) of the optimal value of a minimax stochastic programming
More informationMachine Learning And Applications: Supervised Learning-SVM
Machine Learning And Applications: Supervised Learning-SVM Raphaël Bournhonesque École Normale Supérieure de Lyon, Lyon, France raphael.bournhonesque@ens-lyon.fr 1 Supervised vs unsupervised learning Machine
More informationLECTURE 1: SOURCES OF ERRORS MATHEMATICAL TOOLS A PRIORI ERROR ESTIMATES. Sergey Korotov,
LECTURE 1: SOURCES OF ERRORS MATHEMATICAL TOOLS A PRIORI ERROR ESTIMATES Sergey Korotov, Institute of Mathematics Helsinki University of Technology, Finland Academy of Finland 1 Main Problem in Mathematical
More informationContents: 1. Minimization. 2. The theorem of Lions-Stampacchia for variational inequalities. 3. Γ -Convergence. 4. Duality mapping.
Minimization Contents: 1. Minimization. 2. The theorem of Lions-Stampacchia for variational inequalities. 3. Γ -Convergence. 4. Duality mapping. 1 Minimization A Topological Result. Let S be a topological
More informationASYMPTOTICALLY NONEXPANSIVE MAPPINGS IN MODULAR FUNCTION SPACES ABSTRACT
ASYMPTOTICALLY NONEXPANSIVE MAPPINGS IN MODULAR FUNCTION SPACES T. DOMINGUEZ-BENAVIDES, M.A. KHAMSI AND S. SAMADI ABSTRACT In this paper, we prove that if ρ is a convex, σ-finite modular function satisfying
More informationBrownian Motion. 1 Definition Brownian Motion Wiener measure... 3
Brownian Motion Contents 1 Definition 2 1.1 Brownian Motion................................. 2 1.2 Wiener measure.................................. 3 2 Construction 4 2.1 Gaussian process.................................
More informationThe deterministic Lasso
The deterministic Lasso Sara van de Geer Seminar für Statistik, ETH Zürich Abstract We study high-dimensional generalized linear models and empirical risk minimization using the Lasso An oracle inequality
More informationSolution. 1 Solution of Homework 7. Sangchul Lee. March 22, Problem 1.1
Solution Sangchul Lee March, 018 1 Solution of Homework 7 Problem 1.1 For a given k N, Consider two sequences (a n ) and (b n,k ) in R. Suppose that a n b n,k for all n,k N Show that limsup a n B k :=
More informationHigh Dimensional Inverse Covariate Matrix Estimation via Linear Programming
High Dimensional Inverse Covariate Matrix Estimation via Linear Programming Ming Yuan October 24, 2011 Gaussian Graphical Model X = (X 1,..., X p ) indep. N(µ, Σ) Inverse covariance matrix Σ 1 = Ω = (ω
More informationThe Strong Convergence Theorems for the Split Equality Fixed Point Problems in Hilbert Spaces
Applied Mathematical Sciences, Vol. 2, 208, no. 6, 255-270 HIKARI Ltd, www.m-hikari.com https://doi.org/0.2988/ams.208.86 The Strong Convergence Theorems for the Split Equality Fixed Point Problems in
More information2 Statistical Estimation: Basic Concepts
Technion Israel Institute of Technology, Department of Electrical Engineering Estimation and Identification in Dynamical Systems (048825) Lecture Notes, Fall 2009, Prof. N. Shimkin 2 Statistical Estimation:
More informationLarge deviations and averaging for systems of slow fast stochastic reaction diffusion equations.
Large deviations and averaging for systems of slow fast stochastic reaction diffusion equations. Wenqing Hu. 1 (Joint work with Michael Salins 2, Konstantinos Spiliopoulos 3.) 1. Department of Mathematics
More informationSingular Integrals. 1 Calderon-Zygmund decomposition
Singular Integrals Analysis III Calderon-Zygmund decomposition Let f be an integrable function f dx 0, f = g + b with g Cα almost everywhere, with b
More informationMath The Laplacian. 1 Green s Identities, Fundamental Solution
Math. 209 The Laplacian Green s Identities, Fundamental Solution Let be a bounded open set in R n, n 2, with smooth boundary. The fact that the boundary is smooth means that at each point x the external
More informationChapter 7. Hypothesis Testing
Chapter 7. Hypothesis Testing Joonpyo Kim June 24, 2017 Joonpyo Kim Ch7 June 24, 2017 1 / 63 Basic Concepts of Testing Suppose that our interest centers on a random variable X which has density function
More informationGeneralized Method of Moments (GMM) Estimation
Econometrics 2 Fall 2004 Generalized Method of Moments (GMM) Estimation Heino Bohn Nielsen of29 Outline of the Lecture () Introduction. (2) Moment conditions and methods of moments (MM) estimation. Ordinary
More informationAn introduction to Birkhoff normal form
An introduction to Birkhoff normal form Dario Bambusi Dipartimento di Matematica, Universitá di Milano via Saldini 50, 0133 Milano (Italy) 19.11.14 1 Introduction The aim of this note is to present an
More informationJournal of Multivariate Analysis. Sphericity test in a GMANOVA MANOVA model with normal error
Journal of Multivariate Analysis 00 (009) 305 3 Contents lists available at ScienceDirect Journal of Multivariate Analysis journal homepage: www.elsevier.com/locate/jmva Sphericity test in a GMANOVA MANOVA
More informationFall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.
1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n
More informationIndex Models for Sparsely Sampled Functional Data. Supplementary Material.
Index Models for Sparsely Sampled Functional Data. Supplementary Material. May 21, 2014 1 Proof of Theorem 2 We will follow the general argument in the proof of Theorem 3 in Li et al. 2010). Write d =
More informationThe Equivalence of the Convergence of Four Kinds of Iterations for a Finite Family of Uniformly Asymptotically ø-pseudocontractive Mappings
±39ff±1ffi ß Ω χ Vol.39, No.1 2010fl2fl ADVANCES IN MATHEMATICS Feb., 2010 The Equivalence of the Convergence of Four Kinds of Iterations for a Finite Family of Uniformly Asymptotically ø-pseudocontractive
More informationMath 61CM - Solutions to homework 6
Math 61CM - Solutions to homework 6 Cédric De Groote November 5 th, 2018 Problem 1: (i) Give an example of a metric space X such that not all Cauchy sequences in X are convergent. (ii) Let X be a metric
More information(Part 1) High-dimensional statistics May / 41
Theory for the Lasso Recall the linear model Y i = p j=1 β j X (j) i + ɛ i, i = 1,..., n, or, in matrix notation, Y = Xβ + ɛ, To simplify, we assume that the design X is fixed, and that ɛ is N (0, σ 2
More informationSupremum of simple stochastic processes
Subspace embeddings Daniel Hsu COMS 4772 1 Supremum of simple stochastic processes 2 Recap: JL lemma JL lemma. For any ε (0, 1/2), point set S R d of cardinality 16 ln n S = n, and k N such that k, there
More informationv( x) u( y) dy for any r > 0, B r ( x) Ω, or equivalently u( w) ds for any r > 0, B r ( x) Ω, or ( not really) equivalently if v exists, v 0.
Sep. 26 The Perron Method In this lecture we show that one can show existence of solutions using maximum principle alone.. The Perron method. Recall in the last lecture we have shown the existence of solutions
More informationA FIXED POINT THEOREM FOR GENERALIZED NONEXPANSIVE MULTIVALUED MAPPINGS
Fixed Point Theory, (0), No., 4-46 http://www.math.ubbcluj.ro/ nodeacj/sfptcj.html A FIXED POINT THEOREM FOR GENERALIZED NONEXPANSIVE MULTIVALUED MAPPINGS A. ABKAR AND M. ESLAMIAN Department of Mathematics,
More informationGeneralization theory
Generalization theory Daniel Hsu Columbia TRIPODS Bootcamp 1 Motivation 2 Support vector machines X = R d, Y = { 1, +1}. Return solution ŵ R d to following optimization problem: λ min w R d 2 w 2 2 + 1
More informationA new class of pseudodifferential operators with mixed homogenities
A new class of pseudodifferential operators with mixed homogenities Po-Lam Yung University of Oxford Jan 20, 2014 Introduction Given a smooth distribution of hyperplanes on R N (or more generally on a
More informationAPPLICATIONS IN FIXED POINT THEORY. Matthew Ray Farmer. Thesis Prepared for the Degree of MASTER OF ARTS UNIVERSITY OF NORTH TEXAS.
APPLICATIONS IN FIXED POINT THEORY Matthew Ray Farmer Thesis Prepared for the Degree of MASTER OF ARTS UNIVERSITY OF NORTH TEXAS December 2005 APPROVED: Elizabeth M. Bator, Major Professor Paul Lewis,
More informationThe Arzelà-Ascoli Theorem
John Nachbar Washington University March 27, 2016 The Arzelà-Ascoli Theorem The Arzelà-Ascoli Theorem gives sufficient conditions for compactness in certain function spaces. Among other things, it helps
More informationSUPPLEMENT TO POSTERIOR CONTRACTION AND CREDIBLE SETS FOR FILAMENTS OF REGRESSION FUNCTIONS. North Carolina State University
Submitted to the Annals of Statistics SUPPLEMENT TO POSTERIOR CONTRACTION AND CREDIBLE SETS FOR FILAMENTS OF REGRESSION FUNCTIONS By Wei Li and Subhashis Ghosal North Carolina State University The supplementary
More information