Testing the number of parameters with multidimensional MLP
|
|
- Amy McGee
- 5 years ago
- Views:
Transcription
1 Testig the umber of parameters with multidimesioal MLP Joseph Rykiewicz To cite this versio: Joseph Rykiewicz. Testig the umber of parameters with multidimesioal MLP. ASMDA 2005, 2005, Brest, Frace. pp , <hal > HAL Id: hal Submitted o 21 Feb 2008 HAL is a multi-discipliary ope access archive for the deposit ad dissemiatio of scietific research documets, whether they are published or ot. The documets may come from teachig ad research istitutios i Frace or abroad, or from public or private research ceters. L archive ouverte pluridiscipliaire HAL, est destiée au dépôt et à la diffusio de documets scietifiques de iveau recherche, publiés ou o, émaat des établissemets d eseigemet et de recherche fraçais ou étragers, des laboratoires publics ou privés.
2 Testig the umber of parameters of multidimesioal MLP Joseph Rykiewicz 1 SAMOS - MATISSE Uiversité de Paris I 72 rue Regault, Paris, Frace joseph.rykiewicz@uiv-paris1.fr Abstract. This work cocers testig the umber of parameters i oe hidde layer multilayer perceptro MLP. For this purpose we assume that we have idetifiable models, up to a fiite group of trasformatios o the weights, this is for example the case whe the umber of hidde uits is kow. I this framework, we show that we get a simple asymptotic distributio, if we use the logarithm of the determiat of the empirical error covariace matrix as cost fuctio. Keywords: Multilayer Perceptro, Statistical test, Asymptotic distributio. hal , versio 1-21 Feb Itroductio Cosider a sequece Y t, Z t t N of i.i.d. 1 radom vectors i.e. idetically distributed ad idepedets. So, each couple Y t, Z t has the same law that a geeric variable Y, Z R d R d. 1.1 The model Assume that the model ca be writte where Y t = F W 0Z t + ε t F W 0 is a fuctio represeted by a oe hidde layer MLP with parameters or weights W 0 ad sigmoidal fuctios i the hidde uit. The oise, ε t t N, is sequece of i.i.d. cetered variables with ukow ivertible covariace matrix ΓW 0. Write ε the geeric variable with the same law that each ε t. Notes that a fiite umber of trasformatios of the weights leave the MLP fuctios ivariat, these permutatios form a fiite group see [Sussma, 1992]. To overcome this problem, we will cosider equivalece classes of MLP : two 1 It is ot hard to exted all what we show i this paper for statioary mixig variables ad so for time series
3 2 Joseph Rykiewicz MLP are i the same class if the first oe is the image by such trasformatio of the secod oe, the cosidered set of parameter is the the quotiet space of parameters by the fiite group of trasformatios. I this space, we assume that the model is idetifiable, this ca be doe if we cosider oly MLP with the true umber of hidde uits see [Sussma, 1992]. Note that, if the umber of hidde uits is over-estimated, the such test ca have very bad behavior see [Fukumizu, 2003]. We agree that the assumptio of idetifiability is very restrictive, but we wat emphasize the fact that, eve i this framework, classical test of the umber of parameters i the case of multidimesioal output MLP is ot satisfactory ad we propose to improve it. 1.2 testig the umber of parameters Let q be a iteger lesser tha s, we wat to test H 0 : W Θ q R q agaist H 1 : W Θ s R s, where the sets Θ q ad Θ s are compact. H 0 express the fact that W belogs to a subset of Θ s with a parametric dimesio lesser tha s or, equivaletly, that s q weights of the MLP i Θ s are ull. If we cosider the classic cost fuctio : V W = Y t F W Z t 2 where x deotes the Euclidea orm of x, we get the followig statistic of test : S = mi V W mi V W W Θ q W Θ s It is show i [Yao, 2000], that S coverges i law to a poderated sum of χ 2 1 s q D S λ i χ 2 i,1 i=1 where the χ 2 i,1 are s q i.i.d. χ2 1 variables ad λ i are strictly positives values, differet of 1 if the true covariace matrix of the oise is ot the idetity. So, i the geeral case, where the true covariace matrix of the oise is ot the idetity, the asymptotic distributio is ot kow, because the λ i are ot kow ad it is difficult to compute the asymptotic level of the test. To overcome this difficulty we propose to use istead the cost fuctio 1 U W := l det Y t F W Z t Y t F W Z t T. 1 we will show that, uder suitable assumptios, the statistic of test : T = mi U W mi U W W Θ q W Θ s will coverge to a classical χ 2 s q so the asymptotic level of the test will be very easy to compute. The sequel of this paper is devoted to the proof of this property. 2
4 multidimesioal MLP 3 2 Asymptotic properties of T I order to ivestigate the asymptotic properties of the test we have to prove the cosistecy ad the asymptotic ormality of Ŵ = argmi W Θs U W. Assume, i the sequel, that ε has a momet of order at least 2 ad ote Γ W = 1 Y t F W Z t Y t F W Z t T remark that these matrix Γ W ad it iverse are symmetric. i the same way, we ote ΓW = lim Γ W, which is well defied because of the momet coditio o ε 2.1 Cosistecy of Ŵ First we have to idetify cotrast fuctio associated to U W Lemma 1 U W U W 0 a.s. KW, W 0 with KW, W 0 0 ad KW, W 0 = 0 if ad oly if W = W 0. Proof : By the strog law of large umber we have U W U W 0 a.s. l detγw l detγw 0 = l detγw l det Γ W 0 ΓW ΓW 0 + I d detγw 0 = where I d deotes the idetity matrix of R d. So, the lemme is true if ΓW ΓW 0 is a positive matrix, ull oly if W = W 0. But this property is true sice ΓW = E Y F W ZY F W Z T = E Y F W 0Z + F W 0Z F W ZY F W 0Z + F W 0Z F W Z T = E Y F W 0ZY F W 0Z T + E F W 0Z F W ZF W 0Z F W Z T = ΓW 0 + E F W 0Z F W ZF W 0Z F W Z T We deduce the the theorem of cosistecy : Theorem 1 If E ε 2 <, Ŵ P W 0
5 4 Joseph Rykiewicz Proof Remark that it exist a costat B such that sup W Θs Y F W Z 2 < Y 2 + B because Θ s is compact, so F W Z is bouded. For a matrix A R d d, let A be a orm, for example A 2 = tr AA T. We have ad sice the fuctio : limif W Θs Γ W = ΓW 0 > 0 limsup W Θs Γ W := C < Γ l det Γ, for C Γ ΓW 0 is uiformly cotiuous, by the same argumet that example 19.8 of [Va der Vaart, 1998] the set of fuctios U W, W Θ s is Gliveko- Catelli. Fially, the theorem 5.7 of [Va der Vaart, 1998], show that Ŵ coverge i probability to W Asymptotic ormality For this purpose we have to compute the first ad the secod derivative with respect to the parameters of U W. First, we itroduce a otatio : if F W X is a d-dimesioal parametric fuctio depedig of a parameter W, FW X write resp. 2 F WX for the d-dimesioal vector of partial derivative resp. secod order partial derivatives of each compoet of F W X. First derivatives : if Γ W is a matrix depedig of the parameter vector W, we get from [Magus ad Neudecker, 1988] l detγ W = tr Γ W W Γ W k Hece, if we ote usig the fact tr Γ A W k = 1 F Wz t y t F W z t T WA W k = tr A T W kγ W = tr Γ WAT W k we get l det Γ W = 2tr Γ WA W k 3
6 multidimesioal MLP 5 Secod derivatives : We write ow B W k, W l := 1 T F W z t F W z t ad We get C W k, W l := 1 y t F W z t 2 F W z t T 2 U W W l = Γ W 2tr W l 2tr Γ WA W k = W l A W k + 2tr Γ WB W k, W l + 2tr Γ W C W k, W l Now, [Magus ad Neudecker, 1988], give a aalytic form of the derivative of a iverse matrix, so we get so 2 U W 2tr Γ = 2tr Γ WB W k, W l + 2tr Γ 2 U W +2tr Γ W A W k + A T W k Γ WA W k + WC W k, W l WA W k Γ WA W k = 4tr Γ WB W k, W l + 2tr Γ WC W k, W l 4 Asymptotic distributio of Ŵ : The previous equatios allow us to give the asymptotic properties of the estimator miimizig the cost fuctio U W, amely from equatio 3 ad 4 we ca compute the asymptotic properties of the first ad the secod derivatives of U W. If the variable Z has a momet of order at least 3 the we get the followig lemma : Theorem 2 Assume that E ε 2 < ad E Z 3 <, let U W 0 be the gradiet vector of U W at W 0 ad HU W 0 be the Hessia matrix of U W at W 0. Write fially We get the BW k, W l := F WZ 1. HU W 0 a.s. 2I 0 2. U W 0 Law N0, 4I 0 3. Ŵ W 0 Law N0, I0 where, the compoet k, l of the matrix I 0 is : T F W Z W l tr Γ 0 E BW 0 k, W 0 l
7 6 Joseph Rykiewicz proof : We ca show easily that, for all x R d, we have : FWZ Cte1 + Z 2 F W Z Cte1 + Z 2 2 F W Z 2 F 0 W Z Cte W W Z 3 Write AW k = F WZ Y F W Z T ad UW := log dety F W Z. Note that the compoet k, l of the matrix 4I 0 is: UW 0 UW 0 E Wl 0 = E 2tr Γ0 A T Wk 0 2tr Γ0 AWl 0 ad, sice the trace of the product is ivariat by circular permutatio, E UW 0 UW 0 = Wl 0 4E F W 0ZT Γ0 Y F W 0ZY F W 0Z T Γ0 F W 0Z W l = 4E = 4tr W k FW 0Z T Γ0 F W 0Z W l Γ0 FW 0Z F E W 0Z T = 4tr Γ 0 E BW 0 k, W 0 l Now, the derivative FWZ is square itegrable, so U W 0 fulfills Lideberg s coditio see [Hall ad Heyde, 1980] ad U W 0 Law N0, 4I 0 For the compoet k, l of the expectatio of the Hessia matrix, remark first that ad so lim tr Γ W 0 A Wk 0 Γ W 0 A Wk 0 = 0 lim trγ C Wk 0, W l 0 = 0 lim H W 0 = lim 4tr Γ W 0 A Wk 0 2trΓ W 0 B Wk 0, W l 0 + 2trΓ C Wk 0, W l 0 = = 2tr Γ0 E BWk 0, W l 0 Now, sice 2 F WZ Cte1 + Z 2 ad 2 F W Z Γ W 0 A W 0 k + 2 F 0 W Z Cte W W Z 3, by stadard argumets foud, for example, i [Yao, 2000] we get Ŵ W 0 Law N0, I 0
8 multidimesioal MLP Asymptotic distributio of T I this sectio, we write Ŵ = argmi W Θs U W ad Ŵ 0 = argmi W Θq U W, where Θ q is view as a subset of R s. The asymptotic distributio of T is the a cosequece of the previous sectio, amely, if we have to replace U W by its Taylor expasio aroud Ŵ ad Ŵ 0, followig [Va der Vaart, 1998] chapter 16 we have : T = Ŵ Ŵ 0 T I0 Ŵ Ŵ 0 + o P 1 D χ 2 s q 3 Coclusio It has bee show that, i the case of multidimesioal output, the cost fuctio U W leads to a test for the umber of parameters i MLP simpler tha with the traditioal mea square cost fuctio. I fact the estimator Ŵ is also more efficiet tha the least square estimator see [Rykiewicz, 2003]. We ca also remark that U W matches with twice the cocetrated Gaussia log-likelihood but we have to emphasize, that its ice asymptotic properties eed oly momet coditio o ε ad Z, so it works eve if the distributio of the oise is ot Gaussia. A other solutio could be to use a approximatio of the covariace error matrix to compute geeralized least square estimator : 1 Y t F W Z t T Γ Y t F W Z t, assumig that Γ is a good approximatio of the true covariace matrix of the oise ΓW 0. However it take time to compute a good the matrix Γ ad if we try to compute the best matrix Γ with the data, it leads to the cost fuctio U W see for example [Gallat, 1987]. Fially, as we see i this paper, the computatio of the derivatives of U W is easy, so we ca use the effective differetial optimizatio techiques to estimate Ŵ ad umerical examples ca be foud i [Rykiewicz, 2003]. Refereces Fukumizu, 2003.K. Fukumizu. Likelihood ratio of uidetifiable models ad multilayer eural etworks. Aals of Statistics, 31:3: , Gallat, 1987.R.A. Gallat. No liear statistical models. J. Wiley ad Sos, New- York, Hall ad Heyde, 1980.P. Hall ad C. Heyde. Martigale limit theory ad its applicatios. Academic Press, New-York, Magus ad Neudecker, 1988.Ja R. Magus ad Heiz Neudecker. Matrix differetial calculus with applicatios i statistics ad ecoometrics. J. Wiley ad Sos, New-York, 1988.
9 8 Joseph Rykiewicz Rykiewicz, 2003.J. Rykiewicz. Estimatio of multidimesioal regressio model with multilayer perceptros. I J. Mira ad J.R. Alvarez, editors, Computatioal methods i eural modelig, volume 2686 of Lectures otes i computer sciece, pages , Sussma, 1992.H.J. Sussma. Uiqueess of the weights for miimal feedforward ets with a give iput-output map. Neural Networks, pages , Va der Vaart, 1998.A.W. Va der Vaart. Asymptotic statistics. Cambridge Uiversity Press, Cambridge, UK, Yao, 2000.J. Yao. O least square estimatio for stable oliear ar processes. The Aals of Istitut of Mathematical Statistics, 52: , 2000.
On the behavior at infinity of an integrable function
O the behavior at ifiity of a itegrable fuctio Emmauel Lesige To cite this versio: Emmauel Lesige. O the behavior at ifiity of a itegrable fuctio. The America Mathematical Mothly, 200, 7 (2), pp.75-8.
More informationThe Goldbach conjectures
The Goldbach cojectures Jamel Ghaouchi To cite this versio: Jamel Ghaouchi. The Goldbach cojectures. 2015. HAL Id: hal-01243303 https://hal.archives-ouvertes.fr/hal-01243303 Submitted o
More informationImprovement of Generic Attacks on the Rank Syndrome Decoding Problem
Improvemet of Geeric Attacks o the Rak Sydrome Decodig Problem Nicolas Arago, Philippe Gaborit, Adrie Hauteville, Jea-Pierre Tillich To cite this versio: Nicolas Arago, Philippe Gaborit, Adrie Hauteville,
More informationTURBULENT FUNCTIONS AND SOLVING THE NAVIER-STOKES EQUATION BY FOURIER SERIES
TURBULENT FUNCTIONS AND SOLVING THE NAVIER-STOKES EQUATION BY FOURIER SERIES M Sghiar To cite this versio: M Sghiar. TURBULENT FUNCTIONS AND SOLVING THE NAVIER-STOKES EQUATION BY FOURIER SERIES. Iteratioal
More informationQuantile regression with multilayer perceptrons.
Quatile regressio with multilayer perceptros. S.-F. Dimby ad J. Rykiewicz Uiversite Paris 1 - SAMM 90 Rue de Tolbiac, 75013 Paris - Frace Abstract. We cosider oliear quatile regressio ivolvig multilayer
More informationA Simple Proof of the Shallow Packing Lemma
A Simple Proof of the Shallow Packig Lemma Nabil Mustafa To cite this versio: Nabil Mustafa. A Simple Proof of the Shallow Packig Lemma. Discrete ad Computatioal Geometry, Spriger Verlag, 06, 55 (3), pp.739-743.
More informationOptimization Results for a Generalized Coupon Collector Problem
Optimizatio Results for a Geeralized Coupo Collector Problem Emmauelle Aceaume, Ya Busel, E Schulte-Geers, B Sericola To cite this versio: Emmauelle Aceaume, Ya Busel, E Schulte-Geers, B Sericola. Optimizatio
More informationStatistical Inference Based on Extremum Estimators
T. Rotheberg Fall, 2007 Statistical Iferece Based o Extremum Estimators Itroductio Suppose 0, the true value of a p-dimesioal parameter, is kow to lie i some subset S R p : Ofte we choose to estimate 0
More informationGini Index and Polynomial Pen s Parade
Gii Idex ad Polyomial Pe s Parade Jules Sadefo Kamdem To cite this versio: Jules Sadefo Kamdem. Gii Idex ad Polyomial Pe s Parade. 2011. HAL Id: hal-00582625 https://hal.archives-ouvertes.fr/hal-00582625
More informationConvergence of random variables. (telegram style notes) P.J.C. Spreij
Covergece of radom variables (telegram style otes).j.c. Spreij this versio: September 6, 2005 Itroductio As we kow, radom variables are by defiitio measurable fuctios o some uderlyig measurable space
More informationECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors
ECONOMETRIC THEORY MODULE XIII Lecture - 34 Asymptotic Theory ad Stochastic Regressors Dr. Shalabh Departmet of Mathematics ad Statistics Idia Istitute of Techology Kapur Asymptotic theory The asymptotic
More informationEfficient GMM LECTURE 12 GMM II
DECEMBER 1 010 LECTURE 1 II Efficiet The estimator depeds o the choice of the weight matrix A. The efficiet estimator is the oe that has the smallest asymptotic variace amog all estimators defied by differet
More informationRank tests and regression rank scores tests in measurement error models
Rak tests ad regressio rak scores tests i measuremet error models J. Jurečková ad A.K.Md.E. Saleh Charles Uiversity i Prague ad Carleto Uiversity i Ottawa Abstract The rak ad regressio rak score tests
More informationRegression with an Evaporating Logarithmic Trend
Regressio with a Evaporatig Logarithmic Tred Peter C. B. Phillips Cowles Foudatio, Yale Uiversity, Uiversity of Aucklad & Uiversity of York ad Yixiao Su Departmet of Ecoomics Yale Uiversity October 5,
More informationLecture 19: Convergence
Lecture 19: Covergece Asymptotic approach I statistical aalysis or iferece, a key to the success of fidig a good procedure is beig able to fid some momets ad/or distributios of various statistics. I may
More information7.1 Convergence of sequences of random variables
Chapter 7 Limit Theorems Throughout this sectio we will assume a probability space (, F, P), i which is defied a ifiite sequece of radom variables (X ) ad a radom variable X. The fact that for every ifiite
More informationLinear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d
Liear regressio Daiel Hsu (COMS 477) Maximum likelihood estimatio Oe of the simplest liear regressio models is the followig: (X, Y ),..., (X, Y ), (X, Y ) are iid radom pairs takig values i R d R, ad Y
More informationGeneral bound of overfitting for MLP regression models.
arxiv:20.0633v [math.st] 3 Ja 202 Geeral boud of overfittig for MLP regressio models. Rykiewicz, J. Abstract Multilayer perceptros (MLP) with oe hidde layer have bee used for a log time to deal with o-liear
More information6.3 Testing Series With Positive Terms
6.3. TESTING SERIES WITH POSITIVE TERMS 307 6.3 Testig Series With Positive Terms 6.3. Review of what is kow up to ow I theory, testig a series a i for covergece amouts to fidig the i= sequece of partial
More informationLecture 20: Multivariate convergence and the Central Limit Theorem
Lecture 20: Multivariate covergece ad the Cetral Limit Theorem Covergece i distributio for radom vectors Let Z,Z 1,Z 2,... be radom vectors o R k. If the cdf of Z is cotiuous, the we ca defie covergece
More informationA note on self-normalized Dickey-Fuller test for unit root in autoregressive time series with GARCH errors
Appl. Math. J. Chiese Uiv. 008, 3(): 97-0 A ote o self-ormalized Dickey-Fuller test for uit root i autoregressive time series with GARCH errors YANG Xiao-rog ZHANG Li-xi Abstract. I this article, the uit
More informationBerry-Esseen bounds for self-normalized martingales
Berry-Essee bouds for self-ormalized martigales Xiequa Fa a, Qi-Ma Shao b a Ceter for Applied Mathematics, Tiaji Uiversity, Tiaji 30007, Chia b Departmet of Statistics, The Chiese Uiversity of Hog Kog,
More informationChapter 3. Strong convergence. 3.1 Definition of almost sure convergence
Chapter 3 Strog covergece As poited out i the Chapter 2, there are multiple ways to defie the otio of covergece of a sequece of radom variables. That chapter defied covergece i probability, covergece i
More informationA RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS
J. Japa Statist. Soc. Vol. 41 No. 1 2011 67 73 A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS Yoichi Nishiyama* We cosider k-sample ad chage poit problems for idepedet data i a
More informationKolmogorov-Smirnov type Tests for Local Gaussianity in High-Frequency Data
Proceedigs 59th ISI World Statistics Cogress, 5-30 August 013, Hog Kog (Sessio STS046) p.09 Kolmogorov-Smirov type Tests for Local Gaussiaity i High-Frequecy Data George Tauche, Duke Uiversity Viktor Todorov,
More informationJournal of Multivariate Analysis. Superefficient estimation of the marginals by exploiting knowledge on the copula
Joural of Multivariate Aalysis 102 (2011) 1315 1319 Cotets lists available at ScieceDirect Joural of Multivariate Aalysis joural homepage: www.elsevier.com/locate/jmva Superefficiet estimatio of the margials
More informationAsymptotic Results for the Linear Regression Model
Asymptotic Results for the Liear Regressio Model C. Fli November 29, 2000 1. Asymptotic Results uder Classical Assumptios The followig results apply to the liear regressio model y = Xβ + ε, where X is
More informationECE 330:541, Stochastic Signals and Systems Lecture Notes on Limit Theorems from Probability Fall 2002
ECE 330:541, Stochastic Sigals ad Systems Lecture Notes o Limit Theorems from robability Fall 00 I practice, there are two ways we ca costruct a ew sequece of radom variables from a old sequece of radom
More information7.1 Convergence of sequences of random variables
Chapter 7 Limit theorems Throughout this sectio we will assume a probability space (Ω, F, P), i which is defied a ifiite sequece of radom variables (X ) ad a radom variable X. The fact that for every ifiite
More informationResampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.
Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator
More informationTESTING FOR THE BUFFERED AUTOREGRESSIVE PROCESSES (SUPPLEMENTARY MATERIAL)
TESTING FOR THE BUFFERED AUTOREGRESSIVE PROCESSES SUPPLEMENTARY MATERIAL) By Ke Zhu, Philip L.H. Yu ad Wai Keug Li Chiese Academy of Scieces ad Uiversity of Hog Kog APPENDIX: PROOFS I this appedix, we
More informationMASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS
MASSACHUSTTS INSTITUT OF TCHNOLOGY 6.436J/5.085J Fall 2008 Lecture 9 /7/2008 LAWS OF LARG NUMBRS II Cotets. The strog law of large umbers 2. The Cheroff boud TH STRONG LAW OF LARG NUMBRS While the weak
More informationExplicit Maximal and Minimal Curves over Finite Fields of Odd Characteristics
Explicit Maximal ad Miimal Curves over Fiite Fields of Odd Characteristics Ferruh Ozbudak, Zülfükar Saygı To cite this versio: Ferruh Ozbudak, Zülfükar Saygı. Explicit Maximal ad Miimal Curves over Fiite
More informationECE 901 Lecture 12: Complexity Regularization and the Squared Loss
ECE 90 Lecture : Complexity Regularizatio ad the Squared Loss R. Nowak 5/7/009 I the previous lectures we made use of the Cheroff/Hoeffdig bouds for our aalysis of classifier errors. Hoeffdig s iequality
More information11 THE GMM ESTIMATION
Cotets THE GMM ESTIMATION 2. Cosistecy ad Asymptotic Normality..................... 3.2 Regularity Coditios ad Idetificatio..................... 4.3 The GMM Iterpretatio of the OLS Estimatio.................
More informationFirst Year Quantitative Comp Exam Spring, Part I - 203A. f X (x) = 0 otherwise
First Year Quatitative Comp Exam Sprig, 2012 Istructio: There are three parts. Aswer every questio i every part. Questio I-1 Part I - 203A A radom variable X is distributed with the margial desity: >
More information3. Z Transform. Recall that the Fourier transform (FT) of a DT signal xn [ ] is ( ) [ ] = In order for the FT to exist in the finite magnitude sense,
3. Z Trasform Referece: Etire Chapter 3 of text. Recall that the Fourier trasform (FT) of a DT sigal x [ ] is ω ( ) [ ] X e = j jω k = xe I order for the FT to exist i the fiite magitude sese, S = x [
More informationLet us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.
Lecture 5 Let us give oe more example of MLE. Example 3. The uiform distributio U[0, ] o the iterval [0, ] has p.d.f. { 1 f(x =, 0 x, 0, otherwise The likelihood fuctio ϕ( = f(x i = 1 I(X 1,..., X [0,
More informationChapter 13: Tests of Hypothesis Section 13.1 Introduction
Chapter 13: Tests of Hypothesis Sectio 13.1 Itroductio RECAP: Chapter 1 discussed the Likelihood Ratio Method as a geeral approach to fid good test procedures. Testig for the Normal Mea Example, discussed
More informationLecture 33: Bootstrap
Lecture 33: ootstrap Motivatio To evaluate ad compare differet estimators, we eed cosistet estimators of variaces or asymptotic variaces of estimators. This is also importat for hypothesis testig ad cofidece
More informationAsymptotic distribution of products of sums of independent random variables
Proc. Idia Acad. Sci. Math. Sci. Vol. 3, No., May 03, pp. 83 9. c Idia Academy of Scieces Asymptotic distributio of products of sums of idepedet radom variables YANLING WANG, SUXIA YAO ad HONGXIA DU ollege
More informationOn the convergence rates of Gladyshev s Hurst index estimator
Noliear Aalysis: Modellig ad Cotrol, 2010, Vol 15, No 4, 445 450 O the covergece rates of Gladyshev s Hurst idex estimator K Kubilius 1, D Melichov 2 1 Istitute of Mathematics ad Iformatics, Vilius Uiversity
More informationComparison Study of Series Approximation. and Convergence between Chebyshev. and Legendre Series
Applied Mathematical Scieces, Vol. 7, 03, o. 6, 3-337 HIKARI Ltd, www.m-hikari.com http://d.doi.org/0.988/ams.03.3430 Compariso Study of Series Approimatio ad Covergece betwee Chebyshev ad Legedre Series
More informationSequences and Series of Functions
Chapter 6 Sequeces ad Series of Fuctios 6.1. Covergece of a Sequece of Fuctios Poitwise Covergece. Defiitio 6.1. Let, for each N, fuctio f : A R be defied. If, for each x A, the sequece (f (x)) coverges
More informationThe central limit theorem for Student s distribution. Problem Karim M. Abadir and Jan R. Magnus. Econometric Theory, 19, 1195 (2003)
The cetral limit theorem for Studet s distributio Problem 03.6.1 Karim M. Abadir ad Ja R. Magus Ecoometric Theory, 19, 1195 (003) Z Ecoometric Theory, 19, 003, 1195 1198+ Prited i the Uited States of America+
More informationKernel density estimator
Jauary, 07 NONPARAMETRIC ERNEL DENSITY ESTIMATION I this lecture, we discuss kerel estimatio of probability desity fuctios PDF Noparametric desity estimatio is oe of the cetral problems i statistics I
More informationEFFECTIVE WLLN, SLLN, AND CLT IN STATISTICAL MODELS
EFFECTIVE WLLN, SLLN, AND CLT IN STATISTICAL MODELS Ryszard Zieliński Ist Math Polish Acad Sc POBox 21, 00-956 Warszawa 10, Polad e-mail: rziel@impagovpl ABSTRACT Weak laws of large umbers (W LLN), strog
More informationIntroduction to Extreme Value Theory Laurens de Haan, ISM Japan, Erasmus University Rotterdam, NL University of Lisbon, PT
Itroductio to Extreme Value Theory Laures de Haa, ISM Japa, 202 Itroductio to Extreme Value Theory Laures de Haa Erasmus Uiversity Rotterdam, NL Uiversity of Lisbo, PT Itroductio to Extreme Value Theory
More informationLecture 4. We also define the set of possible values for the random walk as the set of all x R d such that P(S n = x) > 0 for some n.
Radom Walks ad Browia Motio Tel Aviv Uiversity Sprig 20 Lecture date: Mar 2, 20 Lecture 4 Istructor: Ro Peled Scribe: Lira Rotem This lecture deals primarily with recurrece for geeral radom walks. We preset
More informationLecture 8: Convergence of transformations and law of large numbers
Lecture 8: Covergece of trasformatios ad law of large umbers Trasformatio ad covergece Trasformatio is a importat tool i statistics. If X coverges to X i some sese, we ofte eed to check whether g(x ) coverges
More informationCoefficient of variation and Power Pen s parade computation
Coefficiet of variatio ad Power Pe s parade computatio Jules Sadefo Kamdem To cite this versio: Jules Sadefo Kamdem. Coefficiet of variatio ad Power Pe s parade computatio. 20. HAL Id: hal-0058658
More informationNYU Center for Data Science: DS-GA 1003 Machine Learning and Computational Statistics (Spring 2018)
NYU Ceter for Data Sciece: DS-GA 003 Machie Learig ad Computatioal Statistics (Sprig 208) Brett Berstei, David Roseberg, Be Jakubowski Jauary 20, 208 Istructios: Followig most lab ad lecture sectios, we
More informationInvariant relations between binary Goldbach s decompositions numbers coded in a 4 letters language
Ivariat relatios betwee biary Goldbach s decompositios umbers coded i a letters laguage Deise Vella-Chemla To cite this versio: Deise Vella-Chemla. Ivariat relatios betwee biary Goldbach s decompositios
More informationDiagonal approximations by martingales
Alea 7, 257 276 200 Diagoal approximatios by martigales Jaa Klicarová ad Dalibor Volý Faculty of Ecoomics, Uiversity of South Bohemia, Studetsa 3, 370 05, Cese Budejovice, Czech Republic E-mail address:
More informationMachine Learning Brett Bernstein
Machie Learig Brett Berstei Week Lecture: Cocept Check Exercises Starred problems are optioal. Statistical Learig Theory. Suppose A = Y = R ad X is some other set. Furthermore, assume P X Y is a discrete
More informationOptimally Sparse SVMs
A. Proof of Lemma 3. We here prove a lower boud o the umber of support vectors to achieve geeralizatio bouds of the form which we cosider. Importatly, this result holds ot oly for liear classifiers, but
More informationEntropy Rates and Asymptotic Equipartition
Chapter 29 Etropy Rates ad Asymptotic Equipartitio Sectio 29. itroduces the etropy rate the asymptotic etropy per time-step of a stochastic process ad shows that it is well-defied; ad similarly for iformatio,
More informationBull. Korean Math. Soc. 36 (1999), No. 3, pp. 451{457 THE STRONG CONSISTENCY OF NONLINEAR REGRESSION QUANTILES ESTIMATORS Seung Hoe Choi and Hae Kyung
Bull. Korea Math. Soc. 36 (999), No. 3, pp. 45{457 THE STRONG CONSISTENCY OF NONLINEAR REGRESSION QUANTILES ESTIMATORS Abstract. This paper provides suciet coditios which esure the strog cosistecy of regressio
More informationINFINITE SEQUENCES AND SERIES
11 INFINITE SEQUENCES AND SERIES INFINITE SEQUENCES AND SERIES 11.4 The Compariso Tests I this sectio, we will lear: How to fid the value of a series by comparig it with a kow series. COMPARISON TESTS
More informationSolution to Chapter 2 Analytical Exercises
Nov. 25, 23, Revised Dec. 27, 23 Hayashi Ecoometrics Solutio to Chapter 2 Aalytical Exercises. For ay ε >, So, plim z =. O the other had, which meas that lim E(z =. 2. As show i the hit, Prob( z > ε =
More informationProperties and Hypothesis Testing
Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.
More informationTHE STRONG LAW OF LARGE NUMBERS FOR STATIONARY SEQUENCES
THE STRONG LAW OF LARGE NUMBERS FOR STATIONARY SEQUENCES Debdeep Pati Idia Statistical Istitute, Kolkata Jue 26, 2006 Abstract The traditioal proof of the strog law of large umbers usig idepedet ad idetically
More information5 Birkhoff s Ergodic Theorem
5 Birkhoff s Ergodic Theorem Amog the most useful of the various geeralizatios of KolmogorovâĂŹs strog law of large umbers are the ergodic theorems of Birkhoff ad Kigma, which exted the validity of the
More informationApproximation by Superpositions of a Sigmoidal Function
Zeitschrift für Aalysis ud ihre Aweduge Joural for Aalysis ad its Applicatios Volume 22 (2003, No. 2, 463 470 Approximatio by Superpositios of a Sigmoidal Fuctio G. Lewicki ad G. Mario Abstract. We geeralize
More informationMATHEMATICAL SCIENCES PAPER-II
MATHEMATICAL SCIENCES PAPER-II. Let {x } ad {y } be two sequeces of real umbers. Prove or disprove each of the statemets :. If {x y } coverges, ad if {y } is coverget, the {x } is coverget.. {x + y } coverges
More informationAn almost sure invariance principle for trimmed sums of random vectors
Proc. Idia Acad. Sci. Math. Sci. Vol. 20, No. 5, November 200, pp. 6 68. Idia Academy of Scieces A almost sure ivariace priciple for trimmed sums of radom vectors KE-ANG FU School of Statistics ad Mathematics,
More informationChapter 7 Isoperimetric problem
Chapter 7 Isoperimetric problem Recall that the isoperimetric problem (see the itroductio its coectio with ido s proble) is oe of the most classical problem of a shape optimizatio. It ca be formulated
More informationChapter 10: Power Series
Chapter : Power Series 57 Chapter Overview: Power Series The reaso series are part of a Calculus course is that there are fuctios which caot be itegrated. All power series, though, ca be itegrated because
More informationSummary and Discussion on Simultaneous Analysis of Lasso and Dantzig Selector
Summary ad Discussio o Simultaeous Aalysis of Lasso ad Datzig Selector STAT732, Sprig 28 Duzhe Wag May 4, 28 Abstract This is a discussio o the work i Bickel, Ritov ad Tsybakov (29). We begi with a short
More informationLimit distributions for products of sums
Statistics & Probability Letters 62 (23) 93 Limit distributios for products of sums Yogcheg Qi Departmet of Mathematics ad Statistics, Uiversity of Miesota-Duluth, Campus Ceter 4, 7 Uiversity Drive, Duluth,
More information1 = δ2 (0, ), Y Y n nδ. , T n = Y Y n n. ( U n,k + X ) ( f U n,k + Y ) n 2n f U n,k + θ Y ) 2 E X1 2 X1
8. The cetral limit theorems 8.1. The cetral limit theorem for i.i.d. sequeces. ecall that C ( is N -separatig. Theorem 8.1. Let X 1, X,... be i.i.d. radom variables with EX 1 = ad EX 1 = σ (,. Suppose
More informationMAT1026 Calculus II Basic Convergence Tests for Series
MAT026 Calculus II Basic Covergece Tests for Series Egi MERMUT 202.03.08 Dokuz Eylül Uiversity Faculty of Sciece Departmet of Mathematics İzmir/TURKEY Cotets Mootoe Covergece Theorem 2 2 Series of Real
More informationMathematical Statistics - MS
Paper Specific Istructios. The examiatio is of hours duratio. There are a total of 60 questios carryig 00 marks. The etire paper is divided ito three sectios, A, B ad C. All sectios are compulsory. Questios
More informationNotes 19 : Martingale CLT
Notes 9 : Martigale CLT Math 733-734: Theory of Probability Lecturer: Sebastie Roch Refereces: [Bil95, Chapter 35], [Roc, Chapter 3]. Sice we have ot ecoutered weak covergece i some time, we first recall
More informationAn Introduction to Asymptotic Theory
A Itroductio to Asymptotic Theory Pig Yu School of Ecoomics ad Fiace The Uiversity of Hog Kog Pig Yu (HKU) Asymptotic Theory 1 / 20 Five Weapos i Asymptotic Theory Five Weapos i Asymptotic Theory Pig Yu
More informationThis section is optional.
4 Momet Geeratig Fuctios* This sectio is optioal. The momet geeratig fuctio g : R R of a radom variable X is defied as g(t) = E[e tx ]. Propositio 1. We have g () (0) = E[X ] for = 1, 2,... Proof. Therefore
More informationDimension-free PAC-Bayesian bounds for the estimation of the mean of a random vector
Dimesio-free PAC-Bayesia bouds for the estimatio of the mea of a radom vector Olivier Catoi CREST CNRS UMR 9194 Uiversité Paris Saclay olivier.catoi@esae.fr Ilaria Giulii Laboratoire de Probabilités et
More informationSlide Set 13 Linear Model with Endogenous Regressors and the GMM estimator
Slide Set 13 Liear Model with Edogeous Regressors ad the GMM estimator Pietro Coretto pcoretto@uisa.it Ecoometrics Master i Ecoomics ad Fiace (MEF) Uiversità degli Studi di Napoli Federico II Versio: Friday
More informationEECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1
EECS564 Estimatio, Filterig, ad Detectio Hwk 2 Sols. Witer 25 4. Let Z be a sigle observatio havig desity fuctio where. p (z) = (2z + ), z (a) Assumig that is a oradom parameter, fid ad plot the maximum
More informationSelf-normalized deviation inequalities with application to t-statistic
Self-ormalized deviatio iequalities with applicatio to t-statistic Xiequa Fa Ceter for Applied Mathematics, Tiaji Uiversity, 30007 Tiaji, Chia Abstract Let ξ i i 1 be a sequece of idepedet ad symmetric
More informationPAPER : IIT-JAM 2010
MATHEMATICS-MA (CODE A) Q.-Q.5: Oly oe optio is correct for each questio. Each questio carries (+6) marks for correct aswer ad ( ) marks for icorrect aswer.. Which of the followig coditios does NOT esure
More informationPrecise Rates in Complete Moment Convergence for Negatively Associated Sequences
Commuicatios of the Korea Statistical Society 29, Vol. 16, No. 5, 841 849 Precise Rates i Complete Momet Covergece for Negatively Associated Sequeces Dae-Hee Ryu 1,a a Departmet of Computer Sciece, ChugWoo
More informationb i u x i U a i j u x i u x j
M ath 5 2 7 Fall 2 0 0 9 L ecture 1 9 N ov. 1 6, 2 0 0 9 ) S ecod- Order Elliptic Equatios: Weak S olutios 1. Defiitios. I this ad the followig two lectures we will study the boudary value problem Here
More informationLecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting
Lecture 6 Chi Square Distributio (χ ) ad Least Squares Fittig Chi Square Distributio (χ ) Suppose: We have a set of measuremets {x 1, x, x }. We kow the true value of each x i (x t1, x t, x t ). We would
More informationEstimation of the Mean and the ACVF
Chapter 5 Estimatio of the Mea ad the ACVF A statioary process {X t } is characterized by its mea ad its autocovariace fuctio γ ), ad so by the autocorrelatio fuctio ρ ) I this chapter we preset the estimators
More informationAverage Number of Real Zeros of Random Fractional Polynomial-II
Average Number of Real Zeros of Radom Fractioal Polyomial-II K Kadambavaam, PG ad Research Departmet of Mathematics, Sri Vasavi College, Erode, Tamiladu, Idia M Sudharai, Departmet of Mathematics, Velalar
More information1 Convergence in Probability and the Weak Law of Large Numbers
36-752 Advaced Probability Overview Sprig 2018 8. Covergece Cocepts: i Probability, i L p ad Almost Surely Istructor: Alessadro Rialdo Associated readig: Sec 2.4, 2.5, ad 4.11 of Ash ad Doléas-Dade; Sec
More informationA note on the sum of uniform random variables
A ote o the sum of uiform radom variables Aiello Buoocore, Erica Pirozzi, Luigia Caputo To cite this versio: Aiello Buoocore, Erica Pirozzi, Luigia Caputo. A ote o the sum of uiform radom variables. Statistics
More informationMa 530 Introduction to Power Series
Ma 530 Itroductio to Power Series Please ote that there is material o power series at Visual Calculus. Some of this material was used as part of the presetatio of the topics that follow. What is a Power
More informationLecture 3. Properties of Summary Statistics: Sampling Distribution
Lecture 3 Properties of Summary Statistics: Samplig Distributio Mai Theme How ca we use math to justify that our umerical summaries from the sample are good summaries of the populatio? Lecture Summary
More informationNANYANG TECHNOLOGICAL UNIVERSITY SYLLABUS FOR ENTRANCE EXAMINATION FOR INTERNATIONAL STUDENTS AO-LEVEL MATHEMATICS
NANYANG TECHNOLOGICAL UNIVERSITY SYLLABUS FOR ENTRANCE EXAMINATION FOR INTERNATIONAL STUDENTS AO-LEVEL MATHEMATICS STRUCTURE OF EXAMINATION PAPER. There will be oe 2-hour paper cosistig of 4 questios.
More information1 Covariance Estimation
Eco 75 Lecture 5 Covariace Estimatio ad Optimal Weightig Matrices I this lecture, we cosider estimatio of the asymptotic covariace matrix B B of the extremum estimator b : Covariace Estimatio Lemma 4.
More informationThe random version of Dvoretzky s theorem in l n
The radom versio of Dvoretzky s theorem i l Gideo Schechtma Abstract We show that with high probability a sectio of the l ball of dimesio k cε log c > 0 a uiversal costat) is ε close to a multiple of the
More informationLecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting
Lecture 6 Chi Square Distributio (χ ) ad Least Squares Fittig Chi Square Distributio (χ ) Suppose: We have a set of measuremets {x 1, x, x }. We kow the true value of each x i (x t1, x t, x t ). We would
More informationNotes 27 : Brownian motion: path properties
Notes 27 : Browia motio: path properties Math 733-734: Theory of Probability Lecturer: Sebastie Roch Refereces:[Dur10, Sectio 8.1], [MP10, Sectio 1.1, 1.2, 1.3]. Recall: DEF 27.1 (Covariace) Let X = (X
More information( θ. sup θ Θ f X (x θ) = L. sup Pr (Λ (X) < c) = α. x : Λ (x) = sup θ H 0. sup θ Θ f X (x θ) = ) < c. NH : θ 1 = θ 2 against AH : θ 1 θ 2
82 CHAPTER 4. MAXIMUM IKEIHOOD ESTIMATION Defiitio: et X be a radom sample with joit p.m/d.f. f X x θ. The geeralised likelihood ratio test g.l.r.t. of the NH : θ H 0 agaist the alterative AH : θ H 1,
More informationEcon 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.
Eco 325/327 Notes o Sample Mea, Sample Proportio, Cetral Limit Theorem, Chi-square Distributio, Studet s t distributio 1 Sample Mea By Hiro Kasahara We cosider a radom sample from a populatio. Defiitio
More informationLECTURE 8: ASYMPTOTICS I
LECTURE 8: ASYMPTOTICS I We are iterested i the properties of estimators as. Cosider a sequece of radom variables {, X 1}. N. M. Kiefer, Corell Uiversity, Ecoomics 60 1 Defiitio: (Weak covergece) A sequece
More informationLecture 6 Simple alternatives and the Neyman-Pearson lemma
STATS 00: Itroductio to Statistical Iferece Autum 06 Lecture 6 Simple alteratives ad the Neyma-Pearso lemma Last lecture, we discussed a umber of ways to costruct test statistics for testig a simple ull
More informationDetailed proofs of Propositions 3.1 and 3.2
Detailed proofs of Propositios 3. ad 3. Proof of Propositio 3. NB: itegratio sets are geerally omitted for itegrals defied over a uit hypercube [0, s with ay s d. We first give four lemmas. The proof of
More information