THE ADAPTIVE LASSSO UNDER A GENERALIZED SPARSITY CONDITION. Joel L. Horowitz Department of Economics Northwestern University Evanston, IL

Size: px
Start display at page:

Download "THE ADAPTIVE LASSSO UNDER A GENERALIZED SPARSITY CONDITION. Joel L. Horowitz Department of Economics Northwestern University Evanston, IL"

Transcription

1 THE ADAPTIVE LASSSO UNDER A GENERALIZED SPARSITY CONDITION by Joel L. Horowitz Departmet of Ecoomic Northweter Uiverity Evato, IL 68 ad Jia Huag Departmet of Statitic ad Actuarial Sciece Uiverity of Iowa Iowa City, IA 54 October ABSTRACT We coider etimatio of a liear model i which a few coefficiet are large ad may be obect of ubtative iteret, wherea other are mall but ot ecearily zero. The umber of mall coefficiet may exceed the ample ize. It i ot kow which coefficiet are large ad which are mall. The large coefficiet ca be etimated with a maller mea-quare error if the mall coefficiet ca be idetified ad the covariate aociated with them dropped from the model. We how that the adaptive LASSO ditiguihe correctly betwee large ad mall coefficiet with probability approachig a the ample ize icreae. The reult of Mote Carlo experimet ad a empirical example how that the adaptive LASSO reduce the mea-quare error of the etimate of the large coefficiet eve i quite mall ample. Key word: Pealized regreio, high-dimeioal data, variable electio. AMS ubect claificatio: Primary 6J5, 6J7; ecodary 6E, 6F The reearch of Joel L. Horowitz wa upported i part by NSF grat SES-8755.

2 THE ADAPTIVE LASSSO UNDER A GENERALIZED SPARSITY CONDITION. Itroductio We coider the liear mea-regreio model () p Y = x β + ε ; i=,..., i i i = where Yi i the repoe variable, the x i are fixed covariate, ad the ε i are uoberved mea-zero radom variable. We aume without lo of geerality that the data are cetered, o there i o itercept i the model, ad that = for each =,..., p. The umber of x i= i covariate ( p ) may be large, poibly larger tha the ample ize ( ). Our iteret i i idetifyig ad etimatig the coefficiet β that are large i a ee that will be defied. We aume that the umber of large β i maller tha. The remaiig coefficiet are defied to be mall. Their value are mall i a ee that will be explaied, but they are ot ecearily zero. We give coditio uder which the adaptive LASSO procedure ditiguihe correctly betwee covariate with large ad mall coefficiet with probability approachig oe a. The adaptive LASSO alo provide coitet etimator of the large coefficiet. Specifically, we coider a geeralized parity coditio (GSC; Zhag ad Huag 8). Deote A {,..., p} a the et of mall coefficiet. Uder the GSC, () β η, A where { η } i a equece of o-egative cotat. Thi aumptio i weaker tha the oe commoly made i the literature, which i that (3) β = if A. Note that (3) i a pecial cae of the GSC. I practice, the GSC ca be a more realitic formulatio of pare model tha i (). Let A deote the complemet of A. We defie the elemet of A to be large coefficiet. We defie a covariate to be importat if it coefficiet i i A ad uimportat if it coefficiet i i A. A importat pecial cae occur whe the umber of large coefficiet i fixed, / η = o ( ), ad / mi A β a. I other word, the mall coefficiet are / maller tha O ( ) ad the large coefficiet are larger tha O ( / ) a. I thi cae, the mea-quare etimatio error of the large coefficiet are maller if all the uimportat

3 covariate are excluded from the model tha if ay of the uimportat covariate i icluded. Thu, if the obective i to etimate oe or more large coefficiet, it i better to drop the uimportat covariate from the model. For example, i a ecoomic wage equatio, the depedet variable i the logarithm of a idividual weekly wage, ad the obect of iteret i applicatio iclude the coefficiet of covariate uch a a idividual year of educatio, year of labor-force experiece, marital tatu, ad labor uio memberhip. However, widely available data et for etimatig wage equatio ca cotai hudred or eve thouad of variable that may be weakly related to wage. It i ot clear a priori which of thee variable hould be icluded i a wage equatio, though it i clear that icludig all of them will reult i very imprecie etimate of the coefficiet of iteret. Thi illutrate the uefule of a ytematic method for dicrimiatig betwee covariate with large ad mall coefficiet. We give coditio uder which the adaptive LASSO doe thi with probability approachig a. LASSO-type pealizatio method for model electio (Tibhirai 996) have attracted much attetio i recet year. There i alo a large literature o the ue of LASSO for the related problem of predictio (ee, e.g., Greehtei ad Ritov (4) ad Bickel, Ritov, ad Tybakov 9), but here we are cocered oly with model electio. Meihaue ad Bühlma (6) ad Zhao ad Yu (6) howed that, uder a trog irrepreetable coditio o the deig matrix, the LASSO i model-electio coitet i high-dimeioal ettig. Zhag (9) gave coditio uder which the LASSO combied with a threholdig procedure coitetly ditiguihe betwee coefficiet that are zero ad coefficiet whoe magitude a exceed for ome < /. Zou (6) gave coditio uder which the adaptive LASSO i model-electio coitet whe the umber of covariate i fixed. Huag, Ma, ad Zhag (8) provided coditio uder which the adaptive LASSO i model-electio coitet eve whe the umber of covariate i a large a ex p( ) for ome a (,). Kight ad Fu () ad Huag, Horowitz, ad Ma (8) etablihed model-electio coitecy of bridge etimator. Other pealizatio approache have bee ivetigated by Fa ad Li (); Fa ad Peg (4); Fa, Peg, ad Huag (5); Lv ad Fa (9); ad Zou ad Zhag (9). that the mall The foregoig model-electio procedure aume that the large β are o-zero ad β are exactly zero. I a recet paper, Zhag ad Huag (8) tudied the a electio propertie of the LASSO uder the GSC whe p >. They howed that the LASSO elect a model that iclude all the covariate with large coefficiet ad ha the right order of dimeioality. However, i geeral, the LASSO alo iclude ome covariate with mall

4 coefficiet. Thu, for example, the LASSO ted to elect a model that i too large whe the large coefficiet are larger ad the mall coefficiet are maller tha O ( ). Zhag (9) gave coditio uder which the LASSO combied with a threholdig procedure correctly elect coefficiet that are ufficietly far from zero. However, Zhag procedure require a uer-elected threhold, ad it i ot clear how to chooe thi threhold i applicatio. I thi paper, we how that with probability approachig oe a, the adaptive LASSO correctly ditiguihe betwee large ad mall coefficiet uder the GSC. No uerelected threhold i eeded. Sectio of thi paper decribe the adaptive LASSO. Sectio 3 preet it aymptotic propertie uder the GSC. Sectio 4 preet the reult of a Mote Carlo ivetigatio of the umerical performace of the adaptive LASSO. Sectio 5 preet a empirical example, ad Sectio 6 preet cocludig commet. The proof of theorem are i the appedix, which i Sectio 7. /. The Adaptive LASSO Defie = ( Y,..., Y ). Let y x = ( x,..., x ) deote the vector of value of the th covariate, ad let X = ( x,..., x p ) deote the deig matrix. Let β = ( β,..., β p ), ad let β deote the true but ukow value of β. The LASSO obective fuctio i p = + = L ( β; λ ).5 y Xβ λ β, where λ i the pealty parameter. The LASSO etimator i defied a β ( λ ) = argmi β ( β; λ ). L The adaptive LASSO obective fuctio i (3) p = + = L ( β; λ ).5 y Xβ λ w β, where λ i the pealty parameter. The weight w are w = β, where β i the th compoet of β ( λ ). The adaptive LASSO etimator i defied a ˆ ( ) argmi ( ; ) β λ = β L β λ. We defie w = whe β =, ad we et =. Miimizatio of (3) reult i ˆ β = if w =. Thu, if a variable i ot elected by the LASSO, it i ot elected by the adaptive LASSO. 3

5 Uder coditio that are tated i Sectio 3, the LASSO elect (aymptotically) all coefficiet that exceed a certai threhold. However, the LASSO alo ted to elect coefficiet that are below the threhold. The adaptive LASSO i a way to correct LASSO over-electio problem. 3. Aymptotic Propertie For ay A {,..., p}, let X = { x : A} ad C = X X /. Defie A = m ν = A A A A cmi ( m) = mimiν CAν, cmi ( m) = maxmaxν CAν, A = m ν = where A i the umber of elemet of A ad i the orm. We ay that the covariate matrix X atifie the pare Riez coditio (SRC) with rak q ad pectrum boud c c < * < * < if (5) c c ( q) c ( q) c* Awith A = q ad ν q. * mi max Uder (5), all the eigevalue of C are cotaied i the iterval [ c, c *] whe A q. We make the followig aumptio. A (A) The radom variable ε, ε,... are idepedetly ad idetically ditributed with mea. * There are cotat C > ad K > uch that ad i =,,... P( ε > z) Kexp( Cz ) for all i z (A) There i a fiite cotat c > uch that η c qλ /. (A3) The SRC hold. Coditio (A) require the ε i to have ubgauia tail. Coditio (A) defie the cla of mall coefficiet. Let A = { : β ( λ ) } be the et of coefficiet etimated to be o-zero by the LASSO. The followig lemma, which i proved i Zhag ad Huag (8), ummarize importat propertie of A ad β. Lemma : Let (A)-(A3) hold, ad let λ = O( log p). The there are fiite cotat M ad M uch that (i) A M q with probability approachig a. (ii) All covariate with β > M qλ /( c c* ) are elected with probability approachig a. * 4

6 (iii) p β β =O ( h ), where h = q(log p) /. Lemma how that with high probability, the umber of covariate elected by the LASSO i a fiite multiple of the umber of covariate i A (that i, of the umber of covariate with large coefficiet). Moreover, all covariate exceedig the threhold i (ii) are elected with probability approachig a are elected with probability approachig if. I particular, all of the covariate with large coefficiet η = o( q(log p)/ ) ad the large β are larger tha O( q(log p) / ). I additio, the LASSO etimator i etimatio coitet. However, etimatio coitecy doe ot imply model-electio coitecy. We ow give coditio uder which the adaptive LASSO achieve model-electio coitecy. Deote the mallet ad larget eigevalue of repectively. Make the followig additioal aumptio. C = X X by τ ad τ, A A A / (A4) There are cotat < τ τ < uch that τ τ τ τ for all ufficietly large. (A5) atify Let b = mi β. A, the otochatic quatitie q, η,, λ, ad A λ q q( h + η) η( h + η) + +. b b λ h I a pare model, A i much maller tha ad may eve be fixed a, o it i reaoable to aume i (A4) that the eigevalue of (A5) retrict q, η, too mall ad that the b C A are bouded away from ad. λ,ad. It require, the mallet of the large coefficiet, to be ot b b orm of the mall coefficiet i ot too large. I particular, it require b qη. I other word, there mut be eough eparatio betwee the large ad mall coefficiet for the adaptive LASSO to ditiguih betwee them. Now defie ˆ β = { ˆ β : A ad A } βa = { β : A }. For ay vector u = ( u, u,...), defie g( u) = (g( u),g( u ),...), where g( u ) =,, or accordig to whether u <, u =, or u >. Theorem : Let (A)-(A5) hold. The a, P( ˆ β = A ) ad P(g( ˆ β ) = g( β )). A A 5

7 Thu, with probability approachig a, the adaptive LASSO elect all the covariate with large coefficiet ad drop the covariate with mall coefficiet i the ee that it et the coefficiet of thoe covariate equal to zero. 3. A importat pecial cae I the ocial ciece, it i ot uuual for a urvey data et to cotai thouad of obervatio ad hudred or thouad of variable that are arguably related to the depedet variable of iteret i the ee of havig o-zero β coefficiet i (). However, i typical applicatio, mot of thee coefficiet are thought to be mall i the ee of havig magitude ad effect o the depedet variable that are maller tha the radom amplig error of their etimate. The large coefficiet are typically few i umber relative to the ample ize. If, a ofte happe, the total umber of covariate i le tha the ample ize, thi ettig ca be repreeted by a model i which p ad q are fixed a, the mall / coefficiet atify η = o ( ), ad the large coefficiet atify b κ (log ) / a for ome cotat κ >. It follow from Theorem with λ log that a, the adaptive LASSO etimate of the large coefficiet are o-zero ad the etimate of the mall coefficiet are zero. Moreover, a traightforward calculatio how that the mea-quare error (MSE) of the adaptive LASSO etimator of each large coefficiet i ever larger ad, except i pecial cae, i trictly maller tha the MSE of the ordiary leat quare (OLS) etimator that i obtaied whe all covariate are icluded i (). Thu, the adaptive LASSO improve the preciio of the etimate of the large coefficiet. If p > but the umber of large coefficiet i fixed at q, the we ca coider a model i which the mall coefficiet atify η = O[/ log p], the large coefficiet atify / b κ (log p) / for ome cotat κ >, ad λ log p. The it follow agai from Theorem that a, the adaptive LASSO etimate of the large coefficiet are o-zero ad the etimate of the mall coefficiet are zero. Moreover, the MSE of the adaptive LASSO etimator of each large coefficiet i o larger ad, except i pecial cae, i trictly maller tha MSE of the OLS etimator that i obtaied by icludig i the model ay group of up to q uimportat covariate or liear combiatio of uimportat covariate. I ummary, the adaptive LASSO etimator reduce the MSE of the etimator of ay large coefficiet if there i ufficiet eparatio betwee the magitude of the large ad mall coefficiet. 6

8 4. Mote Carlo Experimet Thi ectio report the reult of a Mote Carlo ivetigatio of the fiite-ample performace of the LASSO ad adaptive LASSO whe the mall coefficiet are ot ecearily zero. We write model () i the form d Y = β X + β X + ε ; i=,..., i i i i = = d+ p, where β,..., β d are large coefficiet ad the coefficiet βd+,..., β p are mall or zero. The radom variable ε i are idepedetly ditributed a repeated ample ad are cetered ad caled o that ε N(, σ ). The covariate are fixed i i= X i = ; X = ; i=,..., i= The covariate are geerated a follow. Defie / ρ ξi = ζi + νi; i=,..., ; =,..., p/ ρ / i ρ ξi = ζi + νi; i=,..., ; = p/ +,..., p, ρ where the ζ i ad ν i are idepedetly ditributed a (,) ad ρ, ρ <. Defie N The = i; = ( i i= i= ξ ξ ξ ). X i ξi ξ =. ξ Moreover, ρ if k, p/ corr( Xi, Xik ) = ρ if p / <, k < p / ( ρρ ) if p/< k p I the experimet reported here, if d β =.5 if d + p/ p/+ p 7

9 I additio, =, p = 5, σ ε =, ρ =.5, ad ρ =.. The coefficiet of iteret i β. Experimet are reported with pealizatio parameter i obtaied by miimizig the BIC. d =, 4, ad 6 ad with the LASSO ad adaptive LASSO. The Table how the mea-quare error of the etimate of β obtaied from applyig OLS to the full model ad to the model cotaiig oly the variable whoe coefficiet are large (the reduced model). Thee reult are obtaied aalytically uig the algebra of leat quare. They how that the mea-quare error i maller whe β i etimated from the reduced model tha whe it i etimated from the full model. Table how the reult of etimatio uig the LASSO ad adaptive LASSO. There are Mote Carlo replicatio i each experimet. Both verio of the LASSO reduce the mea-quare etimatio error by about a factor of two relative to OLS etimatio with the full model. Not urpriigly, either verio achieve the mea-quare error that i achievable whe the variable with large coefficiet are kow. The model elected by the LASSO ha a higher probability of cotaiig all the importat covariate tha doe the model elected by the adaptive LASSO. If β i the coefficiet of iteret, it i reaoable to coider verio of the LASSO ad adaptive LASSO i which X i ( i=,,., ) i alway i the choe model. Thi ca be achieved by leavig β out of the pealty fuctio. Table 3 how the reult of LASSO ad adaptive LASSO etimatio with β ot i the pealty fuctio. Forcig X i ito the model greatly reduce the mea-quare error of the adaptive LASSO etimator of β. It i eetially the ame a the mea-quare error that i obtaied by applyig OLS to the model with oly the covariate with large coefficiet. 5. A Empirical Example Thi ectio preet a empirical example that illutrate the applicatio of the LASSO ad adaptive LASSO i a ettig where may coefficiet are plauibly mall but o-zero. The applicatio coit of etimatig a wage equatio for black male aged 4-49 year who reide i the ortheater U.S. The data are from the Natioal Logitudial Survey of Youth. There are 6 obervatio. The depedet variable i the logarithm of the hourly wage. There are 4 covariate, icludig core o ectio of the armed force qualificatio examiatio, idicator of educatio level, a variety of peroal characteritic, a biary idicator of marital tatu (married or ot), ad a biary idicator of memberhip i a labor uio. The variable of iteret i thi example are marital tatu ad uio memberhip. Their coefficiet meaure the 8

10 fractioal chage i the wage aociated with beig married or belogig to a labor uio. It i arguable that all of the covariate affect productivity ad, therefore, the hourly wage but that the effect of may covariate may be mall. Applicatio of the LASSO ad adaptive LASSO uig the BIC to elect the pealty parameter reulted i electio of 7 ad 4 covariate, repectively. A aymptotic chi-quare tet doe ot reect the hypothee that the coefficiet of the variable ot elected by the LASSO or adaptive LASSO are zero ( p >.6 ). Thi implie that the value of thee coefficiet are mall eough to be withi radom amplig error of zero. They are ot ecearily equal to zero. Table 4 how the etimate ad aymptotic tadard error of the two coefficiet of iteret that are obtaied from applyig ordiary leat quare to the full model (all 4 covariate), the model elected by the LASSO, ad the model elected by the adaptive LASSO. The three poit etimate of the coefficiet of labor uio memberhip are imilar, but the tadard error of the etimate obtaied from the full model i early twice a large a the tadard error obtaied from the model elected by the LASSO ad adaptive LASSO. The etimate of the coefficiet of marital tatu obtaied from the model elected by the LASSO ad adaptive LASSO are early 4 time a large a the etimate obtaied from the full model, ad the tadard error of the etimate obtaied from the elected model are about 55% of the tadard error obtaied with the full model. 6. Cocluio I applicatio of regreio aalyi, it i ofte the cae that there are may covariate whoe coefficiet are thought to be mall but ot ecearily zero ad relatively few covariate with large coefficiet. I uch ituatio, the preciio of etimatig the large coefficiet ca be icreaed by leavig the covariate with mall coefficiet out of the model. However, it i rarely kow a priori which coefficiet are large ad which are mall. Thi paper ha give coditio uder which the adaptive LASSO correctly ditiguihe betwee large coefficiet ad coefficiet that are mall but ot ecearily zero. Specifically, we have how that uder a geeralized parity coditio ad other mild regularity coditio, the adaptive LASSO correctly ditiguihe betwee large ad mall coefficiet with probability approachig oe a the ample ize icreae. 9

11 7. Proof of Theorem Let ψ ( v) = exp( v ). The ψ -Orlicz orm x ψ of ay radom variable x i defied a x = if{ C > : Eψ ( x / C) }. The Orlicz orm i ueful for obtaiig maximal ψ iequalitie (Va der Vaart ad Weller 996). Lemma : Suppoe that ε,..., ε are iid radom variable with Eε = ad i i Var( ε ) = σ. Suppoe that P( ε > z) Kexp( Cz ) for i=,..., ad cotat C ad K. i The, for all cotat a i atifyig a i= i =, (6) where K / / aiεi K[ σ + ( + K) C ], i= ψ i a cotat. Coequetly (7) g( t) up P aiεi > t exp( t / ) M a a = i= for ome cotat M that deped oly o K ad C. Proof of Theorem : By the Karuh-Kuh-Tucker coditio, uique adaptive LASSO etimator if ˆ β = ( ˆ β,..., ˆ β ) p i the (8) x ( ˆ ) ˆ ˆ y Xβ = λw g( β) if β ( ˆ ) ˆ x y Xβ λw if β = ad the vector { x, ˆ β ) are liearly idepedet. Let = { w g( β ): A } ad A ˆ A = ( X A XA ) ( XA y- λ A ) β A CA XA XA XA A (9) = β + ( ε + β ), λ A where C = X X A A A /. If g( ˆ β ) = g( β ), the (8) hold for ˆ β = ( ˆ β, ), where A A A A A i a vector of zero with legth A. Let how that P[g( ˆ β ) = g( β )]. A A * β = ( β, ). To prove the theorem, it uffice to A A Sice Xβ = X ˆA β for thi ˆ β ad { x : A } are liearly idepedet, ˆ A

12 () ˆ * g( β ) g( ) if g( ˆ βa ) = g( βa ) = β ( ˆ x y β ) λ X A A w A. Let A A A H = I X C X / idetity matrix. From (9), we have () be the proectio oto the ull of y X ˆ = H + X C + H X A. A β A ε λ A A A A β X A, where I i the By () ad (), g( ˆ β ) = g( β ) if * () g( )( ˆ β β β ) < β A x ( Hε + λxa CA A + HXA βa ) λw A. Thu, by (9) ad (), for ay < κ < κ + ν < P{g( ˆ β ) g( β )} * e A A A βa β P { C X X /3 for ome A } e A Aε β + P { C X /3 for ome A} λ e A A β + P { C / 3 for ome A} + P{ x H ε λ w / 3 for ome A } + P { A A A x X C w / 3 for ome A } + P{ x H X β λ w / 3 for ome A } A A PB ( ) + PB ( ) + PB ( ) + PB ( ) + PB ( ) + PB ( ), where e i the uit vector i the directio of the th coordiate. Coider B. Becaue e A A A βa e A A A βa C X X C X X / / / A η τη C, we have PB ( ) by (A5).

13 Now coider B. Becaue / / e A A A ( ) C X C τ / ad β b for A, = A Aε β PB ( ) P ( e C X /3 A ) / qg [ b ( τ ) /3] with the tail probability g () t i Lemma. Therefore, PB ( ) by (A), Lemma, (A4) ad (A5). Now / A = Op q b [ /( )]. Therefore, by (A5), λ λq λ C O o b ). Thi give PB ( ). A e A () ( A = p = p τ τb 3 For B 4, we have w = β Op( h ) + η. Sice x H /, for large C / 4 ε λ η PB ( ) P{ x H (/3) /[ C ( h + )] A } + o() / λ η qg{(/ 3) /[ C ( h + )]}. Therefore, by Lemma ad (A5), PB ( ). For B 5 we have 4 XA CA A x w x A A A β X C Therefore, Therefore, PB ( ) 5 / / q τ [ Op( h) + η]. τ b by (A5). Fially, for B 6 we have x H X β x X β η. A A A A x HXA β A η β η[ Op( h) + η]. w Therefore, PB ( ) 6 by (A5). Thi complete the proof.

14 REFERENCES Bickel, P.J., Y. Ritov, A.B. Tybakov (9). Simultaeou aalyi of Lao ad Datzig elector, Aal of Statitic. 37, Fa, J. ad R. Li (). Variable electio via ococave pealized likelihood ad it oracle propertie, Joural of the America Statitical Aociatio, 96, Fa, J. ad H. Peg (4). Nococave pealized likelihood with a divergig umber of parameter, Aal of Statitic, 3, Fa, J., H. Peg, ad T. Huag (5). Semiliear high-dimeioal model for ormalizatio of microarray data, Joural of the America Statitical Aociatio,, Greehtei, E. ad Y. Ritov (4). Peritece i high-dimeioal liear predictor electio ad the virtue of overparameterizatio, Beroulli,, Huag, J., J.L. Horowitz, ad S. Ma (8). Aymptotic propertie of bridge etimator i pare high-dimeioal regreio model, Aal of Statitic, 36, Huag, J., S. Ma, ad C.-H. Zhag (8). Adaptive Lao for pare high-dimeioal regreio model, Statitica Siica, 8, Kight, K. ad W.J. Fu (). Aymptotic for lao-type etimator, Aal of Statitic, 8, Lv, J. ad Y. Fa (9). A uified approach to model electio ad pare recovery uig regularized leat quare, Aal of Statitic, 37, Meihaue, N. ad P. Bühlma (6). High dimeioal graph ad variable electio with the Lao, Aal of Statitic, 34, Tibhirai, R. (996). Regreio hrikage ad electio via the Lao, Joural of the Royal Statitical Society Serie B, 58, Va der Vaart, A.W. ad J.A. Weller (996). Weak Covergece ad Empirical Procee, New York: Spriger-Verlag. Zhag, C.-H. ad J. Huag (8). The parity ad bia of the Lao electio i highdimeioal liear model, Aal of Statitic, 36, Zhag, T. (9). Some harp performace boud for leat quare regreio with pealizatio, Aal of Statitic, 37, L Zhao, P. ad B. Yu (6). O model electio coitecy of LASSO, Joural of Machie Learig Reearch, 7, Zou, H. (6). The adaptive Lao ad it oracle propertie, Joural of the America Statitical Aociatio,,

15 Zou, H. ad H.H. Zhag (9). O the adaptive elatic-et with a divergig umber of parameter, Aal of Statitic, 37,

16 TABLE : Mea Square Error of OLS Etimate of β from Full ad Reduced Model Mea-Square Error of Etimate of β d Full Model Reduced Model

17 TABLE : Reult of LASSO ad Adaptive LASSO Etimatio Prob. that Selected Average Size of Model Cotai d MSE of ˆ β Selected Model Large Variable Average λ LASSO ADAPTIVE LASSO

18 TABLE 3: Reult of LASSO ad Adaptive LASSO Etimatio with β Not i the Pealty Fuctio d Prob. that Selected Average Size of Model Cotai MSE of ˆ β Selected Model Large Variable Average λ LASSO ADAPTIVE LASSO

19 TABLE 4: Reult of Etimatig Effect of Uio Memberhip ad Marital Statu o Wage Coefficiet (Stadard Error) Obtaied from Variable OLS LASSO Adaptive LASSO Uio... Member (.7) (.96) (.94) Marital.5.9. Statu (.9) (.) (.) 8

10-716: Advanced Machine Learning Spring Lecture 13: March 5

10-716: Advanced Machine Learning Spring Lecture 13: March 5 10-716: Advaced Machie Learig Sprig 019 Lecture 13: March 5 Lecturer: Pradeep Ravikumar Scribe: Charvi Ratogi, Hele Zhou, Nicholay opi Note: Lae template courtey of UC Berkeley EECS dept. Diclaimer: hee

More information

x z Increasing the size of the sample increases the power (reduces the probability of a Type II error) when the significance level remains fixed.

x z Increasing the size of the sample increases the power (reduces the probability of a Type II error) when the significance level remains fixed. ] z-tet for the mea, μ If the P-value i a mall or maller tha a pecified value, the data are tatitically igificat at igificace level. Sigificace tet for the hypothei H 0: = 0 cocerig the ukow mea of a populatio

More information

TESTS OF SIGNIFICANCE

TESTS OF SIGNIFICANCE TESTS OF SIGNIFICANCE Seema Jaggi I.A.S.R.I., Library Aveue, New Delhi eema@iari.re.i I applied ivetigatio, oe i ofte itereted i comparig ome characteritic (uch a the mea, the variace or a meaure of aociatio

More information

A Tail Bound For Sums Of Independent Random Variables And Application To The Pareto Distribution

A Tail Bound For Sums Of Independent Random Variables And Application To The Pareto Distribution Applied Mathematic E-Note, 9009, 300-306 c ISSN 1607-510 Available free at mirror ite of http://wwwmaththuedutw/ ame/ A Tail Boud For Sum Of Idepedet Radom Variable Ad Applicatio To The Pareto Ditributio

More information

Comments on Discussion Sheet 18 and Worksheet 18 ( ) An Introduction to Hypothesis Testing

Comments on Discussion Sheet 18 and Worksheet 18 ( ) An Introduction to Hypothesis Testing Commet o Dicuio Sheet 18 ad Workheet 18 ( 9.5-9.7) A Itroductio to Hypothei Tetig Dicuio Sheet 18 A Itroductio to Hypothei Tetig We have tudied cofidece iterval for a while ow. Thee are method that allow

More information

STRONG DEVIATION THEOREMS FOR THE SEQUENCE OF CONTINUOUS RANDOM VARIABLES AND THE APPROACH OF LAPLACE TRANSFORM

STRONG DEVIATION THEOREMS FOR THE SEQUENCE OF CONTINUOUS RANDOM VARIABLES AND THE APPROACH OF LAPLACE TRANSFORM Joural of Statitic: Advace i Theory ad Applicatio Volume, Number, 9, Page 35-47 STRONG DEVIATION THEORES FOR THE SEQUENCE OF CONTINUOUS RANDO VARIABLES AND THE APPROACH OF LAPLACE TRANSFOR School of athematic

More information

u t u 0 ( 7) Intuitively, the maximum principles can be explained by the following observation. Recall

u t u 0 ( 7) Intuitively, the maximum principles can be explained by the following observation. Recall Oct. Heat Equatio M aximum priciple I thi lecture we will dicu the maximum priciple ad uiquee of olutio for the heat equatio.. Maximum priciple. The heat equatio alo ejoy maximum priciple a the Laplace

More information

Chapter 9. Key Ideas Hypothesis Test (Two Populations)

Chapter 9. Key Ideas Hypothesis Test (Two Populations) Chapter 9 Key Idea Hypothei Tet (Two Populatio) Sectio 9-: Overview I Chapter 8, dicuio cetered aroud hypothei tet for the proportio, mea, ad tadard deviatio/variace of a igle populatio. However, ofte

More information

VIII. Interval Estimation A. A Few Important Definitions (Including Some Reminders)

VIII. Interval Estimation A. A Few Important Definitions (Including Some Reminders) VIII. Iterval Etimatio A. A Few Importat Defiitio (Icludig Some Remider) 1. Poit Etimate - a igle umerical value ued a a etimate of a parameter.. Poit Etimator - the ample tatitic that provide the poit

More information

Generalized Likelihood Functions and Random Measures

Generalized Likelihood Functions and Random Measures Pure Mathematical Sciece, Vol. 3, 2014, o. 2, 87-95 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/pm.2014.437 Geeralized Likelihood Fuctio ad Radom Meaure Chrito E. Koutzaki Departmet of Mathematic

More information

A tail bound for sums of independent random variables : application to the symmetric Pareto distribution

A tail bound for sums of independent random variables : application to the symmetric Pareto distribution A tail boud for um of idepedet radom variable : applicatio to the ymmetric Pareto ditributio Chritophe Cheeau To cite thi verio: Chritophe Cheeau. A tail boud for um of idepedet radom variable : applicatio

More information

REVIEW OF SIMPLE LINEAR REGRESSION SIMPLE LINEAR REGRESSION

REVIEW OF SIMPLE LINEAR REGRESSION SIMPLE LINEAR REGRESSION REVIEW OF SIMPLE LINEAR REGRESSION SIMPLE LINEAR REGRESSION I liear regreio, we coider the frequecy ditributio of oe variable (Y) at each of everal level of a ecod variable (X). Y i kow a the depedet variable.

More information

LECTURE 13 SIMULTANEOUS EQUATIONS

LECTURE 13 SIMULTANEOUS EQUATIONS NOVEMBER 5, 26 Demad-upply ytem LETURE 3 SIMULTNEOUS EQUTIONS I thi lecture, we dicu edogeeity problem that arie due to imultaeity, i.e. the left-had ide variable ad ome of the right-had ide variable are

More information

Statistical Inference Procedures

Statistical Inference Procedures Statitical Iferece Procedure Cofidece Iterval Hypothei Tet Statitical iferece produce awer to pecific quetio about the populatio of iteret baed o the iformatio i a ample. Iferece procedure mut iclude a

More information

IntroEcono. Discrete RV. Continuous RV s

IntroEcono. Discrete RV. Continuous RV s ItroEcoo Aoc. Prof. Poga Porchaiwiekul, Ph.D... ก ก e-mail: Poga.P@chula.ac.th Homepage: http://pioeer.chula.ac.th/~ppoga (c) Poga Porchaiwiekul, Chulalogkor Uiverity Quatitative, e.g., icome, raifall

More information

Applied Mathematical Sciences, Vol. 9, 2015, no. 3, HIKARI Ltd,

Applied Mathematical Sciences, Vol. 9, 2015, no. 3, HIKARI Ltd, Applied Mathematical Sciece Vol 9 5 o 3 7 - HIKARI Ltd wwwm-hiaricom http://dxdoiorg/988/am54884 O Poitive Defiite Solutio of the Noliear Matrix Equatio * A A I Saa'a A Zarea* Mathematical Sciece Departmet

More information

ON THE SCALE PARAMETER OF EXPONENTIAL DISTRIBUTION

ON THE SCALE PARAMETER OF EXPONENTIAL DISTRIBUTION Review of the Air Force Academy No. (34)/7 ON THE SCALE PARAMETER OF EXPONENTIAL DISTRIBUTION Aca Ileaa LUPAŞ Military Techical Academy, Bucharet, Romaia (lua_a@yahoo.com) DOI:.96/84-938.7.5..6 Abtract:

More information

20. CONFIDENCE INTERVALS FOR THE MEAN, UNKNOWN VARIANCE

20. CONFIDENCE INTERVALS FOR THE MEAN, UNKNOWN VARIANCE 20. CONFIDENCE INTERVALS FOR THE MEAN, UNKNOWN VARIANCE If the populatio tadard deviatio σ i ukow, a it uually will be i practice, we will have to etimate it by the ample tadard deviatio. Sice σ i ukow,

More information

Collective Support Recovery for Multi-Design Multi-Response Linear Regression

Collective Support Recovery for Multi-Design Multi-Response Linear Regression IEEE TRANACTION ON INFORMATION THEORY, VOL XX, NO XX, XX 4 Collective upport Recovery for Multi-Deig Multi-Repoe Liear Regreio eiguag ag, Yigbi Liag, Eric P Xig Abtract The multi-deig multi-repoe MDMR

More information

STA 4032 Final Exam Formula Sheet

STA 4032 Final Exam Formula Sheet Chapter 2. Probability STA 4032 Fial Eam Formula Sheet Some Baic Probability Formula: (1) P (A B) = P (A) + P (B) P (A B). (2) P (A ) = 1 P (A) ( A i the complemet of A). (3) If S i a fiite ample pace

More information

Heat Equation: Maximum Principles

Heat Equation: Maximum Principles Heat Equatio: Maximum Priciple Nov. 9, 0 I thi lecture we will dicu the maximum priciple ad uiquee of olutio for the heat equatio.. Maximum priciple. The heat equatio alo ejoy maximum priciple a the Laplace

More information

Chapter 1 ASPECTS OF MUTIVARIATE ANALYSIS

Chapter 1 ASPECTS OF MUTIVARIATE ANALYSIS Chapter ASPECTS OF MUTIVARIATE ANALYSIS. Itroductio Defiitio Wiipedia: Multivariate aalyi MVA i baed o the tatitical priciple of multivariate tatitic which ivolve obervatio ad aalyi of more tha oe tatitical

More information

100(1 α)% confidence interval: ( x z ( sample size needed to construct a 100(1 α)% confidence interval with a margin of error of w:

100(1 α)% confidence interval: ( x z ( sample size needed to construct a 100(1 α)% confidence interval with a margin of error of w: Stat 400, ectio 7. Large Sample Cofidece Iterval ote by Tim Pilachowki a Large-Sample Two-ided Cofidece Iterval for a Populatio Mea ectio 7.1 redux The poit etimate for a populatio mea µ will be a ample

More information

STUDENT S t-distribution AND CONFIDENCE INTERVALS OF THE MEAN ( )

STUDENT S t-distribution AND CONFIDENCE INTERVALS OF THE MEAN ( ) STUDENT S t-distribution AND CONFIDENCE INTERVALS OF THE MEAN Suppoe that we have a ample of meaured value x1, x, x3,, x of a igle uow quatity. Aumig that the meauremet are draw from a ormal ditributio

More information

COMPARISONS INVOLVING TWO SAMPLE MEANS. Two-tail tests have these types of hypotheses: H A : 1 2

COMPARISONS INVOLVING TWO SAMPLE MEANS. Two-tail tests have these types of hypotheses: H A : 1 2 Tetig Hypothee COMPARISONS INVOLVING TWO SAMPLE MEANS Two type of hypothee:. H o : Null Hypothei - hypothei of o differece. or 0. H A : Alterate Hypothei hypothei of differece. or 0 Two-tail v. Oe-tail

More information

SOLUTION: The 95% confidence interval for the population mean µ is x ± t 0.025; 49

SOLUTION: The 95% confidence interval for the population mean µ is x ± t 0.025; 49 C22.0103 Sprig 2011 Homework 7 olutio 1. Baed o a ample of 50 x-value havig mea 35.36 ad tadard deviatio 4.26, fid a 95% cofidece iterval for the populatio mea. SOLUTION: The 95% cofidece iterval for the

More information

Société de Calcul Mathématique, S. A. Algorithmes et Optimisation

Société de Calcul Mathématique, S. A. Algorithmes et Optimisation Société de Calcul Mathématique S A Algorithme et Optimiatio Radom amplig of proportio Berard Beauzamy Jue 2008 From time to time we fid a problem i which we do ot deal with value but with proportio For

More information

Statistics and Chemical Measurements: Quantifying Uncertainty. Normal or Gaussian Distribution The Bell Curve

Statistics and Chemical Measurements: Quantifying Uncertainty. Normal or Gaussian Distribution The Bell Curve Statitic ad Chemical Meauremet: Quatifyig Ucertaity The bottom lie: Do we trut our reult? Should we (or ayoe ele)? Why? What i Quality Aurace? What i Quality Cotrol? Normal or Gauia Ditributio The Bell

More information

Tables and Formulas for Sullivan, Fundamentals of Statistics, 2e Pearson Education, Inc.

Tables and Formulas for Sullivan, Fundamentals of Statistics, 2e Pearson Education, Inc. Table ad Formula for Sulliva, Fudametal of Statitic, e. 008 Pearo Educatio, Ic. CHAPTER Orgaizig ad Summarizig Data Relative frequecy frequecy um of all frequecie Cla midpoit: The um of coecutive lower

More information

Another Look at Estimation for MA(1) Processes With a Unit Root

Another Look at Estimation for MA(1) Processes With a Unit Root Aother Look at Etimatio for MA Procee With a Uit Root F. Jay Breidt Richard A. Davi Na-Jug Hu Murray Roeblatt Colorado State Uiverity Natioal Tig-Hua Uiverity U. of Califoria, Sa Diego http://www.tat.colotate.edu/~rdavi/lecture

More information

Fig. 1: Streamline coordinates

Fig. 1: Streamline coordinates 1 Equatio of Motio i Streamlie Coordiate Ai A. Soi, MIT 2.25 Advaced Fluid Mechaic Euler equatio expree the relatiohip betwee the velocity ad the preure field i ivicid flow. Writte i term of treamlie coordiate,

More information

M227 Chapter 9 Section 1 Testing Two Parameters: Means, Variances, Proportions

M227 Chapter 9 Section 1 Testing Two Parameters: Means, Variances, Proportions M7 Chapter 9 Sectio 1 OBJECTIVES Tet two mea with idepedet ample whe populatio variace are kow. Tet two variace with idepedet ample. Tet two mea with idepedet ample whe populatio variace are equal Tet

More information

Summary and Discussion on Simultaneous Analysis of Lasso and Dantzig Selector

Summary and Discussion on Simultaneous Analysis of Lasso and Dantzig Selector Summary ad Discussio o Simultaeous Aalysis of Lasso ad Datzig Selector STAT732, Sprig 28 Duzhe Wag May 4, 28 Abstract This is a discussio o the work i Bickel, Ritov ad Tsybakov (29). We begi with a short

More information

CHAPTER 6. Confidence Intervals. 6.1 (a) y = 1269; s = 145; n = 8. The standard error of the mean is = s n = = 51.3 ng/gm.

CHAPTER 6. Confidence Intervals. 6.1 (a) y = 1269; s = 145; n = 8. The standard error of the mean is = s n = = 51.3 ng/gm. } CHAPTER 6 Cofidece Iterval 6.1 (a) y = 1269; = 145; = 8. The tadard error of the mea i SE ȳ = = 145 8 = 51.3 g/gm. (b) y = 1269; = 145; = 30. The tadard error of the mea i ȳ = 145 = 26.5 g/gm. 30 6.2

More information

On the Signed Domination Number of the Cartesian Product of Two Directed Cycles

On the Signed Domination Number of the Cartesian Product of Two Directed Cycles Ope Joural of Dicrete Mathematic, 205, 5, 54-64 Publihed Olie July 205 i SciRe http://wwwcirporg/oural/odm http://dxdoiorg/0426/odm2055005 O the Siged Domiatio Number of the Carteia Product of Two Directed

More information

Generalized Fibonacci Like Sequence Associated with Fibonacci and Lucas Sequences

Generalized Fibonacci Like Sequence Associated with Fibonacci and Lucas Sequences Turkih Joural of Aalyi ad Number Theory, 4, Vol., No. 6, 33-38 Available olie at http://pub.ciepub.com/tjat//6/9 Sciece ad Educatio Publihig DOI:.69/tjat--6-9 Geeralized Fiboacci Like Sequece Aociated

More information

JOURNAL OF THE INDIAN SOCIETY OF AGRICULTURAL STATISTICS

JOURNAL OF THE INDIAN SOCIETY OF AGRICULTURAL STATISTICS Available olie at www.ia.org.i/jia JOURA OF THE IDIA OIETY OF AGRIUTURA TATITI 64() 00 55-60 Variace Etimatio for te Regreio Etimator of te Mea i tratified amplig UMMARY at Gupta * ad Javid abbir Departmet

More information

ME 410 MECHANICAL ENGINEERING SYSTEMS LABORATORY REGRESSION ANALYSIS

ME 410 MECHANICAL ENGINEERING SYSTEMS LABORATORY REGRESSION ANALYSIS ME 40 MECHANICAL ENGINEERING REGRESSION ANALYSIS Regreio problem deal with the relatiohip betwee the frequec ditributio of oe (depedet) variable ad aother (idepedet) variable() which i (are) held fied

More information

Chapter 1 Econometrics

Chapter 1 Econometrics Chapter Ecoometric There are o exercie or applicatio i Chapter. 0 Pearo Educatio, Ic. Publihig a Pretice Hall Chapter The Liear Regreio Model There are o exercie or applicatio i Chapter. 0 Pearo Educatio,

More information

18.05 Problem Set 9, Spring 2014 Solutions

18.05 Problem Set 9, Spring 2014 Solutions 18.05 Problem Set 9, Sprig 2014 Solutio Problem 1. (10 pt.) (a) We have x biomial(, θ), o E(X) =θ ad Var(X) = θ(1 θ). The rule-of-thumb variace i jut 4. So the ditributio beig plotted are biomial(250,

More information

Parameter Estimation for Discretely Observed Vasicek Model Driven by Small Lévy Noises

Parameter Estimation for Discretely Observed Vasicek Model Driven by Small Lévy Noises arameter Etimatio for Dicretely Oberved Vaicek Model Drive by Small Lévy Noie Chao Wei Abtract Thi paper i cocered with the parameter etimatio problem for Vaicek model drive by mall Lévy oie from dicrete

More information

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015 ECE 8527: Itroductio to Machie Learig ad Patter Recogitio Midterm # 1 Vaishali Ami Fall, 2015 tue39624@temple.edu Problem No. 1: Cosider a two-class discrete distributio problem: ω 1 :{[0,0], [2,0], [2,2],

More information

AN APPLICATION OF HYPERHARMONIC NUMBERS IN MATRICES

AN APPLICATION OF HYPERHARMONIC NUMBERS IN MATRICES Hacettepe Joural of Mathematic ad Statitic Volume 4 4 03, 387 393 AN APPLICATION OF HYPERHARMONIC NUMBERS IN MATRICES Mutafa Bahşi ad Süleyma Solak Received 9 : 06 : 0 : Accepted 8 : 0 : 03 Abtract I thi

More information

Positive solutions of singular (k,n-k) conjugate boundary value problem

Positive solutions of singular (k,n-k) conjugate boundary value problem Joural of Applied Mathematic & Bioiformatic vol5 o 25-2 ISSN: 792-662 prit 792-699 olie Sciepre Ltd 25 Poitive olutio of igular - cojugate boudar value problem Ligbi Kog ad Tao Lu 2 Abtract Poitive olutio

More information

S T A T R a c h e l L. W e b b, P o r t l a n d S t a t e U n i v e r s i t y P a g e 1. = Population Variance

S T A T R a c h e l L. W e b b, P o r t l a n d S t a t e U n i v e r s i t y P a g e 1. = Population Variance S T A T 4 - R a c h e l L. W e b b, P o r t l a d S t a t e U i v e r i t y P a g e Commo Symbol = Sample Size x = Sample Mea = Sample Stadard Deviatio = Sample Variace pˆ = Sample Proportio r = Sample

More information

Chapter 9: Hypothesis Testing

Chapter 9: Hypothesis Testing Chapter 9: Hypothei Tetig Chapter 5 dicued the cocept of amplig ditributio ad Chapter 8 dicued how populatio parameter ca be etimated from a ample. 9. Baic cocept Hypothei Tetig We begi by makig a tatemet,

More information

Statistical treatment of test results

Statistical treatment of test results SCAN-G :07 Revied 007 Pulp, paper ad board Statitical treatmet of tet reult 0 Itroductio The value of tatitical method lie i the fact that they make it poible to iterpret tet reult accordig to trictly

More information

Chapter 8.2. Interval Estimation

Chapter 8.2. Interval Estimation Chapter 8.2. Iterval Etimatio Baic of Cofidece Iterval ad Large Sample Cofidece Iterval 1 Baic Propertie of Cofidece Iterval Aumptio: X 1, X 2,, X are from Normal ditributio with a mea of µ ad tadard deviatio.

More information

Confidence Intervals: Three Views Class 23, Jeremy Orloff and Jonathan Bloom

Confidence Intervals: Three Views Class 23, Jeremy Orloff and Jonathan Bloom Cofidece Iterval: Three View Cla 23, 18.05 Jeremy Orloff ad Joatha Bloom 1 Learig Goal 1. Be able to produce z, t ad χ 2 cofidece iterval baed o the correpodig tadardized tatitic. 2. Be able to ue a hypothei

More information

On the 2-Domination Number of Complete Grid Graphs

On the 2-Domination Number of Complete Grid Graphs Ope Joural of Dicrete Mathematic, 0,, -0 http://wwwcirporg/oural/odm ISSN Olie: - ISSN Prit: - O the -Domiatio Number of Complete Grid Graph Ramy Shahee, Suhail Mahfud, Khame Almaea Departmet of Mathematic,

More information

Estimation Theory. goavendaño. Estimation Theory

Estimation Theory. goavendaño. Estimation Theory Etimatio Theory Statitical Iferece method by which geeralizatio are made about a populatio Two Major Area of Statitical Iferece. Etimatio a parameter i etablihed baed o the amplig ditributio of a proportio,

More information

UNIVERSITY OF CALICUT

UNIVERSITY OF CALICUT Samplig Ditributio 1 UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION BSc. MATHEMATICS COMPLEMENTARY COURSE CUCBCSS 2014 Admiio oward III Semeter STATISTICAL INFERENCE Quetio Bak 1. The umber of poible

More information

On the Positive Definite Solutions of the Matrix Equation X S + A * X S A = Q

On the Positive Definite Solutions of the Matrix Equation X S + A * X S A = Q The Ope Applied Mathematic Joural 011 5 19-5 19 Ope Acce O the Poitive Defiite Solutio of the Matrix Equatio X S + A * X S A = Q Maria Adam * Departmet of Computer Sciece ad Biomedical Iformatic Uiverity

More information

Erick L. Oberstar Fall 2001 Project: Sidelobe Canceller & GSC 1. Advanced Digital Signal Processing Sidelobe Canceller (Beam Former)

Erick L. Oberstar Fall 2001 Project: Sidelobe Canceller & GSC 1. Advanced Digital Signal Processing Sidelobe Canceller (Beam Former) Erick L. Obertar Fall 001 Project: Sidelobe Caceller & GSC 1 Advaced Digital Sigal Proceig Sidelobe Caceller (Beam Former) Erick L. Obertar 001 Erick L. Obertar Fall 001 Project: Sidelobe Caceller & GSC

More information

8.6 Order-Recursive LS s[n]

8.6 Order-Recursive LS s[n] 8.6 Order-Recurive LS [] Motivate ti idea wit Curve Fittig Give data: 0,,,..., - [0], [],..., [-] Wat to fit a polyomial to data.., but wic oe i te rigt model?! Cotat! Quadratic! Liear! Cubic, Etc. ry

More information

Questions about the Assignment. Describing Data: Distributions and Relationships. Measures of Spread Standard Deviation. One Quantitative Variable

Questions about the Assignment. Describing Data: Distributions and Relationships. Measures of Spread Standard Deviation. One Quantitative Variable Quetio about the Aigmet Read the quetio ad awer the quetio that are aked Experimet elimiate cofoudig variable Decribig Data: Ditributio ad Relatiohip GSS people attitude veru their characteritic ad poue

More information

Confidence Intervals. Confidence Intervals

Confidence Intervals. Confidence Intervals A overview Mot probability ditributio are idexed by oe me parameter. F example, N(µ,σ 2 ) B(, p). I igificace tet, we have ued poit etimat f parameter. F example, f iid Y 1,Y 2,...,Y N(µ,σ 2 ), Ȳ i a poit

More information

Lecture 24: Variable selection in linear models

Lecture 24: Variable selection in linear models Lecture 24: Variable selectio i liear models Cosider liear model X = Z β + ε, β R p ad Varε = σ 2 I. Like the LSE, the ridge regressio estimator does ot give 0 estimate to a compoet of β eve if that compoet

More information

= o(1) and n 1/2 γ n log M = o(1) for some

= o(1) and n 1/2 γ n log M = o(1) for some Web-baed upportig material for FMEM: Fuctioal Mixed effect Model for Logitudial Fuctioal Repoe by Hogtu Zhu, Kehui Che, X. Luo, Yig Yua, ad Jae-Lig Wag A Proof of Theorem Recall that y i = (y i1,..., y

More information

Isolated Word Recogniser

Isolated Word Recogniser Lecture 5 Iolated Word Recogitio Hidde Markov Model of peech State traitio ad aligmet probabilitie Searchig all poible aligmet Dyamic Programmig Viterbi Aligmet Iolated Word Recogitio 8. Iolated Word Recogier

More information

Tools Hypothesis Tests

Tools Hypothesis Tests Tool Hypothei Tet The Tool meu provide acce to a Hypothei Tet procedure that calculate cofidece iterval ad perform hypothei tet for mea, variace, rate ad proportio. It i cotrolled by the dialog box how

More information

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 16 11/04/2013. Ito integral. Properties

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 16 11/04/2013. Ito integral. Properties MASSACHUSES INSIUE OF ECHNOLOGY 6.65/15.7J Fall 13 Lecture 16 11/4/13 Ito itegral. Propertie Cotet. 1. Defiitio of Ito itegral. Propertie of Ito itegral 1 Ito itegral. Exitece We cotiue with the cotructio

More information

Last time: Completed solution to the optimum linear filter in real-time operation

Last time: Completed solution to the optimum linear filter in real-time operation 6.3 tochatic Etimatio ad Cotrol, Fall 4 ecture at time: Completed olutio to the oimum liear filter i real-time operatio emi-free cofiguratio: t D( p) F( p) i( p) dte dp e π F( ) F( ) ( ) F( p) ( p) 4444443

More information

State space systems analysis

State space systems analysis State pace ytem aalyi Repreetatio of a ytem i tate-pace (tate-pace model of a ytem To itroduce the tate pace formalim let u tart with a eample i which the ytem i dicuio i a imple electrical circuit with

More information

A survey on penalized empirical risk minimization Sara A. van de Geer

A survey on penalized empirical risk minimization Sara A. van de Geer A survey o pealized empirical risk miimizatio Sara A. va de Geer We address the questio how to choose the pealty i empirical risk miimizatio. Roughly speakig, this pealty should be a good boud for the

More information

We will look for series solutions to (1) around (at most) regular singular points, which without

We will look for series solutions to (1) around (at most) regular singular points, which without ENM 511 J. L. Baai April, 1 Frobeiu Solutio to a d order ODE ear a regular igular poit Coider the ODE y 16 + P16 y 16 + Q1616 y (1) We will look for erie olutio to (1) aroud (at mot) regular igular poit,

More information

Explicit scheme. Fully implicit scheme Notes. Fully implicit scheme Notes. Fully implicit scheme Notes. Notes

Explicit scheme. Fully implicit scheme Notes. Fully implicit scheme Notes. Fully implicit scheme Notes. Notes Explicit cheme So far coidered a fully explicit cheme to umerically olve the diffuio equatio: T + = ( )T + (T+ + T ) () with = κ ( x) Oly table for < / Thi cheme i ometime referred to a FTCS (forward time

More information

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio

More information

Reasons for Sampling. Forest Sampling. Scales of Measurement. Scales of Measurement. Sampling Error. Sampling - General Approach

Reasons for Sampling. Forest Sampling. Scales of Measurement. Scales of Measurement. Sampling Error. Sampling - General Approach Foret amplig Aver & Burkhart, Chpt. & Reao for amplig Do NOT have the time or moe to do a complete eumeratio Remember that the etimate of the populatio parameter baed o a ample are ot accurate, therefore

More information

Fractional parts and their relations to the values of the Riemann zeta function

Fractional parts and their relations to the values of the Riemann zeta function Arab. J. Math. (08) 7: 8 http://doi.org/0.007/40065-07-084- Arabia Joural of Mathematic Ibrahim M. Alabdulmohi Fractioal part ad their relatio to the value of the Riema zeta fuctio Received: 4 Jauary 07

More information

Random Variables, Sampling and Estimation

Random Variables, Sampling and Estimation Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig

More information

Assignment 1 - Solutions. ECSE 420 Parallel Computing Fall November 2, 2014

Assignment 1 - Solutions. ECSE 420 Parallel Computing Fall November 2, 2014 Aigmet - Solutio ECSE 420 Parallel Computig Fall 204 ovember 2, 204. (%) Decribe briefly the followig term, expoe their caue, ad work-aroud the idutry ha udertake to overcome their coequece: (i) Memory

More information

ELEC 372 LECTURE NOTES, WEEK 4 Dr. Amir G. Aghdam Concordia University

ELEC 372 LECTURE NOTES, WEEK 4 Dr. Amir G. Aghdam Concordia University ELEC 37 LECTURE NOTES, WEE 4 Dr Amir G Aghdam Cocordia Uiverity Part of thee ote are adapted from the material i the followig referece: Moder Cotrol Sytem by Richard C Dorf ad Robert H Bihop, Pretice Hall

More information

11/19/ Chapter 10 Overview. Chapter 10: Two-Sample Inference. + The Big Picture : Inference for Mean Difference Dependent Samples

11/19/ Chapter 10 Overview. Chapter 10: Two-Sample Inference. + The Big Picture : Inference for Mean Difference Dependent Samples /9/0 + + Chapter 0 Overview Dicoverig Statitic Eitio Daiel T. Laroe Chapter 0: Two-Sample Iferece 0. Iferece for Mea Differece Depeet Sample 0. Iferece for Two Iepeet Mea 0.3 Iferece for Two Iepeet Proportio

More information

MTH 212 Formulas page 1 out of 7. Sample variance: s = Sample standard deviation: s = s

MTH 212 Formulas page 1 out of 7. Sample variance: s = Sample standard deviation: s = s MTH Formula age out of 7 DESCRIPTIVE TOOLS Poulatio ize = N Samle ize = x x+ x +... + x x Poulatio mea: µ = Samle mea: x = = N ( µ ) ( x x) Poulatio variace: = Samle variace: = N Poulatio tadard deviatio:

More information

Lecture 7: Density Estimation: k-nearest Neighbor and Basis Approach

Lecture 7: Density Estimation: k-nearest Neighbor and Basis Approach STAT 425: Itroductio to Noparametric Statistics Witer 28 Lecture 7: Desity Estimatio: k-nearest Neighbor ad Basis Approach Istructor: Ye-Chi Che Referece: Sectio 8.4 of All of Noparametric Statistics.

More information

Properties and Hypothesis Testing

Properties and Hypothesis Testing Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

It is always the case that unions, intersections, complements, and set differences are preserved by the inverse image of a function.

It is always the case that unions, intersections, complements, and set differences are preserved by the inverse image of a function. MATH 532 Measurable Fuctios Dr. Neal, WKU Throughout, let ( X, F, µ) be a measure space ad let (!, F, P ) deote the special case of a probability space. We shall ow begi to study real-valued fuctios defied

More information

a 1 = 1 a a a a n n s f() s = Σ log a 1 + a a n log n sup log a n+1 + a n+2 + a n+3 log n sup () s = an /n s s = + t i

a 1 = 1 a a a a n n s f() s = Σ log a 1 + a a n log n sup log a n+1 + a n+2 + a n+3 log n sup () s = an /n s s = + t i 0 Dirichlet Serie & Logarithmic Power Serie. Defiitio & Theorem Defiitio.. (Ordiary Dirichlet Serie) Whe,a,,3, are complex umber, we call the followig Ordiary Dirichlet Serie. f() a a a a 3 3 a 4 4 Note

More information

PRIMARY DECOMPOSITION, ASSOCIATED PRIME IDEALS AND GABRIEL TOPOLOGY

PRIMARY DECOMPOSITION, ASSOCIATED PRIME IDEALS AND GABRIEL TOPOLOGY Orietal J. ath., Volue 1, Nuber, 009, Page 101-108 009 Orietal Acadeic Publiher PRIARY DECOPOSITION, ASSOCIATED PRIE IDEALS AND GABRIEL TOPOLOGY. EL HAJOUI, A. IRI ad A. ZOGLAT Uiverité ohaed V aculté

More information

TI-83/84 Calculator Instructions for Math Elementary Statistics

TI-83/84 Calculator Instructions for Math Elementary Statistics TI-83/84 Calculator Itructio for Math 34- Elemetary Statitic. Eterig Data: Data oit are tored i Lit o the TI-83/84. If you have't ued the calculator before, you may wat to erae everythig that wa there.

More information

Brief Review of Linear System Theory

Brief Review of Linear System Theory Brief Review of Liear Sytem heory he followig iformatio i typically covered i a coure o liear ytem theory. At ISU, EE 577 i oe uch coure ad i highly recommeded for power ytem egieerig tudet. We have developed

More information

Stat 3411 Spring 2011 Assignment 6 Answers

Stat 3411 Spring 2011 Assignment 6 Answers Stat 3411 Sprig 2011 Aigmet 6 Awer (A) Awer are give i 10 3 cycle (a) 149.1 to 187.5 Sice 150 i i the 90% 2-ided cofidece iterval, we do ot reject H 0 : µ 150 v i favor of the 2-ided alterative H a : µ

More information

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors ECONOMETRIC THEORY MODULE XIII Lecture - 34 Asymptotic Theory ad Stochastic Regressors Dr. Shalabh Departmet of Mathematics ad Statistics Idia Istitute of Techology Kapur Asymptotic theory The asymptotic

More information

A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS

A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS J. Japa Statist. Soc. Vol. 41 No. 1 2011 67 73 A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS Yoichi Nishiyama* We cosider k-sample ad chage poit problems for idepedet data i a

More information

NORM ESTIMATES FOR BESSEL-RIESZ OPERATORS ON GENERALIZED MORREY SPACES

NORM ESTIMATES FOR BESSEL-RIESZ OPERATORS ON GENERALIZED MORREY SPACES NORM ESTIMATES FOR ESSEL-RIESZ OPERATORS ON GENERALIZED MORREY SPACES Mochammad Idri, Hedra Guawa, ad Eridai 3 Departmet of Mathematic, Ititut Tekologi adug, adug 403, Idoeia [Permaet Addre: Departmet

More information

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample. Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized

More information

A class of nonparametric density derivative estimators based on global Lipschitz conditions 1

A class of nonparametric density derivative estimators based on global Lipschitz conditions 1 A cla of oparametric deity derivative etimator baed o global Lipchitz coditio Kairat Mybaev Iteratioal School of Ecoomic Kazah-Britih Techical Uiverity Tolebi 59 Almaty 050000, Kazahta email: airat mybayev@yahoo.com

More information

Weak formulation and Lagrange equations of motion

Weak formulation and Lagrange equations of motion Chapter 4 Weak formulatio ad Lagrage equatio of motio A mot commo approach to tudy tructural dyamic i the ue of the Lagrage equatio of motio. Thee are obtaied i thi chapter tartig from the Cauchy equatio

More information

ECE 901 Lecture 12: Complexity Regularization and the Squared Loss

ECE 901 Lecture 12: Complexity Regularization and the Squared Loss ECE 90 Lecture : Complexity Regularizatio ad the Squared Loss R. Nowak 5/7/009 I the previous lectures we made use of the Cheroff/Hoeffdig bouds for our aalysis of classifier errors. Hoeffdig s iequality

More information

Further Investigation of alternative Formulation of RP Model with Response Error. Ed Stanek

Further Investigation of alternative Formulation of RP Model with Response Error. Ed Stanek Further vetigatio of alterative Formulatio of RP odel with Repoe Error Ed Staek TRODCTO We explore the predictor that will reult i a imple radom ample with repoe error whe a differet model i potulated

More information

Expectation of the Ratio of a Sum of Squares to the Square of the Sum : Exact and Asymptotic results

Expectation of the Ratio of a Sum of Squares to the Square of the Sum : Exact and Asymptotic results Expectatio of the Ratio of a Sum of Square to the Square of the Sum : Exact ad Aymptotic reult A. Fuch, A. Joffe, ad J. Teugel 63 October 999 Uiverité de Strabourg Uiverité de Motréal Katholieke Uiveriteit

More information

EULER-MACLAURIN SUM FORMULA AND ITS GENERALIZATIONS AND APPLICATIONS

EULER-MACLAURIN SUM FORMULA AND ITS GENERALIZATIONS AND APPLICATIONS EULER-MACLAURI SUM FORMULA AD ITS GEERALIZATIOS AD APPLICATIOS Joe Javier Garcia Moreta Graduate tudet of Phyic at the UPV/EHU (Uiverity of Baque coutry) I Solid State Phyic Addre: Practicate Ada y Grijalba

More information

Application to Random Graphs

Application to Random Graphs A Applicatio to Radom Graphs Brachig processes have a umber of iterestig ad importat applicatios. We shall cosider oe of the most famous of them, the Erdős-Réyi radom graph theory. 1 Defiitio A.1. Let

More information

On The Computation Of Weighted Shapley Values For Cooperative TU Games

On The Computation Of Weighted Shapley Values For Cooperative TU Games O he Computatio Of Weighted hapley Value For Cooperative U Game Iriel Draga echical Report 009-0 http://www.uta.edu/math/preprit/ Computatio of Weighted hapley Value O HE COMPUAIO OF WEIGHED HAPLEY VALUE

More information

11 Correlation and Regression

11 Correlation and Regression 11 Correlatio Regressio 11.1 Multivariate Data Ofte we look at data where several variables are recorded for the same idividuals or samplig uits. For example, at a coastal weather statio, we might record

More information

The Random Walk For Dummies

The Random Walk For Dummies The Radom Walk For Dummies Richard A Mote Abstract We look at the priciples goverig the oe-dimesioal discrete radom walk First we review five basic cocepts of probability theory The we cosider the Beroulli

More information

Zeta-reciprocal Extended reciprocal zeta function and an alternate formulation of the Riemann hypothesis By M. Aslam Chaudhry

Zeta-reciprocal Extended reciprocal zeta function and an alternate formulation of the Riemann hypothesis By M. Aslam Chaudhry Zeta-reciprocal Eteded reciprocal zeta fuctio ad a alterate formulatio of the Riema hypothei By. Alam Chaudhry Departmet of athematical Sciece, Kig Fahd Uiverity of Petroleum ad ieral Dhahra 36, Saudi

More information

Efficient Point Estimation of the Sharpe Ratio

Efficient Point Estimation of the Sharpe Ratio Joural of Statitical ad Ecoometric Method, vol., o.4, 013, 19-14 ISSN: 41-0384 (prit), 41-0376 (olie) Sciepre Ltd, 013 Efficiet Poit Etimatio of the Sharpe Ratio Grat H. Skrepek 1 ad Ahok Sahai Abtract

More information