Cross-Validation in Function Estimation

Size: px
Start display at page:

Download "Cross-Validation in Function Estimation"

Transcription

1 Crss-Validati i Fucti Estimati Chg Gu Octber 1, 2006 Crss-validati is a ituitive ad effective techique fr mdel selecti i data aalysis. I this discussi, I try t preset a few icaratis f the geeral techique i a few parametric fucti estimati settigs. Justificatis f the techique i Gaussia regressi settigs will be discussed, alg with pssible reass fr the lack f similar justificati i ther settigs. There will be discussis f sme subtle cceptual issues which put certai widely adpted ccepts/practice uder scrutiy. 1 Crss-Validati ad Related Techiques 1.1 PRESS ad C p Csider a liear regressi mdel with P 1 predictrs X 1,..., X P 1, Y = β 0 + β 1 X β p 1 X P 1 + ɛ, where ɛ N(0, σ 2 ). It is assumed that µ Y = β 0 + β 1 X β P 1 X P 1, but sme f the β j X j s may ctribute very little r t at all. Fr mdel selecti i this settig (a.k.a. variable selecti), tw effective techiques are PRESS ad C p. Observig (Y i, X i,1,..., X i,p 1 ), = 1,...,, e may calculate PRESS (Predicted REsidual Sum f Squares) fr every f the (2 P 1 1) pssible mdels, PRESS = (Y i Ŷi(i)) 2, ˆβ [i] ˆβ [i] where Ŷi(i) [i] = 0 + ˆβ 1 X i,1 + + P 1 X i,p 1 ad j s are LS estimates usig the ( 1) bservatis excludig (Y i, X i,1,..., X i,p 1 ). Oe the ca chse the mdel with the miimum PRESS scre. PRESS is prbably the first icarati f crss-validati, ad the bjective f mdel selecti is t achieve mre precise predicti. Fr every f the (2 P 1 1) pssible mdels, e may als calculate the C p statistic, C p = ˆβ [i] SSE ( 2p), MSE(X 1,..., X P 1 ) where SSE is fr the mdel uder csiderati ad p is its umber f cefficiets. Oe selects a mdel with a small C p (s that the MSE E(ˆµ µ) 2 is small) that is clse t p (s that E ˆµ µ). The bjective f C p -based selecti is fr the estimati precisi f µ Y but t fr the idetificati f the crrect mdel, as the full mdel is assumed t be crrect t start with. The estimati f egligible β j s diverts resurces ad thus results i higher variace f the imprtat ˆβ j s, ad the elimiati f the mir terms, while itrducig bias i ˆµ, helps t reduce the variace. 1

2 1.2 Crss-validati, GCV, ad C L Csider a regressi mdel Y = η(x) + ɛ where ɛ N(0, σ 2 ) ad f(x) is smth. Tw f the ppular appraches t the parametric estimati f f(x) are the kerel methd ad the pealized least squares methd. T keep thigs simple, csider ly uivariate x. Observig (Y i, X i ), = 1,...,, e may estimate f(x) via ˆη h (x) = 1 ( ) Xi x Y i K, h h where K( ) is a give kerel fucti. Typically, K(x) is uimdal ad symmetric with respect t 0, K(x)dx = 1, ad x j K(x)dx = 0 fr j = 1,..., m. This prvides a family f estimates idexed by the badwidth h, with smaller h yieldig smaller bias but larger variace ad larger h yieldig smaller variace but larger bias. Assumig (η (m) (x)) 2 dx <, e may estimate η(x) usig the miimizer η λ (x) f 1 (Y i η(x i )) 2 + λ (η (m) (x)) 2 dx, (1) where the smthig parameter λ plays similar rle as the h fr kerel estimates. With λ = e frces a parametric mdel f the frm η(x) = β 0 + β 1 x + + β m 1 x m 1. With λ = 0 + e btais a iterplat with miimum (η (m) (x)) 2 dx. The sluti is called a smthig (atural) splie as η λ is a piece-wise plymial f rder 2m 1. Evaluatig the estimated η(x) at the data pits X i, e gets the predicted values Ŷi. Fr the keral estimates, smthig splie estimates, ad ther parametric estimates f η(x) kw as liear smthers, e has Ŷ = AY, where A is the smthig matrix idexed by h r λ r the like. I practice, e eeds t select h r λ, fr which the methds i the secti title are desiged; the smthig matrix A plays a imprtat rle here. Write Ŷi = η λ (X i ) with η λ (x) the miimizer f (1), ad Ŷi(i) = η [i] λ (X i), where η [i] λ (x) miimizes the delete-e versi f (1), 1 (Y j η(x j )) 2 + λ (η (m) (x)) 2 dx. (2) j i It ca be shw that η [i] λ (x) is the miimizer f (1) with η[i] λ (X i) replacig Y i, thus a i,i (Y i Ŷi(i)) = Ŷ i Ŷi(i), r Y i Ŷi(i) = (Y i Ŷi)/(1 a i,i ). This leads t the rdiary crss-validati scre V 0 (λ) = 1 (Y i Ŷi(i)) 2 = 1 (Y i Ŷi) 2 (1 a i,i ) 2. (3) A ivariace argumet suggests the replacemet f a i,i by their average value tracea/, yieldig the geeralized crss-validati (GCV) f Crave ad Wahba (1979), V (λ) = 1 Y T (I A(λ)) 2 Y { 1 trace(i A(λ))} 2. (4) 2

3 Nte that althugh the derivati f GCV is thrugh (1), it ca be used ad d get used fr all liear smthers. Clsely related t GCV is the C L scre, which assumes a kw σ Optimality f U(λ) ad V (λ) U(λ) = 1 YT (I A(λ)) 2 Y + 2 σ2 tracea(λ), (5) Befre presetig the theretical justificati f C L ad GCV, let me try t clarify a few cceptual issues. I parametric statistics, e has a discrete cllecti f tetative mdels ad assumes that the crrect mdel is amg them, ad a mdel selecti methd is csistet if it zeres i t this crrect mdel as. The csistecy prperty is t predicti-rieted as i the desig f PRESS, r is it fiite-sample MSE-rieted as i the practical use f C p ; with fr fixed umber f parameters, variace is f ccer ad all that matters is the bias. [I am t familiar with this lie f literature, s I d t kw if there are results whe e f the tetative mdels is crrect, r whether e culd frmulate the prblem i such a way that the umber f parameters icrease with s bias-variace trade-ff remais relevat.] I parametric fucti estimati, the family f tetative estimates frm a trajectry i the fucti space ad e culd t ad des t assume that the true fucti is the trajectry. Istead, e lks fr the estimate that is clsest t the true fucti i sme sese. The bjective f such mdel selecti is t t idetify the crrect mdel (there is e), but rather t lcate the best estimate give the bserved data. Bias-variace trade-ff is the cetral issue here (thugh t always explicitly), ad as, the ptimal chice wuld have smaller ad smaller bias. Fr Gaussia regressi, a atural measure fr estimati precisi is the MSE the data pits, L(λ) = 1 (ˆη λ (X i ) η(x i )) 2, ad the estimate the trajectry that miimizes this MSE wuld be the ptimal chice. Nte that the trajectry is depedet the bserved data, s is the MSE lss ad its miimizer. The C L scre f (5) is actually a ubiased estimate f relative lss, with relative meaig the drppig f terms that d t deped λ. The ptimality f U(λ) ad V (λ) were established by Ker-Chau Li (1986): Let λ, λ u, ad λ v be the miimizers f L(λ), U(λ), ad V (λ), respectively, the L(λ ) L(λ u ) p 1, L(λ ) L(λ v ) p 1. (6) The key results leadig t these are that as λ 0 at certai rates that ctai the ptimal e, U(λ) L(λ) 1 ɛ T ɛ = p (L(λ)), V (λ) L(λ) 1 ɛ T ɛ = p (L(λ)); The mai cditi fr these is that R(λ), where R(λ) = E[L(λ)], which states that the parametric -csistecy is t achievable i the settig. Nte that U(λ) V (λ) p 1 ad L(λ) = p (1), s these are delicate results. Fr the recrd, as ad λ 0, L(λ) = O p (λ + 1 λ 1/2m ) fr the miimizer f (1). 3

4 1.4 Crss-validati fr desity estimati Csider a desity estimati prblem with idepedet samples X i f(x), i = 1,...,. estimate f(x), e may use the kerel estimate, f h (x) = 1 h ( ) Xi x K, h where K( ) is as give earlier. T assess the perfrmace f f h (x) as a estimate f f(x), e may use the Kullback-Leibler lss L(h) = E f [lg{f(x)/f h (X)}], which, after drppig the term E f [lg f(x)] t ivlvig h, reduces t the relative KL discrepacy E f [lg f h (X)]. Estimatig the relative KL by crss-validated sample mea, e has the KL crss-validati scre, T V (h) = 1 lg f [i] h (X i), (7) where f [i] h (x) is based the ( 1) samples excludig X i. It was shw by Peter Hall (1987) that if the tails f the kerel K(x) are thier tha the tails f f(x), the L(h ) L(h v ) p 1, (8) where h ad h v miimizes L(h) ad V (h), respectively. Parallel t (1), e may assume a fiite supprt X fr f(x) ad emply the lgistic desity trasfrm f(x) = e η(x) / X eη, ad estimate η(x) by the miimizer η λ (x) f the pealized likelihd fuctial, 1 { η(x i ) lg e η(x)} + λ (η (m) (x)) 2 dx. (9) X 2 The Kullback-Leibler distace is w L(λ) = E f [lg{f(x)/f λ (X)}] = E f [η(x) η λ (X)] + {lg X e ηλ(x) lg e η(x) }, (10) X with a relative KL discrepacy lg e ηλ(x) E f [η λ (X)], X where the first term ca be cmputed ad the secd term ca be estimated thrugh a crssvalidated sample mea, 1 η[i] λ (X i), with η [i] λ (x) miimizig the delete-e versi f (9), 1 1 j i { η(x j ) lg X e η(x)} + λ 2 (η (m) (x)) 2 dx. This yields a crss-validati scre V (λ) = lg e ηλ(x) 1 η [i] X λ (X i). (11) 4

5 While empirical results strgly suggest ptimality similar t thse established by Li (1806) ad Hall (1987), attempts the theretical aalysis have t bee successful. It ca be shw that the symmetrized KL lss, E f [lg{f(x)/f λ (X)}] E fλ [lg{f(x)/f λ (X)}], which rughly dubles the KL lss L(λ) f (10), is f the rder O p (λ + 1 λ 1/2m ), s the miimum KL lss is f rder O p ( 2m/(1+2m) ) = p ( 1/2 ). O the ther had, the estimati f E f [g(x)] thrugh the sample mea is at best f the rder O p ( 1/2 ). Oe seems t eed mre delicate term grupig fr ay success at a theretical aalysis f V (λ). 1.5 Crss-validati fr -Gaussia regressi ad hazard estimati Fr -Gaussia regressi ad hazard estimati, scres similar t (11) were derived fllwig similar lies. The empirical perfrmaces f these scres suggest ptimality prperties similar t (6) ad (8), but theretical aalysis is lackig. 1.6 Crss-validati fr regressi with crrelated errrs Fr regressi with crrelated data, tw scearis have bee treated i tw dissertatis by my frmer studets, Pig Ma ad Chu Ha, respectively. The first thesis was by Pig Ma, which ccers mixed-effect/variace-cmpet mdels f the frm Y = η(x) + z T b + ɛ, where b N(0, B) ad ɛ i.i.d N(0, σ 2 ), with the dimesi p f b much smaller tha ad its variace-cvariace matrix partly r etirely ukw; the errr variace-cvariace matrices are lw-rak mdificatis f σ 2 I. T estimate η(x), e may miimize 1 (Y i η(x i ) z T i b) 2 + λ (η (m) (x)) 2 dx + 1 bt Σb, where (λ, Σ) are tuig parameters; Σ shuld reflect the structure f B 1. The jit estimati f (η(x), b) yields Ŷ = ˆη(x) + zt ˆb, ad e still has a expressi Ŷ = A(λ, Σ)Y. The selecti f (λ, Σ) ca be de usig C L ad GCV, ad we were able t establish the ptimality f such practice similar t (6), with the lsses L 1 (λ, Σ) = 1 (Ŷi η(x i ) z T i b) 2 ad L 2 (λ, Σ) = 1 (ˆη η)t P Z (ˆη η), where η T = (η(x 1 ),..., η(x )), P Z = I Z(ZT Z) 1 Z T, ad Z T = (z 1,....z ). Nte that we are t ccered with the estimati f B i this exercise, but ly use Σ (r B 1 ) as tuig parameters fr the estimati f η(x) r η(x) + z T b. Fr the ptimality with respect t L 1, e eeds p = O( ). Fr L 2, e eeds p fixed. Similar practice was applied t -Gaussia regressi with satisfactry empirical perfrmace, but thery is lackig. 5

6 The secd thesis was by Chu Ha, which ccers statiary time series mdels r mixedeffect mdels with the dimesi f b grwig with ; the errr variace-cvariace matrices are lger lw-rak mdificatis f σ 2 I. Frmally, e has Y i = η(x i ) + ɛ i, ɛ N(0, σ 2 W 1 ), where W is kw up t a set f parameter γ. Oe may estimate η(x) via the miimizati f (Y η) T W (Y η) + λ (η (m) (x)) 2 dx, with (λ, γ) as tuig parameters. Fr the jit selecti f (λ, γ), we derived the C L like scre U(λ, γ) = 1 σ 2 YT W 1/2 (I A) 2 W 1/2 Y 1 lg W + 2 tracea, where W 1/2 Ŷ = AW 1/2 Y, ad the GCV like scre V (λ, γ) = lg{ 1 Y T W 1/2 (I A) 2 W 1/2 Y} 1 lg W + 2 tracea tracea. Remember that W depeds γ ad A depeds (λ, γ). Usig the Kullback-Leibler lss fr the jit estimati f (η 0 (x), γ 0 ) by (η(x), γ), [ 1 L(λ, γ) = E 0 2σ 2 (Y η)t W (Y η) 1 1 lg W 2 2σ 2 (Y η 0) T W 0 (Y η 0 ) + 1 ] 2 lg W 0 = 1 2σ 2 (η 0 η) T W (η 0 η) tr(w W0 I) 1 1 lg W W0, 2 2 the ptimality similar t (6) was established fr U ad V, ad the resultig ˆγ is -csistet. A key cditi fr the thery is that W 1 > ci fr sme c > 0 uifrmly ver the tetative γ. The thery applies t the stadard statiary ad ivertible ARMA mdels with γ i a cmpact set, ad t a mixed-effect mdel with p. 1.7 Refereces PRESS is i early every textbk regressi aalysis ad liear mdels, thugh I d t kw exactly wh first prpsed it. T read abut C p ad C L, check ut the classical referece f Mallws. Mallws, C. L. (1973). Sme cmmets C P, Techmetrics 15, Fr the mtivati ad derivati f GCV alg with sme attempt the theretical justificati f U(λ) ad V (λ), check ut the semial wrk f Crave ad Wahba (1979); the risk-based thery des t really justify its practical use, hwever. The really relevat lss-based justificati f U(λ) ad V (λ) was by Li (1986). Crave, P. ad G. Wahba (1979). Smthig isy data with splie fuctis: Estimatig the crrect degree f smthig by the methd f geeralized crss-validati, Numer. Math. 31, Li, K.-C. (1986). Asympttic ptimality f C L ad geeralized crss-validati i the ridge regressi with applicati t splie smthig, A. Statist. 14,

7 Kerel desity estimati ad the assciated badwidth selecti was ce a busiess by itself. The wrk f Hall (1987) fr the justificati f KL-based crss-validati scre (7) was a imprtat ctributi, but it was take by may as egative the KL distace as perfrmace measure ad egative CV as badwidth selectr, ad partly ispired the develpmets f the s-called plug-i methds fr badwidth selecti; I csider this a ufrtuate tur f evet. Hall, P. (1987). O Kullback-Leibler lss ad desity estimati, A. Statist. 15, The implemetati ad empirical perfrmace f (11) ca be fud i my jit wrk with Jigyua Wag ad i Chapter 6 f my bk. Discussis ccerig -Gaussia regressi ad hazard estimati are i Chapters 5 ad 7 f my bk. Gu, C. ad J. Wag (2003). Pealized likelihd desity estimati: Direct crss-validati ad scalable apprximati, Statist. Si. 13, Gu, C. (2002). Smthig Splie ANOVA Mdels. New Yrk: Spriger-Verlag. Details ccerig crss-validati fr regressi with crrelated errrs are t be fud i the fllwig articles. Gu, C. ad P. Ma (2005). Optimal smthig i parametric mixed-effect mdels, A. Statist. 33, Gu, C. ad P. Ma (2005). Geeralized parametric mixed-effect mdels: Cmputati ad smthig parameter selecti, J. Cmput. Graph. Statist. 14, Ha, C. ad C. Gu (2006). Optimal smthig with crrelated data, mauscript. 2 Prblems (Real ad Perceived), Ccepts, ad Ctrversies 2.1 A simple simulati settig We will use a simple simulati settig t illustrate sme f the issues t be discussed. The issues are t specific t this particular settig, as similar simulatis i ther settigs demstrate the same qualitative characteristics. O x i = (i.5)/, i = 1,,, we geerate 100 replicates f data frm Y i = η(x i ) + ɛ i, ɛ i N(0, σ 2 ), with η(x) = si(2πx π) ad σ 2 = 1. Fr λ a fie grid f lg 10 λ = ( 5)(.05)( 1), we calculate the miimizers f (1) with m = 2 fr each f the replicates ad evaluate varius quatities assciated with them. The grid was brad eugh t bracket the λ f iterest fr all the 100 replicates. 2.2 Udersmthig ad mdificatis f crss-validati Oe prblem suffered by crss-validati methds is udersmthig: i up t 10% f the cases, the methds lead t very small λ r h r the like, resultig i udersmthig r eve iterplati. The prblem des t seem t g away with larger, at least t fr up t 500. A alterative t V (λ) f (4) is the s-called geeralized maximum likelihd (GML) methd derived uder the empirical Bayes iterpretati f smthig splies, which is simply the restricted 7

8 L(λ) f GCV mi L(λ) lg 10 (λ) f V 1.4 (λ) r M(λ) lg 10 (λ) f V 1 (λ) Relative Efficacy M α Figure 1: Left: Perfrmaces f V α (λ) with α = 1 (faded) ad α = 1.4 fr = 100. Ceter: the λ miimizig V 1 (λ) versus that miimizig M(λ) (faded) r V 1.4 (λ), fr = 100. Right: mi L(λ)/L(ˆλ) with ˆλ miimizig M(λ) r V α (λ) at α = 1, 1.2, 1.4, 1.6, 1.8, fr = 100 (fatter bxes) ad = 500 (thier bxes). maximum likelihd (REML) methd fr mixed-effect/variace cmpet mdels. The GMLscre is give by M(λ) = 1 Y T (I A(λ))Y I A(λ) 1/( m), + where I A + is the prduct f the m psitive eigevalues f (I A). The GML methd ever iterplates, but csistetly udersmthes fr η(x) super smth. A simple mdificati seems t cure the udersmthig prblem fr crss-validati. Fr V (λ) f (4), e may use V α (λ) = 1 Y T (I A(λ)) 2 Y { 1 trace(i αa(λ))} 2, (12) where α > 1, ad fr V (λ) f (11), e may use V α (λ) = lg e ηλ(x) 1 X η λ (X i ) + α 1 {η λ (X i ) η [i] λ (X i)}. (13) Simulati studies suggest that a α i the rage wuld be the mst effective. I the settig f 2.1, e may calculate the lss L(λ) as well as the selecti scres V α (λ) ad M(λ) the λ grid fr all the replicates, ad idetify λ, λ v alg with the assciated L(λ). Figure 1 summarizes sme f the empirical results. A s-called exteded expetial (EE) methd was prpsed by Ku ad Efr (2002) t cmbie the stregths f C p ad GML, but I am t able t fllw the argumets. 2.3 Negative crrelati ad mdel idexig Oe majr criticism f crss-validati i the literature was the famus egative crrelati betwee the ptimal ad crss-validated badwidths, as demstrated i the middle frame f Figure 2. The egative crrelati is bthersme ly whe the idex λ is meaigful acrss-replicate, hwever, which will be aalyzed belw. 8

9 L(λ) f GCV * mi L(λ) lg 10 (λ v ) lg 10 (λ ) lg 10 (ρ v ) lg 10 (ρ ) Figure 2: Left: Perfrmaces f V 1 (λ) fr = 100. Ceter: lg 10 (λ ) versus lg 10 (λ v ). Right: lg 10 (ρ ) versus lg 10 (ρ v ). lg 10 (λ) lg 10 (ρ) lg 10 (λ) lg 10 (ρ) Efficacy f Typical ρ r λ ρ λ Figure 3: Left ad Ceter: The ρ-λ mappig ad the (ρ, λ ) pair f 100 replicates. Right: Efficacy f typical ρ r λ. Remember that ulike i the parametric settigs where e selects frm a discrete set f mdels, we are chsig frm a ctiuum f tetative estimates η λ (x). A imprtat issue is hw t alig estimates based differet data. Mathematically, the miimizer f (1) is the sluti t a cstraied LS prblem, mi (Y i η(x i )) 2, s.t. (η (m) (x)) 2 dx ρ fr sme ρ > 0. There is a e-t-e crrespdece betwee λ ad ρ give the data. Ituitively, ρ is meaigful acrss-replicate as it impses the same cstrait η(x) i the estimati prcess regardless the data e bserves, whereas the same λ implies differet cstraits fr differet bservatis. A simple simulati will cfirm that it is ideed the case, that η λ (x) based differet data shuld be aliged by the crrespdig ρ, but we first bserve i the right frame f Figure 2 that the egative crrelati disappears the ρ scale. I the settig f 2.1, e may calculate ρ = 1 0 (η(2) (x)) 2 dx fr all estimates η λ (x) the λ grid, fr all replicates, ad fr the true fucti η(x) = si(2πx π); the true fucti has ρ = I the left ad middle frames f Figure 3, we demstrate the ρ-λ mappig fr 100 9

10 replicates f samples f size = 100 ad idetify the 100 ptimal (ρ, λ ) the grid; it is reassurig t see that is i the middle f the ρ s. T assess the acrss-replicate iterpretability f ρ ad λ, we pick a typical ptimal ρ = , ad a typical ptimal λ = media(λ ), ad calculates their efficacy ver the replicates via L(ρ )/L( ρ) ad L(λ )/L( λ) which is shw i the right frame f Figure 3. The tight spread f ρ s cfirms the ituiti that ρ is the prper mdel idex here. Besides smthig splies, there des t seem t exist a ρ-idex fr ther smthig methds. Nevertheless, mdel idexig remais a imprtat issue, which has subtle implicatis i the thery ad practice f badwidth selecti. Eve fr smthig splies, the ρ-idex is difficult t wrk with, ad the idetificati f it as the prper mdel idex des t ffer ay peratial help. It hwever helps t idetify sme ppular but questiable ccepts ad practices. Oe f the misleadig ccept is the degree-f-freed i regressi, which is defied as tracea: give x i s, λ-tracea is a e-t-e mappig, s the selecti f λ thrugh tracea simply leaves thigs i the hads f radm ise ɛ i. I classical parametric statistics, the degree-ffreedm is defied i terms f mdel dimesi, ad it is relevat i settigs ther tha regressi. The cicidece that the trace f hat matrix i liear regressi matches the mdel dimesi des t autmatically qualify the trace as a valid mdel idex. I fact, tracea is much wrse tha λ, as it allws e t cmpare A(λ) with A(h) while they are t cmparable. Resamplig is widely used i may phases f statistical aalysis. Wrkig with a idex such as λ that is t meaigful acrss-replicate, hwever, e shuld avid usig resamplig fr badwidth selecti. Besides the egative crrelati, ather criticism agaist crss-validati is that the crssvalidated badwidth cverges very slwly asympttically. Tw aspects i this argumet eed clse scrutiy: i) if the target f cvergece is smethig like a fixed λ, the it is wrry because the ptimal λ chages with data ad λ v may simply be chasig λ. ii) The badwidth is t part f the stchastic settig but a tuig parameter i the estimati prcess, ad its meaig is ly thrugh L(λ), s it s kay fr λ v t be far frm λ as lg as L(λ )/L(λ v ) Lss versus risk Risk calculati is a basic exercise i the theretical aalysis f statistical prcedures, but whe de usig a mdel idex t iterpretable acrss-replicate, the prper use f it ca be tricky. Imagie that yu are studyig the MSE perfrmace f LS regressi empirically, ad yu have 3 mdels M 1 = {µ = E[Y ] = β 0 + β 1 x}, M 2 = {µ = β 0 + β 1 x + β 2 x 2 }, ad M 3 = {µ = β 0 + β 1 x + β 2 x 2 + β 3 x 3 }. After geeratig data, yu calculate ˆµ (j), j = 1, 2, 3, uder M j ad calculate the MSE 1 (ˆµ(j) i µ i ) 2, ad yu average ver replicates the MSEs crrespdig t the same ˆµ (j). Yu d be crazy t take ˆµ (1) frm the first 20 replicates t average with ˆµ (2) frm the ext 40 replicates. While t a much mir extet, the calculati f R(λ) = E[L(λ)] is like mixig ˆµ (1) s frm sme replicates with ˆµ (2) s frm ther replicates i the abve example. Yu may tice that I ever call η λ (x) a estimatr but ly a estimate give the data. May wuld call the miimizer f R(λ) ptimal, but sice I ever bserve a average sample, I csider a risk-ptimal λ meaigless, ad d t care abut ay cvergece twards it. While derivative ccepts based R(λ) have meaig i my bks, R(λ) serves as a imprtat aalytical device. Fr example, t prve V (λ) L(λ) 1 ɛ T ɛ = p (L(λ)), e prceeds by shwig that V (λ) L(λ) 1 ɛ T ɛ = p (R(λ)) ad that L(λ) R(λ) = p (R(λ)); t establish L(λ) = O p (λ + 1 λ 1/2m ), e simply shw that R(λ) = O(λ + 1 λ 1/2m ). 10

11 Risk calculati usig the ρ-idex wuld be cceptually meaigful, ly if it were pssible. Althugh e questis the appeal f lss-ptimal badwidth ad mst d simulatis i terms f it (egative crrelati wuld t be there therwise), may authrs claim that the data d t ctai eugh ifrmati fr e t pursue the lss-ptimal badwidth. I view f the results by Li (1986) ad Hall (1987), such claims are misleadig. 2.5 Refereces Fr the derivati f GML ad its asympttic aalysis, check Wahba (1985). The GML alway pick λ 2m/(2m+1) but fr η(x) super smth the ptimal e is λ 2m/(4m+1). Wahba, G. (1985). A cmparis f GCV ad GML fr chsig the smthig parameter i the geeralized splie smthig prblem, A. Statist. 13, The mdificatis f crss-validati i (12) ad (13) ad the empirical perfrmaces ca be fud i the articles based the theses f Jigyua Wag ad Yug-Ju Kim. Gu, C. ad J. Wag (2003). Pealized likelihd desity estimati: Direct crss-validati ad scalable apprximati, Statist. Si. 13, Kim, Y.-J. ad C. Gu (2004). Smthig splie Gaussia regressi: Mre scalable cmputati via efficiet apprximati, J. Ry. Statist. Sc. Ser. B 66, Details f the EE methd ca be fud i the referece belw. Ku, S. ad B. Efr (2002). Smthers ad the C p, GML ad EE criteria: A gemetric apprach, J. Amer. Statist. Assc. 97, The egative crrelati betwee crss-validated badwidth ad the ptimal badwidth was bserved by may, ad was publicized by Sctt ad Terrell (1987) ad Hall ad Jhste (1992), thugh all tk it at the face value ad tried t fix it, which is uecessary. Sctt, D. W. ad G. R. Terrell (1987). Biased ad ubiased crss-validati i desity estimati, J. Amer. Statist. Assc. 82, Hall, P. ad I. Jhste (1992). Empirical fuctials ad efficiet smthig parameter selecti (with discussi) J. Ry. Statist. Sc Ser. B 54, Detailed discussi f mdel idexig ad the ramificatis i badwidth selecti ca be fud i the fllwig referece. Gu, C. (1998). Mdel idexig ad smthig parameter selecti i parametric fucti estimati (with discussi), Statist. Si. 8,

5.1 Two-Step Conditional Density Estimator

5.1 Two-Step Conditional Density Estimator 5.1 Tw-Step Cditial Desity Estimatr We ca write y = g(x) + e where g(x) is the cditial mea fucti ad e is the regressi errr. Let f e (e j x) be the cditial desity f e give X = x: The the cditial desity

More information

BIO752: Advanced Methods in Biostatistics, II TERM 2, 2010 T. A. Louis. BIO 752: MIDTERM EXAMINATION: ANSWERS 30 November 2010

BIO752: Advanced Methods in Biostatistics, II TERM 2, 2010 T. A. Louis. BIO 752: MIDTERM EXAMINATION: ANSWERS 30 November 2010 BIO752: Advaced Methds i Bistatistics, II TERM 2, 2010 T. A. Luis BIO 752: MIDTERM EXAMINATION: ANSWERS 30 Nvember 2010 Questi #1 (15 pits): Let X ad Y be radm variables with a jit distributi ad assume

More information

Ch. 1 Introduction to Estimation 1/15

Ch. 1 Introduction to Estimation 1/15 Ch. Itrducti t stimati /5 ample stimati Prblem: DSB R S f M f s f f f ; f, φ m tcsπf t + φ t f lectrics dds ise wt usually white BPF & mp t s t + w t st. lg. f & φ X udi mp cs π f + φ t Oscillatr w/ f

More information

D.S.G. POLLOCK: TOPICS IN TIME-SERIES ANALYSIS STATISTICAL FOURIER ANALYSIS

D.S.G. POLLOCK: TOPICS IN TIME-SERIES ANALYSIS STATISTICAL FOURIER ANALYSIS STATISTICAL FOURIER ANALYSIS The Furier Represetati f a Sequece Accrdig t the basic result f Furier aalysis, it is always pssible t apprximate a arbitrary aalytic fucti defied ver a fiite iterval f the

More information

ENGI 4421 Central Limit Theorem Page Central Limit Theorem [Navidi, section 4.11; Devore sections ]

ENGI 4421 Central Limit Theorem Page Central Limit Theorem [Navidi, section 4.11; Devore sections ] ENGI 441 Cetral Limit Therem Page 11-01 Cetral Limit Therem [Navidi, secti 4.11; Devre sectis 5.3-5.4] If X i is t rmally distributed, but E X i, V X i ad is large (apprximately 30 r mre), the, t a gd

More information

Chapter 3.1: Polynomial Functions

Chapter 3.1: Polynomial Functions Ntes 3.1: Ply Fucs Chapter 3.1: Plymial Fuctis I Algebra I ad Algebra II, yu ecutered sme very famus plymial fuctis. I this secti, yu will meet may ther members f the plymial family, what sets them apart

More information

ENGI 4421 Central Limit Theorem Page Central Limit Theorem [Navidi, section 4.11; Devore sections ]

ENGI 4421 Central Limit Theorem Page Central Limit Theorem [Navidi, section 4.11; Devore sections ] ENGI 441 Cetral Limit Therem Page 11-01 Cetral Limit Therem [Navidi, secti 4.11; Devre sectis 5.3-5.4] If X i is t rmally distributed, but E X i, V X i ad is large (apprximately 30 r mre), the, t a gd

More information

Quantum Mechanics for Scientists and Engineers. David Miller

Quantum Mechanics for Scientists and Engineers. David Miller Quatum Mechaics fr Scietists ad Egieers David Miller Time-depedet perturbati thery Time-depedet perturbati thery Time-depedet perturbati basics Time-depedet perturbati thery Fr time-depedet prblems csider

More information

Unifying the Derivations for. the Akaike and Corrected Akaike. Information Criteria. from Statistics & Probability Letters,

Unifying the Derivations for. the Akaike and Corrected Akaike. Information Criteria. from Statistics & Probability Letters, Uifyig the Derivatis fr the Akaike ad Crrected Akaike Ifrmati Criteria frm Statistics & Prbability Letters, Vlume 33, 1997, pages 201{208. by Jseph E. Cavaaugh Departmet f Statistics, Uiversity f Missuri,

More information

Author. Introduction. Author. o Asmir Tobudic. ISE 599 Computational Modeling of Expressive Performance

Author. Introduction. Author. o Asmir Tobudic. ISE 599 Computational Modeling of Expressive Performance ISE 599 Cmputatial Mdelig f Expressive Perfrmace Playig Mzart by Aalgy: Learig Multi-level Timig ad Dyamics Strategies by Gerhard Widmer ad Asmir Tbudic Preseted by Tsug-Ha (Rbert) Chiag April 5, 2006

More information

Solutions. Definitions pertaining to solutions

Solutions. Definitions pertaining to solutions Slutis Defiitis pertaiig t slutis Slute is the substace that is disslved. It is usually preset i the smaller amut. Slvet is the substace that des the disslvig. It is usually preset i the larger amut. Slubility

More information

Grade 3 Mathematics Course Syllabus Prince George s County Public Schools

Grade 3 Mathematics Course Syllabus Prince George s County Public Schools Ctet Grade 3 Mathematics Curse Syllabus Price Gerge s Cuty Public Schls Prerequisites: Ne Curse Descripti: I Grade 3, istructial time shuld fcus fur critical areas: (1) develpig uderstadig f multiplicati

More information

Multi-objective Programming Approach for. Fuzzy Linear Programming Problems

Multi-objective Programming Approach for. Fuzzy Linear Programming Problems Applied Mathematical Scieces Vl. 7 03. 37 8-87 HIKARI Ltd www.m-hikari.cm Multi-bective Prgrammig Apprach fr Fuzzy Liear Prgrammig Prblems P. Padia Departmet f Mathematics Schl f Advaced Scieces VIT Uiversity

More information

A Study on Estimation of Lifetime Distribution with Covariates Under Misspecification

A Study on Estimation of Lifetime Distribution with Covariates Under Misspecification Prceedigs f the Wrld Cgress Egieerig ad Cmputer Sciece 2015 Vl II, Octber 21-23, 2015, Sa Fracisc, USA A Study Estimati f Lifetime Distributi with Cvariates Uder Misspecificati Masahir Ykyama, Member,

More information

Internal vs. external validity. External validity. Internal validity

Internal vs. external validity. External validity. Internal validity Secti 7 Mdel Assessmet Iteral vs. exteral validity Iteral validity refers t whether the aalysis is valid fr the pplati ad sample beig stdied. Exteral validity refers t whether these reslts ca be geeralized

More information

Intermediate Division Solutions

Intermediate Division Solutions Itermediate Divisi Slutis 1. Cmpute the largest 4-digit umber f the frm ABBA which is exactly divisible by 7. Sluti ABBA 1000A + 100B +10B+A 1001A + 110B 1001 is divisible by 7 (1001 7 143), s 1001A is

More information

TECHNICAL REPORT NO Generalization and Regularization in Nonlinear Learning Systems 1

TECHNICAL REPORT NO Generalization and Regularization in Nonlinear Learning Systems 1 DEPARTMENT OF STATISTICS Uiversity f Wiscsi 1210 West Dayt St. Madis, WI 53706 TECHNICAL REPORT NO. 1015 February 28, 2000 i Nliear Learig Systems 1 by Grace 1 Prepared fr the Hadbk f Brai Thery ad Neural

More information

Fourier Series & Fourier Transforms

Fourier Series & Fourier Transforms Experimet 1 Furier Series & Furier Trasfrms MATLAB Simulati Objectives Furier aalysis plays a imprtat rle i cmmuicati thery. The mai bjectives f this experimet are: 1) T gai a gd uderstadig ad practice

More information

K [f(t)] 2 [ (st) /2 K A GENERALIZED MEIJER TRANSFORMATION. Ku(z) ()x) t -)-I e. K(z) r( + ) () (t 2 I) -1/2 e -zt dt, G. L. N. RAO L.

K [f(t)] 2 [ (st) /2 K A GENERALIZED MEIJER TRANSFORMATION. Ku(z) ()x) t -)-I e. K(z) r( + ) () (t 2 I) -1/2 e -zt dt, G. L. N. RAO L. Iterat. J. Math. & Math. Scl. Vl. 8 N. 2 (1985) 359-365 359 A GENERALIZED MEIJER TRANSFORMATION G. L. N. RAO Departmet f Mathematics Jamshedpur C-perative Cllege f the Rachi Uiversity Jamshedpur, Idia

More information

COWLES FOUNDATION FOR RESEARCH IN ECONOMICS YALE UNIVERSITY

COWLES FOUNDATION FOR RESEARCH IN ECONOMICS YALE UNIVERSITY HAC ESTIMATION BY AUTOMATED REGRESSION By Peter C.B. Phillips July 004 COWLES FOUNDATION DISCUSSION PAPER NO. 470 COWLES FOUNDATION FOR RESEARCH IN ECONOMICS YALE UNIVERSITY Bx 088 New Have, Cecticut 0650-88

More information

Physical Chemistry Laboratory I CHEM 445 Experiment 2 Partial Molar Volume (Revised, 01/13/03)

Physical Chemistry Laboratory I CHEM 445 Experiment 2 Partial Molar Volume (Revised, 01/13/03) Physical Chemistry Labratry I CHEM 445 Experimet Partial Mlar lume (Revised, 0/3/03) lume is, t a gd apprximati, a additive prperty. Certaily this apprximati is used i preparig slutis whse ccetratis are

More information

Lecture 21: Signal Subspaces and Sparsity

Lecture 21: Signal Subspaces and Sparsity ECE 830 Fall 00 Statistical Sigal Prcessig istructr: R. Nwak Lecture : Sigal Subspaces ad Sparsity Sigal Subspaces ad Sparsity Recall the classical liear sigal mdel: X = H + w, w N(0, where S = H, is a

More information

Markov processes and the Kolmogorov equations

Markov processes and the Kolmogorov equations Chapter 6 Markv prcesses ad the Klmgrv equatis 6. Stchastic Differetial Equatis Csider the stchastic differetial equati: dx(t) =a(t X(t)) dt + (t X(t)) db(t): (SDE) Here a(t x) ad (t x) are give fuctis,

More information

Study the bias (due to the nite dimensional approximation) and variance of the estimators

Study the bias (due to the nite dimensional approximation) and variance of the estimators 2 Series Methods 2. Geeral Approach A model has parameters (; ) where is ite-dimesioal ad is oparametric. (Sometimes, there is o :) We will focus o regressio. The fuctio is approximated by a series a ite

More information

MODIFIED LEAKY DELAYED LMS ALGORITHM FOR IMPERFECT ESTIMATE SYSTEM DELAY

MODIFIED LEAKY DELAYED LMS ALGORITHM FOR IMPERFECT ESTIMATE SYSTEM DELAY 5th Eurpea Sigal Prcessig Cferece (EUSIPCO 7), Pza, Plad, September 3-7, 7, cpyright by EURASIP MOIFIE LEAKY ELAYE LMS ALGORIHM FOR IMPERFEC ESIMAE SYSEM ELAY Jua R. V. López, Orlad J. bias, ad Rui Seara

More information

Matching a Distribution by Matching Quantiles Estimation

Matching a Distribution by Matching Quantiles Estimation Jural f the America Statistical Assciati ISSN: 0162-1459 (Prit) 1537-274X (Olie) Jural hmepage: http://www.tadflie.cm/li/uasa20 Matchig a Distributi by Matchig Quatiles Estimati Niklas Sgurpuls, Qiwei

More information

Comparative analysis of bayesian control chart estimation and conventional multivariate control chart

Comparative analysis of bayesian control chart estimation and conventional multivariate control chart America Jural f Theretical ad Applied Statistics 3; ( : 7- ublished lie Jauary, 3 (http://www.sciecepublishiggrup.cm//atas di:.648/.atas.3. Cmparative aalysis f bayesia ctrl chart estimati ad cvetial multivariate

More information

, the random variable. and a sample size over the y-values 0:1:10.

, the random variable. and a sample size over the y-values 0:1:10. Lecture 3 (4//9) 000 HW PROBLEM 3(5pts) The estimatr i (c) f PROBLEM, p 000, where { } ~ iid bimial(,, is 000 e f the mst ppular statistics It is the estimatr f the ppulati prprti I PROBLEM we used simulatis

More information

A Hartree-Fock Calculation of the Water Molecule

A Hartree-Fock Calculation of the Water Molecule Chemistry 460 Fall 2017 Dr. Jea M. Stadard Nvember 29, 2017 A Hartree-Fck Calculati f the Water Mlecule Itrducti A example Hartree-Fck calculati f the water mlecule will be preseted. I this case, the water

More information

[1 & α(t & T 1. ' ρ 1

[1 & α(t & T 1. ' ρ 1 NAME 89.304 - IGNEOUS & METAMORPHIC PETROLOGY DENSITY & VISCOSITY OF MAGMAS I. Desity The desity (mass/vlume) f a magma is a imprtat parameter which plays a rle i a umber f aspects f magma behavir ad evluti.

More information

Study of Energy Eigenvalues of Three Dimensional. Quantum Wires with Variable Cross Section

Study of Energy Eigenvalues of Three Dimensional. Quantum Wires with Variable Cross Section Adv. Studies Ther. Phys. Vl. 3 009. 5 3-0 Study f Eergy Eigevalues f Three Dimesial Quatum Wires with Variale Crss Secti M.. Sltai Erde Msa Departmet f physics Islamic Aad Uiversity Share-ey rach Ira alrevahidi@yah.cm

More information

Claude Elysée Lobry Université de Nice, Faculté des Sciences, parc Valrose, NICE, France.

Claude Elysée Lobry Université de Nice, Faculté des Sciences, parc Valrose, NICE, France. CHAOS AND CELLULAR AUTOMATA Claude Elysée Lbry Uiversité de Nice, Faculté des Scieces, parc Valrse, 06000 NICE, Frace. Keywrds: Chas, bifurcati, cellularautmata, cmputersimulatis, dyamical system, ifectius

More information

Statistica Sinica 6(1996), SOME PROBLEMS ON THE ESTIMATION OF UNIMODAL DENSITIES Peter J. Bickel and Jianqing Fan University of California and U

Statistica Sinica 6(1996), SOME PROBLEMS ON THE ESTIMATION OF UNIMODAL DENSITIES Peter J. Bickel and Jianqing Fan University of California and U Statistica Siica 6(996), 23-45 SOME PROBLEMS ON THE ESTIMATION OF UNIMODAL DENSITIES Peter J. Bickel ad Jiaqig Fa Uiversity f Califria ad Uiversity f Nrth Carlia Abstract: I this paper, we study, i sme

More information

Super-efficiency Models, Part II

Super-efficiency Models, Part II Super-efficiec Mdels, Part II Emilia Niskae The 4th f Nvember S steemiaalsi Ctets. Etesis t Variable Returs-t-Scale (0.4) S steemiaalsi Radial Super-efficiec Case Prblems with Radial Super-efficiec Case

More information

Active redundancy allocation in systems. R. Romera; J. Valdés; R. Zequeira*

Active redundancy allocation in systems. R. Romera; J. Valdés; R. Zequeira* Wrkig Paper -6 (3) Statistics ad Ecmetrics Series March Departamet de Estadística y Ecmetría Uiversidad Carls III de Madrid Calle Madrid, 6 893 Getafe (Spai) Fax (34) 9 64-98-49 Active redudacy allcati

More information

Axial Temperature Distribution in W-Tailored Optical Fibers

Axial Temperature Distribution in W-Tailored Optical Fibers Axial Temperature Distributi i W-Tailred Optical ibers Mhamed I. Shehata (m.ismail34@yah.cm), Mustafa H. Aly(drmsaly@gmail.cm) OSA Member, ad M. B. Saleh (Basheer@aast.edu) Arab Academy fr Sciece, Techlgy

More information

Regression Quantiles for Time Series Data ZONGWU CAI. Department of Mathematics. Abstract

Regression Quantiles for Time Series Data ZONGWU CAI. Department of Mathematics.   Abstract Regressi Quatiles fr Time Series Data ZONGWU CAI Departmet f Mathematics Uiversity f Nrth Carlia Charltte, NC 28223, USA E-mail: zcai@ucc.edu Abstract I this article we study parametric estimati f regressi

More information

Mean residual life of coherent systems consisting of multiple types of dependent components

Mean residual life of coherent systems consisting of multiple types of dependent components Mea residual life f cheret systems csistig f multiple types f depedet cmpets Serka Eryilmaz, Frak P.A. Cle y ad Tahai Cle-Maturi z February 20, 208 Abstract Mea residual life is a useful dyamic characteristic

More information

MATH Midterm Examination Victor Matveev October 26, 2016

MATH Midterm Examination Victor Matveev October 26, 2016 MATH 33- Midterm Examiati Victr Matveev Octber 6, 6. (5pts, mi) Suppse f(x) equals si x the iterval < x < (=), ad is a eve peridic extesi f this fucti t the rest f the real lie. Fid the csie series fr

More information

Preliminary Test Single Stage Shrinkage Estimator for the Scale Parameter of Gamma Distribution

Preliminary Test Single Stage Shrinkage Estimator for the Scale Parameter of Gamma Distribution America Jural f Mathematics ad Statistics, (3): 3-3 DOI:.593/j.ajms.3. Prelimiary Test Sigle Stage Shrikage Estimatr fr the Scale Parameter f Gamma Distributi Abbas Najim Salma,*, Aseel Hussei Ali, Mua

More information

Sound Absorption Characteristics of Membrane- Based Sound Absorbers

Sound Absorption Characteristics of Membrane- Based Sound Absorbers Purdue e-pubs Publicatis f the Ray W. Schl f Mechaical Egieerig 8-28-2003 Sud Absrpti Characteristics f Membrae- Based Sud Absrbers J Stuart Blt, blt@purdue.edu Jih Sg Fllw this ad additial wrks at: http://dcs.lib.purdue.edu/herrick

More information

Fourier Method for Solving Transportation. Problems with Mixed Constraints

Fourier Method for Solving Transportation. Problems with Mixed Constraints It. J. Ctemp. Math. Scieces, Vl. 5, 200,. 28, 385-395 Furier Methd fr Slvig Trasprtati Prblems with Mixed Cstraits P. Padia ad G. Nataraja Departmet f Mathematics, Schl f Advaced Scieces V I T Uiversity,

More information

ALE 26. Equilibria for Cell Reactions. What happens to the cell potential as the reaction proceeds over time?

ALE 26. Equilibria for Cell Reactions. What happens to the cell potential as the reaction proceeds over time? Name Chem 163 Secti: Team Number: AL 26. quilibria fr Cell Reactis (Referece: 21.4 Silberberg 5 th editi) What happes t the ptetial as the reacti prceeds ver time? The Mdel: Basis fr the Nerst quati Previusly,

More information

AP Statistics Notes Unit Eight: Introduction to Inference

AP Statistics Notes Unit Eight: Introduction to Inference AP Statistics Ntes Uit Eight: Itrducti t Iferece Syllabus Objectives: 4.1 The studet will estimate ppulati parameters ad margis f errrs fr meas. 4.2 The studet will discuss the prperties f pit estimatrs,

More information

Examination No. 3 - Tuesday, Nov. 15

Examination No. 3 - Tuesday, Nov. 15 NAME (lease rit) SOLUTIONS ECE 35 - DEVICE ELECTRONICS Fall Semester 005 Examiati N 3 - Tuesday, Nv 5 3 4 5 The time fr examiati is hr 5 mi Studets are allwed t use 3 sheets f tes Please shw yur wrk, artial

More information

The Excel FFT Function v1.1 P. T. Debevec February 12, The discrete Fourier transform may be used to identify periodic structures in time ht.

The Excel FFT Function v1.1 P. T. Debevec February 12, The discrete Fourier transform may be used to identify periodic structures in time ht. The Excel FFT Fucti v P T Debevec February 2, 26 The discrete Furier trasfrm may be used t idetify peridic structures i time ht series data Suppse that a physical prcess is represeted by the fucti f time,

More information

Lecture 3: August 31

Lecture 3: August 31 36-705: Itermediate Statistics Fall 018 Lecturer: Siva Balakrisha Lecture 3: August 31 This lecture will be mostly a summary of other useful expoetial tail bouds We will ot prove ay of these i lecture,

More information

CHAPTER 24: INFERENCE IN REGRESSION. Chapter 24: Make inferences about the population from which the sample data came.

CHAPTER 24: INFERENCE IN REGRESSION. Chapter 24: Make inferences about the population from which the sample data came. MATH 1342 Ch. 24 April 25 and 27, 2013 Page 1 f 5 CHAPTER 24: INFERENCE IN REGRESSION Chapters 4 and 5: Relatinships between tw quantitative variables. Be able t Make a graph (scatterplt) Summarize the

More information

Portfolio Performance Evaluation in a Modified Mean-Variance-Skewness Framework with Negative Data

Portfolio Performance Evaluation in a Modified Mean-Variance-Skewness Framework with Negative Data Available lie at http://idea.srbiau.ac.ir It. J. Data Evelpmet Aalysis (ISSN 345-458X) Vl., N.3, Year 04 Article ID IJDEA-003,3 pages Research Article Iteratial Jural f Data Evelpmet Aalysis Sciece ad

More information

Smoothing, penalized least squares and splines

Smoothing, penalized least squares and splines Smthing, penalized least squares and splines Duglas Nychka, www.image.ucar.edu/~nychka Lcally weighted averages Penalized least squares smthers Prperties f smthers Splines and Reprducing Kernels The interplatin

More information

A New Method for Finding an Optimal Solution. of Fully Interval Integer Transportation Problems

A New Method for Finding an Optimal Solution. of Fully Interval Integer Transportation Problems Applied Matheatical Scieces, Vl. 4, 200,. 37, 89-830 A New Methd fr Fidig a Optial Sluti f Fully Iterval Iteger Trasprtati Prbles P. Padia ad G. Nataraja Departet f Matheatics, Schl f Advaced Scieces,

More information

Christensen, Mads Græsbøll; Vera-Candeas, Pedro; Somasundaram, Samuel D.; Jakobsson, Andreas

Christensen, Mads Græsbøll; Vera-Candeas, Pedro; Somasundaram, Samuel D.; Jakobsson, Andreas Dwladed frm vb.aau.dk : April 12, 2019 Aalbrg Uiversitet Rbust Subspace-based Fudametal Frequecy Estimati Christese, Mads Græsbøll; Vera-Cadeas, Pedr; Smasudaram, Samuel D.; Jakbss, Adreas Published i:

More information

are specified , are linearly independent Otherwise, they are linearly dependent, and one is expressed by a linear combination of the others

are specified , are linearly independent Otherwise, they are linearly dependent, and one is expressed by a linear combination of the others Chater 3. Higher Order Liear ODEs Kreyszig by YHLee;4; 3-3. Hmgeeus Liear ODEs The stadard frm f the th rder liear ODE ( ) ( ) = : hmgeeus if r( ) = y y y y r Hmgeeus Liear ODE: Suersiti Pricile, Geeral

More information

Machine Learning Brett Bernstein

Machine Learning Brett Bernstein Machie Learig Brett Berstei Week Lecture: Cocept Check Exercises Starred problems are optioal. Statistical Learig Theory. Suppose A = Y = R ad X is some other set. Furthermore, assume P X Y is a discrete

More information

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 5

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 5 CS434a/54a: Patter Recogitio Prof. Olga Veksler Lecture 5 Today Itroductio to parameter estimatio Two methods for parameter estimatio Maimum Likelihood Estimatio Bayesia Estimatio Itroducto Bayesia Decisio

More information

5.80 Small-Molecule Spectroscopy and Dynamics

5.80 Small-Molecule Spectroscopy and Dynamics MIT OpeCurseWare http://cw.mit.edu 5.8 Small-Mlecule Spectrscpy ad Dyamics Fall 8 Fr ifrmati abut citig these materials r ur Terms f Use, visit: http://cw.mit.edu/terms. 5.8 Lecture #33 Fall, 8 Page f

More information

Linear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d

Linear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d Liear regressio Daiel Hsu (COMS 477) Maximum likelihood estimatio Oe of the simplest liear regressio models is the followig: (X, Y ),..., (X, Y ), (X, Y ) are iid radom pairs takig values i R d R, ad Y

More information

THE ASYMPTOTIC PERFORMANCE OF THE LOG LIKELIHOOD RATIO STATISTIC FOR THE MIXTURE MODEL AND RELATED RESULTS

THE ASYMPTOTIC PERFORMANCE OF THE LOG LIKELIHOOD RATIO STATISTIC FOR THE MIXTURE MODEL AND RELATED RESULTS ON THE ASYMPTOTIC PERFORMANCE OF THE LOG LIKELIHOOD RATIO STATISTIC FOR THE MIXTURE MODEL AND RELATED RESULTS by Jayata Kumar Ghsh Idia Statistical I$titute, Calcutta ad Praab Kumar Se Departmet f Bistatistics

More information

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1 EECS564 Estimatio, Filterig, ad Detectio Hwk 2 Sols. Witer 25 4. Let Z be a sigle observatio havig desity fuctio where. p (z) = (2z + ), z (a) Assumig that is a oradom parameter, fid ad plot the maximum

More information

Review for cumulative test

Review for cumulative test Hrs Math 3 review prblems Jauary, 01 cumulative: Chapters 1- page 1 Review fr cumulative test O Mday, Jauary 7, Hrs Math 3 will have a curse-wide cumulative test cverig Chapters 1-. Yu ca expect the test

More information

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n. Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator

More information

CS 477/677 Analysis of Algorithms Fall 2007 Dr. George Bebis Course Project Due Date: 11/29/2007

CS 477/677 Analysis of Algorithms Fall 2007 Dr. George Bebis Course Project Due Date: 11/29/2007 CS 477/677 Analysis f Algrithms Fall 2007 Dr. Gerge Bebis Curse Prject Due Date: 11/29/2007 Part1: Cmparisn f Srting Algrithms (70% f the prject grade) The bjective f the first part f the assignment is

More information

NYU Center for Data Science: DS-GA 1003 Machine Learning and Computational Statistics (Spring 2018)

NYU Center for Data Science: DS-GA 1003 Machine Learning and Computational Statistics (Spring 2018) NYU Ceter for Data Sciece: DS-GA 003 Machie Learig ad Computatioal Statistics (Sprig 208) Brett Berstei, David Roseberg, Be Jakubowski Jauary 20, 208 Istructios: Followig most lab ad lecture sectios, we

More information

x 2 x 3 x b 0, then a, b, c log x 1 log z log x log y 1 logb log a dy 4. dx As tangent is perpendicular to the x axis, slope

x 2 x 3 x b 0, then a, b, c log x 1 log z log x log y 1 logb log a dy 4. dx As tangent is perpendicular to the x axis, slope The agle betwee the tagets draw t the parabla y = frm the pit (-,) 5 9 6 Here give pit lies the directri, hece the agle betwee the tagets frm that pit right agle Ratig :EASY The umber f values f c such

More information

Linear Regression Demystified

Linear Regression Demystified Liear Regressio Demystified Liear regressio is a importat subject i statistics. I elemetary statistics courses, formulae related to liear regressio are ofte stated without derivatio. This ote iteds to

More information

Lecture 9: September 19

Lecture 9: September 19 36-700: Probability ad Mathematical Statistics I Fall 206 Lecturer: Siva Balakrisha Lecture 9: September 9 9. Review ad Outlie Last class we discussed: Statistical estimatio broadly Pot estimatio Bias-Variace

More information

Lecture 2: Monte Carlo Simulation

Lecture 2: Monte Carlo Simulation STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

Empirical Process Theory and Oracle Inequalities

Empirical Process Theory and Oracle Inequalities Stat 928: Statistical Learig Theory Lecture: 10 Empirical Process Theory ad Oracle Iequalities Istructor: Sham Kakade 1 Risk vs Risk See Lecture 0 for a discussio o termiology. 2 The Uio Boud / Boferoi

More information

II. Descriptive Statistics D. Linear Correlation and Regression. 1. Linear Correlation

II. Descriptive Statistics D. Linear Correlation and Regression. 1. Linear Correlation II. Descriptive Statistics D. Liear Correlatio ad Regressio I this sectio Liear Correlatio Cause ad Effect Liear Regressio 1. Liear Correlatio Quatifyig Liear Correlatio The Pearso product-momet correlatio

More information

The Maximum-Likelihood Decoding Performance of Error-Correcting Codes

The Maximum-Likelihood Decoding Performance of Error-Correcting Codes The Maximum-Lielihood Decodig Performace of Error-Correctig Codes Hery D. Pfister ECE Departmet Texas A&M Uiversity August 27th, 2007 (rev. 0) November 2st, 203 (rev. ) Performace of Codes. Notatio X,

More information

Bayesian Estimation for Continuous-Time Sparse Stochastic Processes

Bayesian Estimation for Continuous-Time Sparse Stochastic Processes Bayesia Estimati fr Ctiuus-Time Sparse Stchastic Prcesses Arash Amii, Ulugbek S Kamilv, Studet, IEEE, Emrah Bsta, Studet, IEEE, Michael User, Fellw, IEEE Abstract We csider ctiuus-time sparse stchastic

More information

Chapter 4. Problem Solutions

Chapter 4. Problem Solutions Chapter 4. Prblem Slutis. The great majrity f alpha particles pass thrugh gases ad thi metal fils with deflectis. T what cclusi abut atmic structure des this bservati lead? The fact that mst particles

More information

Pattern Recognition 2014 Support Vector Machines

Pattern Recognition 2014 Support Vector Machines Pattern Recgnitin 2014 Supprt Vectr Machines Ad Feelders Universiteit Utrecht Ad Feelders ( Universiteit Utrecht ) Pattern Recgnitin 1 / 55 Overview 1 Separable Case 2 Kernel Functins 3 Allwing Errrs (Sft

More information

What is Statistical Learning?

What is Statistical Learning? What is Statistical Learning? Sales 5 10 15 20 25 Sales 5 10 15 20 25 Sales 5 10 15 20 25 0 50 100 200 300 TV 0 10 20 30 40 50 Radi 0 20 40 60 80 100 Newspaper Shwn are Sales vs TV, Radi and Newspaper,

More information

On the affine nonlinearity in circuit theory

On the affine nonlinearity in circuit theory O the affie liearity i circuit thery Emauel Gluski The Kieret Cllege the Sea f Galilee; ad Ort Braude Cllege (Carmiel), Israel. gluski@ee.bgu.ac.il; http://www.ee.bgu.ac.il/~gluski/ E. Gluski, O the affie

More information

Unit -2 THEORY OF DILUTE SOLUTIONS

Unit -2 THEORY OF DILUTE SOLUTIONS Uit - THEORY OF DILUTE SOLUTIONS 1) hat is sluti? : It is a hmgeus mixture f tw r mre cmpuds. ) hat is dilute sluti? : It is a sluti i which slute ccetrati is very less. 3) Give a example fr slid- slid

More information

Chapter 5. Root Locus Techniques

Chapter 5. Root Locus Techniques Chapter 5 Rt Lcu Techique Itrducti Sytem perfrmace ad tability dt determied dby cled-lp l ple Typical cled-lp feedback ctrl ytem G Ope-lp TF KG H Zer -, - Ple 0, -, - K Lcati f ple eaily fud Variati f

More information

Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal

Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Hypthesis Testing and Cnfidence Intervals (Part 1): Using the Standard Nrmal Lecture 8 Justin Kern April 2, 2017 Inferential Statistics Hypthesis Testing One sample mean / prprtin Tw sample means / prprtins

More information

18.S096: Homework Problem Set 1 (revised)

18.S096: Homework Problem Set 1 (revised) 8.S096: Homework Problem Set (revised) Topics i Mathematics of Data Sciece (Fall 05) Afoso S. Badeira Due o October 6, 05 Exteded to: October 8, 05 This homework problem set is due o October 6, at the

More information

Internal vs. external validity. External validity. This section is based on Stock and Watson s Chapter 9.

Internal vs. external validity. External validity. This section is based on Stock and Watson s Chapter 9. Sectin 7 Mdel Assessment This sectin is based n Stck and Watsn s Chapter 9. Internal vs. external validity Internal validity refers t whether the analysis is valid fr the ppulatin and sample being studied.

More information

WEST VIRGINIA UNIVERSITY

WEST VIRGINIA UNIVERSITY WEST VIRGINIA UNIVERSITY PLASMA PHYSICS GROUP INTERNAL REPORT PL - 045 Mea Optical epth ad Optical Escape Factr fr Helium Trasitis i Helic Plasmas R.F. Bivi Nvember 000 Revised March 00 TABLE OF CONTENT.0

More information

Statistical and Mathematical Methods DS-GA 1002 December 8, Sample Final Problems Solutions

Statistical and Mathematical Methods DS-GA 1002 December 8, Sample Final Problems Solutions Statistical ad Mathematical Methods DS-GA 00 December 8, 05. Short questios Sample Fial Problems Solutios a. Ax b has a solutio if b is i the rage of A. The dimesio of the rage of A is because A has liearly-idepedet

More information

Study in Cylindrical Coordinates of the Heat Transfer Through a Tow Material-Thermal Impedance

Study in Cylindrical Coordinates of the Heat Transfer Through a Tow Material-Thermal Impedance Research ural f Applied Scieces, Egieerig ad echlgy (): 9-63, 3 ISSN: 4-749; e-issn: 4-7467 Maxwell Scietific Orgaiati, 3 Submitted: uly 4, Accepted: September 8, Published: May, 3 Study i Cylidrical Crdiates

More information

Directional Duality Theory

Directional Duality Theory Suther Illiis Uiversity Carbdale OpeSIUC Discussi Papers Departmet f Ecmics 2004 Directial Duality Thery Daiel Primt Suther Illiis Uiversity Carbdale Rlf Fare Oreg State Uiversity Fllw this ad additial

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

x 1 Outline IAML: Logistic Regression Decision Boundaries Example Data

x 1 Outline IAML: Logistic Regression Decision Boundaries Example Data Outline IAML: Lgistic Regressin Charles Suttn and Victr Lavrenk Schl f Infrmatics Semester Lgistic functin Lgistic regressin Learning lgistic regressin Optimizatin The pwer f nn-linear basis functins Least-squares

More information

Recovery of Third Order Tensors via Convex Optimization

Recovery of Third Order Tensors via Convex Optimization Recvery f Third Order Tesrs via Cvex Optimizati Hlger Rauhut RWTH Aache Uiversity Lehrstuhl C für Mathematik (Aalysis) Ptdriesch 10 5056 Aache Germay Email: rauhut@mathcrwth-aachede Željka Stjaac RWTH

More information

Design and Implementation of Cosine Transforms Employing a CORDIC Processor

Design and Implementation of Cosine Transforms Employing a CORDIC Processor C16 1 Desig ad Implemetati f Csie Trasfrms Emplyig a CORDIC Prcessr Sharaf El-Di El-Nahas, Ammar Mttie Al Hsaiy, Magdy M. Saeb Arab Academy fr Sciece ad Techlgy, Schl f Egieerig, Alexadria, EGYPT ABSTRACT

More information

Local Polynomial Regression

Local Polynomial Regression Local Polyomial Regressio Joh Hughes October 2, 2013 Recall that the oparametric regressio model is Y i f x i ) + ε i, where f is the regressio fuctio ad the ε i are errors such that Eε i 0. The Nadaraya-Watso

More information

Correlation Regression

Correlation Regression Correlatio Regressio While correlatio methods measure the stregth of a liear relatioship betwee two variables, we might wish to go a little further: How much does oe variable chage for a give chage i aother

More information

Machine Learning Brett Bernstein

Machine Learning Brett Bernstein Machie Learig Brett Berstei Week 2 Lecture: Cocept Check Exercises Starred problems are optioal. Excess Risk Decompositio 1. Let X = Y = {1, 2,..., 10}, A = {1,..., 10, 11} ad suppose the data distributio

More information

Parameter, Statistic and Random Samples

Parameter, Statistic and Random Samples Parameter, Statistic ad Radom Samples A parameter is a umber that describes the populatio. It is a fixed umber, but i practice we do ot kow its value. A statistic is a fuctio of the sample data, i.e.,

More information

The generation of successive approximation methods for Markov decision processes by using stopping times

The generation of successive approximation methods for Markov decision processes by using stopping times The geerati f successive apprximati methds fr Markv decisi prcesses by usig stppig times Citati fr published versi (APA): va Nue, J. A. E. E., & Wessels, J. (1976). The geerati f successive apprximati

More information

An Investigation of Stratified Jackknife Estimators Using Simulated Establishment Data Under an Unequal Probability Sample Design

An Investigation of Stratified Jackknife Estimators Using Simulated Establishment Data Under an Unequal Probability Sample Design Secti Survey Research Methds SM 9 A Ivestigati f Stratified ackkife Estimatrs Usig Simulated Establishmet Data Uder a Uequal Prbability Sample Desig Abstract Plip Steel, Victria McNerey, h Slata Csiderig

More information

Output Analysis and Run-Length Control

Output Analysis and Run-Length Control IEOR E4703: Mote Carlo Simulatio Columbia Uiversity c 2017 by Marti Haugh Output Aalysis ad Ru-Legth Cotrol I these otes we describe how the Cetral Limit Theorem ca be used to costruct approximate (1 α%

More information

1 Introduction to reducing variance in Monte Carlo simulations

1 Introduction to reducing variance in Monte Carlo simulations Copyright c 010 by Karl Sigma 1 Itroductio to reducig variace i Mote Carlo simulatios 11 Review of cofidece itervals for estimatig a mea I statistics, we estimate a ukow mea µ = E(X) of a distributio by

More information

MATHEMATICS 9740/01 Paper 1 14 Sep hours

MATHEMATICS 9740/01 Paper 1 14 Sep hours Cadidate Name: Class: JC PRELIMINARY EXAM Higher MATHEMATICS 9740/0 Paper 4 Sep 06 3 hurs Additial Materials: Cver page Aswer papers List f Frmulae (MF5) READ THESE INSTRUCTIONS FIRST Write yur full ame

More information

Lab 1 The Scientific Method

Lab 1 The Scientific Method INTRODUCTION The fllwing labratry exercise is designed t give yu, the student, an pprtunity t explre unknwn systems, r universes, and hypthesize pssible rules which may gvern the behavir within them. Scientific

More information

Resampling Methods. Cross-validation, Bootstrapping. Marek Petrik 2/21/2017

Resampling Methods. Cross-validation, Bootstrapping. Marek Petrik 2/21/2017 Resampling Methds Crss-validatin, Btstrapping Marek Petrik 2/21/2017 Sme f the figures in this presentatin are taken frm An Intrductin t Statistical Learning, with applicatins in R (Springer, 2013) with

More information