Machine Learning for Signal Processing Regression and Prediction
|
|
- June Blankenship
- 5 years ago
- Views:
Transcription
1 Machne Learnng for Sgnal Processng Regresson and Predcton Class 4. 7 Oct 0 Instructor: Bhsha Raj 7 Oct /8797
2 f... D df Matr Identtes df d d df d d... df dd dd he dervatve of a scalar functon w.r.t. a vector s a vector 7 Oct /8797
3 Matr Identtes f.. D.. D D D DD df df df df d d d d d D.. d df df df d d.. d d d d D df df.. df dd dd d dd dd ddd D D DD he dervatve of a scalar functon w.r.t. a vector s a vector he dervatve w.r.t. a matr s a matr 7 Oct /8797 3
4 Matr Identtes F F F F FN D df d d df df df d... d dfn.. dfn d d df d d df d d.. dfn d d df d dd.. df.. d.. dd.... dfn d d D D D D he dervatve of a vector functon w.r.t. a vector s a matr Note transposton of order 7 Oct /8797 4
5 Dervatves, UV, N UV N NUV UVN In general: Dfferentatng an MN functon b a UV argument results n an MNUV tensor dervatve 7 Oct /8797 5
6 Matr dervatve denttes d Xa Xda d a X X da X s a matr, a s a vector. Soluton ma also be X d AX da X ; d XA X da A s a matr d d a Xa a X X da A XA XAA AA X trace d trace d trace X X da Some basc lnear and quadratc denttes 7 Oct /8797 6
7 A Common Problem Can ou spot the gltches? 7 Oct /8797 7
8 How to f ths problem? Gltches n audo Must be detected How? hen what? Gltches must be fed Delete the gltch Results n a hole Fll n the hole How? 7 Oct /8797 8
9 Interpolaton.. Etend the curve on the left to predct the values n the blan regon Forward predcton Etend the blue curve on the rght leftwards to predct the blan regon Bacward predcton How? Regresson analss.. 7 Oct /8797 9
10 Detectng the Gltch OK NO OK Regresson-based reconstructon can be done anwhere Reconstructed value wll not match actual value Large error of reconstructon dentfes gltches 7 Oct /8797 0
11 What s a regresson Analzng relatonshp between varables Epressed n man forms Wpeda Lnear regresson, Smple regresson, Ordnar least squares, Polnomal regresson, General lnear model, Generalzed lnear model, Dscrete choce, Logstc regresson, Multnomal logt, Med logt, Probt, Multnomal probt,. Generall a tool to predct varables 7 Oct /8797
12 Regressons for predcton = f; Q + e Dfferent possbltes s a scalar s real s categorcal classfcaton s a vector s a vector s a set of real valued varables s a set of categorcal varables s a combnaton of the two f. s a lnear or affne functon f. s a non-lnear functon f. s a tme-seres model 7 Oct /8797
13 A lnear regresson Y Assumpton: relatonshp between varables s lnear A lnear trend ma be found relatng and = dependent varable = eplanator varable Gven, can be predcted as an affne functon of X 7 Oct /8797 3
14 An magnar regresson.. Chec ths sht out Fg.. hat's bonafde, 00%-real data, m frends. I too t mself over the course of two wees. And ths was not a lesurel two wees, ether; I busted m ass da and nght n order to provde ou wth nothng but the best data possble. Now, let's loo a bt more closel at ths data, rememberng that t s absolutel frst-rate. Do ou see the eponental dependence? I sure don't. I see a bunch of crap. Chrst, ths was such a waste of m tme. Banng on m hopes that whoever grades ths wll just loo at the pctures, I drew an eponental through m nose. I beleve the apparent legtmac s enhanced b the fact that I used a complcated computer program to mae the ft. I understand ths s the same process b whch the top quar was dscovered. 7 Oct /8797 4
15 Lnear Regressons = A + b + e e = predcton error Gven a tranng set of {, } values: estmate A and b = A + b + e = A + b + e 3 = A 3 + b+ e 3 If A and b are well estmated, predcton error wll be small 7 Oct /8797 5
16 Lnear Regresson to a scalar = a + b + e = a + b + e 3 = a 3 + b + e 3 Defne: [...] 3 e [ e e...] 3 e X 3... A a b Rewrte A X e 7 Oct /8797 6
17 Learnng the parameters A X e ˆ A X Assumng no error Gven tranng data: several, Can defne a dvergence : D, ŷ Measures how much ŷ dffers from Ideall, f the model s accurate ths should be small Estmate A, b to mnmze D, ŷ 7 Oct /8797 7
18 he predcton error as dvergence Defne dvergence as sum of the squared error n predctng 7 Oct / e e X A ˆ = a + b + e = a + b + e 3 = a 3 + b + e 3... ˆ 3 e e e E D, b b b a a a X A X A X A E
19 Predcton error as dvergence = a + e e = predcton error Fnd the slope a such that the total squared length of the error lnes s mnmzed 7 Oct /8797 9
20 Solvng a lnear regresson A X e Mnmze squared error E X A A X A X XX A - X Dfferentatng w.r.t A and equatng to 0 A A XX - X A 0 de d A A - X XX pnvx A XX - X 7 Oct /8797 0
21 Regresson n multple dmensons = A + b + e = A + b + e 3 = A 3 + b + e 3 s a vector j = j th component of vector Also called multple regresson Equvalent of sang: a = th column of A b j = j th component of b = a + b + e = A + b + e = a + b + e 3 = a 3 + b 3 + e 3 Fundamentall no dfferent from N separate sngle regressons But we can use the relatonshp between s to our beneft 7 Oct /8797
22 Multple Regresson Y [...] 3 E [ e e...] 3 e Y 3 A X... Â b Aˆ X E D vector of ones DIV Aˆ trace Dfferentatng and equatng to 0 7 Oct /8797 Y ˆ Y - A X X daˆ 0 d. Dv Aˆ X Y Aˆ X ˆ YX XX - YpnvX ˆ A XX A YX Aˆ - XX XY
23 A Dfferent Perspectve = + s a nos readng of A Error e s Gaussan Estmate A from e ~ A e N0, I Y [... N ] X [... N ] 7 Oct /8797 3
24 he Lelhood of the data Probablt of observng a specfc, gven, for a partcular matr A Probablt of collecton: Assumng IID for convenence not necessar 7 Oct / e A 0, I ~ e N, ; ; I A A N P ]... [ ]... [ N N X Y N P, ; ; I A A X Y
25 A Mamum Lelhood Estmate A e e ~ N0, I P Y X ep A D log P Y X; A C C Mamzng the log probablt s dentcal to mnmzng the trace Identcal to the least squares soluton Y [... N ] X [... trace N A Y A X Y A X ] A - YX XX Ypnv X A XX XY 7 Oct /
26 Predctng an output From a collecton of tranng data, have learned A Gven for a new nstance, but not, what s? Smple soluton: ˆ A X 7 Oct /8797 6
27 Applng t to our problem Predcton b regresson Forward regresson t = a t- + a t- a t- +e t Bacward regresson t = b t+ + b t+ b t+ +e t 7 Oct /8797 7
28 Applng t to our problem Forward predcton 7 Oct / K t t t K K K t t t K t t t K t t e e e a e Xa t pnv a t X
29 Applng t to our problem Bacward predcton 7 Oct / e e e K t K t t K K K t t t K t t t K t K t b e Xb t pnv b t X
30 Fndng the burst At each tme Learn a forward predctor a t At each tme, predct net sample t est = S a t, t- Compute error: ferr t = t - t est Learn a bacward predct and compute bacward error berr t Compute average predcton error over wndow, threshold 7 Oct /
31 Fllng the hole Learn forward predctor at left edge of hole For each mssng sample At each tme, predct net sample t est = S a t, t- Use estmated samples f real samples are not avalable Learn bacward predctor at left edge of hole For each mssng sample At each tme, predct net sample t est = S b t, t+ Use estmated samples f real samples are not avalable Average forward and bacward predctons 7 Oct /8797 3
32 Reconstructon zoom n Reconstructon area Net gltch Dstorted sgnal Recovered sgnal Interpolaton result Actual data 7 Oct /8797 3
33 Incrementall learnng the regresson A XX Can we learn A ncrementall nstead? As data comes n? he Wdrow Hoff rule a t a t XY Note the structure Can also be done n batch mode! - Requres nowledge of all, pars t t ˆ t t ˆ t a t error Scalar predcton verson 7 Oct /
34 A Predctng a value - XX XY A YX XX ˆ What are we dong eactl? For the eplanaton we are assumng no b X s 0 mean Eplanaton generalzes easl even otherwse Let ˆ C Whtenng C XX ˆ and YX C C Xˆ C X N -0.5 C -0.5 s the whtenng matr for YXˆ 7 Oct / ˆ
35 Predctng a value What are we dong eactl? 7 Oct / YX ˆ ˆ ˆ ˆ ˆ N N N YX ˆ ˆ ˆ ˆ ˆ... ˆ ˆ ˆ
36 Predctng a value ˆ ˆ ˆ Gven tranng nstances, for =..N, estmate for a new test nstance of wth unnown : s smpl a weghted sum of the nstances from the tranng data he weght of an s smpl the nner product between ts correspondng and the new Wth due whtenng and scalng.. 7 Oct /
37 What are we dong: A dfferent perspectve ˆ A YX XX Assumes XX s nvertble What f t s not Dmensonalt of X s greater than number of observatons? Underdetermned In ths case X X wll generall be nvertble A X Y X X ˆ X X Y X 7 Oct /
38 Hgh-dmensonal regresson X X s the Gram Matr 7 Oct / X X Y X ˆ N N N N N N G X YG ˆ
39 Hgh-dmensonal regresson ˆ YG X Normalze Y b the nverse of the gram matr Y YG Worng our wa down.. ˆ YX ˆ 7 Oct /
40 Lnear Regresson n Hgh-dmensonal Spaces ˆ Y YG Gven tranng nstances, for =..N, estmate for a new test nstance of wth unnown : s smpl a weghted sum of the normalzed nstances from the tranng data he normalzaton s done va the Gram Matr he weght of an s smpl the nner product between ts correspondng and the new 7 Oct /
41 Relatonshps are not alwas lnear How do we model these? Multple solutons 7 Oct /8797 4
42 = Aj+e Non-lnear regresson φ [... N ] X X [ φ φ... φ K ] Y = AX+e Replace X wth X n earler equatons for soluton A X X - X Y 7 Oct /8797 4
43 Y = AX+e Problem Replace X wth X n earler equatons for soluton A X X - X Y X ma be n a ver hgh-dmensonal space he hgh-dmensonal space or the transform X ma be unnown.. 7 Oct /
44 he regresson s n hgh dmensons Lnear regresson: Hgh-dmensonal regresson 7 Oct / ˆ YG Y N N N N N G YG Y ˆ
45 Dong t wth Kernels Hgh-dmensonal regresson wth Kernels: Regresson n Kernel Hlbert Space.. 7 Oct / ,,,,,,,,, N N N N N N K K K K K K K K K G K, YG Y K, ˆ
46 A dfferent wa of fndng nonlnear relatonshps: Locall lnear regresson Prevous dscusson: Regresson parameters are optmzed over the entre tranng set Mnmze E all Sngle global regresson s estmated and appled to all future Alternatve: Local regresson b Learn a regresson that s specfc to A 7 Oct /
47 Beng non-commttal: Local Regresson Estmate the regresson to be appled to an usng tranng nstances near E j neghborhood A b he resultant regresson has the form j d, neghborhood j j e Note : ths regresson s specfc to A separate regresson must be learned for ever 7 Oct /
48 Local Regresson j d, neghborhood j j e But what s d? For lnear regresson d s an nner product More generc form: Choose d as a functon of the dstance between and j If d falls off rapdl wth and j the neghbhorhood requrement can be relaed d, j all j e 7 Oct /
49 Kernel Regresson: d = K ˆ K h K h pcal Kernel functons: Gaussan, Laplacan, other denst functons Must fall off rapdl wth ncreasng dstance between and j Regresson s local to ever : Local regresson Actuall a non-parametrc MAP estmator of But frst.. MAP estmators.. 49
50 Map Estmators MAP Mamum A Posteror: Fnd a best guess for statstcall, gven nown = argma Y PY ML Mamum Lelhood: Fnd that value of for whch the statstcal best guess of would have been the observed = argma Y P Y MAP s smpler to vsualze 7 Oct /
51 MAP estmaton: Gaussan PDF Assume X and Y are jontl Gaussan Y he parameters of the F Gaussan are learned from tranng data X F0 7 Oct /8797 5
52 Learnng the parameters of the Gaussan z z N N z C z N N z z z z z C z C C XX YX C C XY YY 7 Oct /8797 5
53 Learnng the parameters of the N Gaussan N z N z z Cz z z z z N z N C z C C N N C XY N XX YX C C XY YY 7 Oct /
54 MAP estmaton: Gaussan PDF Assume X and Y are jontl Gaussan Y he parameters of the F Gaussan are learned from tranng data X F0 7 Oct /
55 MAP Estmator for Gaussan RV Assume X and Y are jontl Gaussan Level set of Gaussan he parameters of the Gaussan are learned from tranng data X 0 Now we are gven an X, but no Y What s Y? 755/
56 MAP estmator for Gaussan RV 0 7 Oct /
57 MAP estmaton: Gaussan PDF Y F X 7 Oct /
58 MAP estmaton: he Gaussan at a partcular value of X F 0 7 Oct /
59 MAP estmaton: he Gaussan at a partcular value of X Most lel value F 0 7 Oct /
60 MAP Estmaton of a Gaussan RV Y = argma P X??? 0 7 Oct /
61 MAP Estmaton of a Gaussan RV 0 7 Oct /8797 6
62 MAP Estmaton of a Gaussan RV Y = argma P X 0 7 Oct /8797 6
63 So what s ths value? Clearl a lne Equaton of Lne: ˆ C Y YX C XX Scalar verson gven; vector verson s dentcal ˆ Y C YX C XX Dervaton? Later n the program a bt Note the smlart to regresson 7 Oct /
64 hs s a multple regresson ˆ Y C YX C XX hs s the MAP estmate of = argma Y PY What about the ML estmate of argma Y P Y Note: Nether of these ma be the regresson lne! MAP estmaton of s the regresson on Y for Gaussan RVs But ths s not the MAP estmaton of the regresson parameter 7 Oct /
65 Its also a mnmum-mean-squared error estmate General prncple of MMSE estmaton: s unnown, s nown Must estmate t such that the epected squared error s mnmzed Err E[ ˆ ] Mnmze above term 7 Oct /
66 Its also a mnmum-mean-squared error estmate Mnmze error: Dfferentatng and equatng to 0: 7 Oct / ] ˆ ˆ [ ] ˆ [ E E Err ] [ ˆ ˆ ˆ ] [ ] ˆ ˆ ˆ [ E E E Err 0 ˆ ] [ ˆ ˆ. d E d Err d ] [ ˆ E he MMSE estmate s the mean of the dstrbuton
67 For the Gaussan: MAP = MMSE Most lel value s also he MEAN value Would be true of an smmetrc dstrbuton 7 Oct /
68 MMSE estmates for mture dstrbutons 68 Let P be a mture denst he MMSE estmate of s gven b Just a weghted combnaton of the MMSE estmates from the component dstrbutons, P P P d P P E, ] [ d P P, ], [ E P
69 MMSE estmates from a Gaussan mture 7 Oct / P s also a Gaussan mture Let P, be a Gaussan Mture, ; N P P P S z z, z,,,, P P P P P P P P P P P P,
70 MMSE estmates from a Gaussan mture 7 Oct / Let P s a Gaussan Mture P P P, ], ; ];[ ; [,,,,,,,, C C C C N P, ;,,,,, Q C C N P Q C C N P P, ;,,,,
71 MMSE estmates from a Gaussan mture 7 Oct / E[ ] s also a mture P s a mture Gaussan denst Q C C N P P, ;,,,, E P E ], [ ] [ C C P E ] [,,,,
72 MMSE estmates from a Gaussan mture A mture of estmates from ndvdual Gaussans 7 Oct /8797 7
73 Voce Morphng Algn tranng recordngs from both speaers Cepstral vector sequence Learn a GMM on jont vectors Gven speech from one speaer, fnd MMSE estmate of the other Snthesze from cepstra 7 Oct /
74 MMSE wth GMM: Voce ransformaton - Festvo GMM transformaton sute oda awb bdl jm slt awb bdl jm slt 7 Oct /
75 A problem wth regressons A XX - XY ML ft s senstve Error s squared Small varatons n data large varatons n weghts Outlers affect t adversel Unstable If dmenson of X >= no. of nstances XX s not nvertble 7 Oct /
76 MAP estmaton of weghts a =a X+e Assume weghts drawn from a Gaussan Pa = N0, I Ma. Lelhood estmate Mamum a posteror estmate aˆ arg ma X aˆ arg ma a log P a log P a, X X; a arg ma log P 7 Oct / a e X, a P a
77 MAP estmaton of weghts Smlar to ML estmate wth an addtonal term 7 Oct / , log arg ma, log arg ma ˆ a X a X a a A A P P P Pa = N0, I Log Pa = C log a C P log X a X a X, a a a X a X a a A C 5 0. log ' arg ma ˆ
78 MAP estmate of weghts dl a XX X I da 0 a XX I - XY Equvalent to dagonal loadng of correlaton matr Improves condton number of correlaton matr Can be nverted wth greater stablt Wll not affect the estmaton from well-condtoned data Also called honov Regularzaton Dual form: Rdge regresson MAP estmate of weghts Not to be confused wth MAP estmate of Y 7 Oct /
79 MAP estmate prors Left: Gaussan Pror on W Rght: Laplacan Pror 7 Oct /
80 MAP estmaton of weghts wth laplacan pror Assume weghts drawn from a Laplacan Pa = l - ep-l - a Mamum a posteror estmate aˆ arg ma A C' a X a X l a No closed form soluton Quadratc programmng soluton requred Non-trval 7 Oct /
81 MAP estmaton of weghts wth laplacan pror Assume weghts drawn from a Laplacan Pa = l - ep-l - a Mamum a posteror estmate aˆ arg ma A C' a X a X l Identcal to L regularzed least-squares estmaton a 7 Oct /8797 8
82 L -regularzed LSE aˆ arg ma C' No closed form soluton Quadratc programmng solutons requred Dual formulaton A a X a X l a aˆ arg ma A C ' a X a X subject to a t LASSO Least absolute shrnage and selecton operator 7 Oct /8797 8
83 LASSO Algorthms Varous conve optmzaton algorthms LARS: Least angle regresson Pathwse coordnate descent.. Matlab code avalable from web 7 Oct /
84 Regularzed least squares Image Credt: bshran Regularzaton results n selecton of suboptmal n least-squares sense soluton One of the loc outsde center honov regularzaton selects shortest soluton L regularzaton selects sparsest soluton 7 Oct /
85 LASSO and Compressve Sensng Y = X a Gven Y and X, estmate sparse W LASSO: X = eplanator varable Y = dependent varable a = weghts of regresson CS: X = measurement matr Y = measurement a = data 7 Oct /
86 An nterestng problem: Predctng War! Economsts measure a number of socal ndcators for countres weel Happness nde Hunger nde Freedom nde wtter records Queston: Wll there be a revoluton or war net wee? 7 Oct /
87 An nterestng problem: Predctng War! Issues: Dssatsfacton bulds up not an nstantaneous phenomenon Usuall War / rebellon buld up much faster Often n hours Important to predct Preparedness for securt Economc mpact 7 Oct /
88 Predctng War O O O 3 O 4 O 5 O 6 O 7 O 8 W W W W W W W W S S S S S S S S Gven w w w3 w4 w5 w6 w7 w8 Sequence of economc ndcators for each wee Sequence of unrest marers for each wee At the end of each wee we now f war happened or not that wee Predct probablt of unrest net wee hs could be a new unrest or persstence of a current one 7 Oct /
89 Predctng me Seres Need tme-seres models HMMs later n the course 7 Oct /
Machine Learning for Signal Processing Regression and Prediction
Machne Learnng for Sgnal Processng Regresson and Predcton Class 4. 5 Oct 06 Instructor: Bhsha Raj 755/8797 A Common Problem Can ou spot the gltches? 755/8797 How to f ths problem? Gltches n audo Must be
More informationMachine Learning for Signal Processing Linear Gaussian Models
Machne Learnng for Sgnal Processng Lnear Gaussan Models Class 7. 30 Oct 204 Instructor: Bhksha Raj 755/8797 Recap: MAP stmators MAP (Mamum A Posteror: Fnd a best guess for (statstcall, gven knon = argma
More informationRegression and Prediction
-755 Machine Learning for Signal Processing Regression and Prediction Class 5. 23 Oct 202 Instructor: Bhisha Raj 23 Oct 202 755/8797 Matri Identities f 2... D df df d d df d 2 d2... df dd dd he derivative
More informationMachine Learning for Signal Processing Linear Gaussian Models
Machne Learnng for Sgnal rocessng Lnear Gaussan Models lass 2. 2 Nov 203 Instructor: Bhsha Raj 2 Nov 203 755/8797 HW3 s up. Admnstrva rojects please send us an update 2 Nov 203 755/8797 2 Recap: MA stmators
More informationExpectation Maximization Mixture Models HMMs
-755 Machne Learnng for Sgnal Processng Mture Models HMMs Class 9. 2 Sep 200 Learnng Dstrbutons for Data Problem: Gven a collecton of eamples from some data, estmate ts dstrbuton Basc deas of Mamum Lelhood
More informationDiscriminative classifier: Logistic Regression. CS534-Machine Learning
Dscrmnatve classfer: Logstc Regresson CS534-Machne Learnng 2 Logstc Regresson Gven tranng set D stc regresson learns the condtonal dstrbuton We ll assume onl to classes and a parametrc form for here s
More informationCIS526: Machine Learning Lecture 3 (Sept 16, 2003) Linear Regression. Preparation help: Xiaoying Huang. x 1 θ 1 output... θ M x M
CIS56: achne Learnng Lecture 3 (Sept 6, 003) Preparaton help: Xaoyng Huang Lnear Regresson Lnear regresson can be represented by a functonal form: f(; θ) = θ 0 0 +θ + + θ = θ = 0 ote: 0 s a dummy attrbute
More informationDiscriminative classifier: Logistic Regression. CS534-Machine Learning
Dscrmnatve classfer: Logstc Regresson CS534-Machne Learnng robablstc Classfer Gven an nstance, hat does a probablstc classfer do dfferentl compared to, sa, perceptron? It does not drectl predct Instead,
More informationGenerative classification models
CS 675 Intro to Machne Learnng Lecture Generatve classfcaton models Mlos Hauskrecht mlos@cs.ptt.edu 539 Sennott Square Data: D { d, d,.., dn} d, Classfcaton represents a dscrete class value Goal: learn
More informationMaximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models
ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Mamum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models for
More informationCorrelation and Regression. Correlation 9.1. Correlation. Chapter 9
Chapter 9 Correlaton and Regresson 9. Correlaton Correlaton A correlaton s a relatonshp between two varables. The data can be represented b the ordered pars (, ) where s the ndependent (or eplanator) varable,
More informationINF 5860 Machine learning for image classification. Lecture 3 : Image classification and regression part II Anne Solberg January 31, 2018
INF 5860 Machne learnng for mage classfcaton Lecture 3 : Image classfcaton and regresson part II Anne Solberg January 3, 08 Today s topcs Multclass logstc regresson and softma Regularzaton Image classfcaton
More informationFeature Selection: Part 1
CSE 546: Machne Learnng Lecture 5 Feature Selecton: Part 1 Instructor: Sham Kakade 1 Regresson n the hgh dmensonal settng How do we learn when the number of features d s greater than the sample sze n?
More informationLimited Dependent Variables
Lmted Dependent Varables. What f the left-hand sde varable s not a contnuous thng spread from mnus nfnty to plus nfnty? That s, gven a model = f (, β, ε, where a. s bounded below at zero, such as wages
More informationMLE and Bayesian Estimation. Jie Tang Department of Computer Science & Technology Tsinghua University 2012
MLE and Bayesan Estmaton Je Tang Department of Computer Scence & Technology Tsnghua Unversty 01 1 Lnear Regresson? As the frst step, we need to decde how we re gong to represent the functon f. One example:
More informationMACHINE APPLIED MACHINE LEARNING LEARNING. Gaussian Mixture Regression
11 MACHINE APPLIED MACHINE LEARNING LEARNING MACHINE LEARNING Gaussan Mture Regresson 22 MACHINE APPLIED MACHINE LEARNING LEARNING Bref summary of last week s lecture 33 MACHINE APPLIED MACHINE LEARNING
More informationLinear Feature Engineering 11
Lnear Feature Engneerng 11 2 Least-Squares 2.1 Smple least-squares Consder the followng dataset. We have a bunch of nputs x and correspondng outputs y. The partcular values n ths dataset are x y 0.23 0.19
More informatione i is a random error
Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where + β + β e for,..., and are observable varables e s a random error How can an estmaton rule be constructed for the unknown
More informationHomework Assignment 3 Due in class, Thursday October 15
Homework Assgnment 3 Due n class, Thursday October 15 SDS 383C Statstcal Modelng I 1 Rdge regresson and Lasso 1. Get the Prostrate cancer data from http://statweb.stanford.edu/~tbs/elemstatlearn/ datasets/prostate.data.
More informationGeneralized Linear Methods
Generalzed Lnear Methods 1 Introducton In the Ensemble Methods the general dea s that usng a combnaton of several weak learner one could make a better learner. More formally, assume that we have a set
More informationClassification as a Regression Problem
Target varable y C C, C,, ; Classfcaton as a Regresson Problem { }, 3 L C K To treat classfcaton as a regresson problem we should transform the target y nto numercal values; The choce of numercal class
More informationSupport Vector Machines
Support Vector Machnes Konstantn Tretyakov (kt@ut.ee) MTAT.03.227 Machne Learnng So far Supervsed machne learnng Lnear models Least squares regresson Fsher s dscrmnant, Perceptron, Logstc model Non-lnear
More informationSupport Vector Machines
Support Vector Machnes Konstantn Tretyakov (kt@ut.ee) MTAT.03.227 Machne Learnng So far So far Supervsed machne learnng Lnear models Non-lnear models Unsupervsed machne learnng Generc scaffoldng So far
More information1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands
Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of
More informationMaximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models
ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Maxmum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models
More informationSupport Vector Machines. Vibhav Gogate The University of Texas at dallas
Support Vector Machnes Vbhav Gogate he Unversty of exas at dallas What We have Learned So Far? 1. Decson rees. Naïve Bayes 3. Lnear Regresson 4. Logstc Regresson 5. Perceptron 6. Neural networks 7. K-Nearest
More informationKernel Methods and SVMs Extension
Kernel Methods and SVMs Extenson The purpose of ths document s to revew materal covered n Machne Learnng 1 Supervsed Learnng regardng support vector machnes (SVMs). Ths document also provdes a general
More informationClassification learning II
Lecture 8 Classfcaton learnng II Mlos Hauskrecht mlos@cs.ptt.edu 539 Sennott Square Logstc regresson model Defnes a lnear decson boundar Dscrmnant functons: g g g g here g z / e z f, g g - s a logstc functon
More informationCS 3710: Visual Recognition Classification and Detection. Adriana Kovashka Department of Computer Science January 13, 2015
CS 3710: Vsual Recognton Classfcaton and Detecton Adrana Kovashka Department of Computer Scence January 13, 2015 Plan for Today Vsual recognton bascs part 2: Classfcaton and detecton Adrana s research
More informationLinear Approximation with Regularization and Moving Least Squares
Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...
More informationNumerical Methods. ME Mechanical Lab I. Mechanical Engineering ME Lab I
5 9 Mechancal Engneerng -.30 ME Lab I ME.30 Mechancal Lab I Numercal Methods Volt Sne Seres.5 0.5 SIN(X) 0 3 7 5 9 33 37 4 45 49 53 57 6 65 69 73 77 8 85 89 93 97 0-0.5 Normalzed Squared Functon - 0.07
More informationWhich Separator? Spring 1
Whch Separator? 6.034 - Sprng 1 Whch Separator? Mamze the margn to closest ponts 6.034 - Sprng Whch Separator? Mamze the margn to closest ponts 6.034 - Sprng 3 Margn of a pont " # y (w $ + b) proportonal
More information10-701/ Machine Learning, Fall 2005 Homework 3
10-701/15-781 Machne Learnng, Fall 2005 Homework 3 Out: 10/20/05 Due: begnnng of the class 11/01/05 Instructons Contact questons-10701@autonlaborg for queston Problem 1 Regresson and Cross-valdaton [40
More informationChapter 9: Statistical Inference and the Relationship between Two Variables
Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,
More informationProbabilistic Classification: Bayes Classifiers. Lecture 6:
Probablstc Classfcaton: Bayes Classfers Lecture : Classfcaton Models Sam Rowes January, Generatve model: p(x, y) = p(y)p(x y). p(y) are called class prors. p(x y) are called class condtonal feature dstrbutons.
More informationResource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis
Resource Allocaton and Decson Analss (ECON 800) Sprng 04 Foundatons of Regresson Analss Readng: Regresson Analss (ECON 800 Coursepak, Page 3) Defntons and Concepts: Regresson Analss statstcal technques
More informationChapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.
Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where y + = β + β e for =,..., y and are observable varables e s a random error How can an estmaton rule be constructed for the
More informationLogistic Regression Maximum Likelihood Estimation
Harvard-MIT Dvson of Health Scences and Technology HST.951J: Medcal Decson Support, Fall 2005 Instructors: Professor Lucla Ohno-Machado and Professor Staal Vnterbo 6.873/HST.951 Medcal Decson Support Fall
More informationChapter 11: Simple Linear Regression and Correlation
Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests
More informationLecture 10 Support Vector Machines II
Lecture 10 Support Vector Machnes II 22 February 2016 Taylor B. Arnold Yale Statstcs STAT 365/665 1/28 Notes: Problem 3 s posted and due ths upcomng Frday There was an early bug n the fake-test data; fxed
More informationPredictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore
Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.
More informationFor now, let us focus on a specific model of neurons. These are simplified from reality but can achieve remarkable results.
Neural Networks : Dervaton compled by Alvn Wan from Professor Jtendra Malk s lecture Ths type of computaton s called deep learnng and s the most popular method for many problems, such as computer vson
More informationThe Gaussian classifier. Nuno Vasconcelos ECE Department, UCSD
he Gaussan classfer Nuno Vasconcelos ECE Department, UCSD Bayesan decson theory recall that we have state of the world X observatons g decson functon L[g,y] loss of predctng y wth g Bayes decson rule s
More informationDifferentiating Gaussian Processes
Dfferentatng Gaussan Processes Andrew McHutchon Aprl 17, 013 1 Frst Order Dervatve of the Posteror Mean The posteror mean of a GP s gven by, f = x, X KX, X 1 y x, X α 1 Only the x, X term depends on the
More informationProbabilistic Classification: Bayes Classifiers 2
CSC Machne Learnng Lecture : Classfcaton II September, Sam Rowes Probablstc Classfcaton: Baes Classfers Generatve model: p(, ) = p()p( ). p() are called class prors. p( ) are called class-condtonal feature
More informationLecture Notes on Linear Regression
Lecture Notes on Lnear Regresson Feng L fl@sdueducn Shandong Unversty, Chna Lnear Regresson Problem In regresson problem, we am at predct a contnuous target value gven an nput feature vector We assume
More informationSupport Vector Machines
Separatng boundary, defned by w Support Vector Machnes CISC 5800 Professor Danel Leeds Separatng hyperplane splts class 0 and class 1 Plane s defned by lne w perpendcular to plan Is data pont x n class
More informationβ0 + β1xi. You are interested in estimating the unknown parameters β
Revsed: v3 Ordnar Least Squares (OLS): Smple Lnear Regresson (SLR) Analtcs The SLR Setup Sample Statstcs Ordnar Least Squares (OLS): FOCs and SOCs Back to OLS and Sample Statstcs Predctons (and Resduals)
More informationLecture 6: Introduction to Linear Regression
Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6
More informationMachine learning: Density estimation
CS 70 Foundatons of AI Lecture 3 Machne learnng: ensty estmaton Mlos Hauskrecht mlos@cs.ptt.edu 539 Sennott Square ata: ensty estmaton {.. n} x a vector of attrbute values Objectve: estmate the model of
More informationComparison of Regression Lines
STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence
More informationOutline. Multivariate Parametric Methods. Multivariate Data. Basic Multivariate Statistics. Steven J Zeil
Outlne Multvarate Parametrc Methods Steven J Zel Old Domnon Unv. Fall 2010 1 Multvarate Data 2 Multvarate ormal Dstrbuton 3 Multvarate Classfcaton Dscrmnants Tunng Complexty Dscrete Features 4 Multvarate
More informationADVANCED MACHINE LEARNING ADVANCED MACHINE LEARNING
1 ADVANCED ACHINE LEARNING ADVANCED ACHINE LEARNING Non-lnear regresson technques 2 ADVANCED ACHINE LEARNING Regresson: Prncple N ap N-dm. nput x to a contnuous output y. Learn a functon of the type: N
More informationSupport Vector Machines
CS 2750: Machne Learnng Support Vector Machnes Prof. Adrana Kovashka Unversty of Pttsburgh February 17, 2016 Announcement Homework 2 deadlne s now 2/29 We ll have covered everythng you need today or at
More informationEconomics 130. Lecture 4 Simple Linear Regression Continued
Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do
More information2E Pattern Recognition Solutions to Introduction to Pattern Recognition, Chapter 2: Bayesian pattern classification
E395 - Pattern Recognton Solutons to Introducton to Pattern Recognton, Chapter : Bayesan pattern classfcaton Preface Ths document s a soluton manual for selected exercses from Introducton to Pattern Recognton
More informationLinear Regression Introduction to Machine Learning. Matt Gormley Lecture 5 September 14, Readings: Bishop, 3.1
School of Computer Scence 10-601 Introducton to Machne Learnng Lnear Regresson Readngs: Bshop, 3.1 Matt Gormle Lecture 5 September 14, 016 1 Homework : Remnders Extenson: due Frda (9/16) at 5:30pm Rectaton
More information1 GSW Iterative Techniques for y = Ax
1 for y = A I m gong to cheat here. here are a lot of teratve technques that can be used to solve the general case of a set of smultaneous equatons (wrtten n the matr form as y = A), but ths chapter sn
More informationStatistical analysis using matlab. HY 439 Presented by: George Fortetsanakis
Statstcal analyss usng matlab HY 439 Presented by: George Fortetsanaks Roadmap Probablty dstrbutons Statstcal estmaton Fttng data to probablty dstrbutons Contnuous dstrbutons Contnuous random varable X
More informationRockefeller College University at Albany
Rockefeller College Unverst at Alban PAD 705 Handout: Maxmum Lkelhood Estmaton Orgnal b Davd A. Wse John F. Kenned School of Government, Harvard Unverst Modfcatons b R. Karl Rethemeer Up to ths pont n
More informationMaximal Margin Classifier
CS81B/Stat41B: Advanced Topcs n Learnng & Decson Makng Mamal Margn Classfer Lecturer: Mchael Jordan Scrbes: Jana van Greunen Corrected verson - /1/004 1 References/Recommended Readng 1.1 Webstes www.kernel-machnes.org
More informationLinear discriminants. Nuno Vasconcelos ECE Department, UCSD
Lnear dscrmnants Nuno Vasconcelos ECE Department UCSD Classfcaton a classfcaton problem as to tpes of varables e.g. X - vector of observatons features n te orld Y - state class of te orld X R 2 fever blood
More information1 Convex Optimization
Convex Optmzaton We wll consder convex optmzaton problems. Namely, mnmzaton problems where the objectve s convex (we assume no constrants for now). Such problems often arse n machne learnng. For example,
More informationStatistics for Economics & Business
Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable
More informationMachine Learning for Signal Processing Applications of Linear Gaussian Models
Machne Learnng for Sgnal Processng Applcatons of Lnear Gaussan Models Class 8. 3 Nov 207 Instructor Najm Dehak In collaboraton th Prof Bhksha Raj Recap: MAP stmators MAP (Mamum A Posteror): Fnd most probable
More informationLinear Regression Analysis: Terminology and Notation
ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented
More informationβ0 + β1xi and want to estimate the unknown
SLR Models Estmaton Those OLS Estmates Estmators (e ante) v. estmates (e post) The Smple Lnear Regresson (SLR) Condtons -4 An Asde: The Populaton Regresson Functon B and B are Lnear Estmators (condtonal
More informationThe Fundamental Theorem of Algebra. Objective To use the Fundamental Theorem of Algebra to solve polynomial equations with complex solutions
5-6 The Fundamental Theorem of Algebra Content Standards N.CN.7 Solve quadratc equatons wth real coeffcents that have comple solutons. N.CN.8 Etend polnomal denttes to the comple numbers. Also N.CN.9,
More informationGaussian process classification: a message-passing viewpoint
Gaussan process classfcaton: a message-passng vewpont Flpe Rodrgues fmpr@de.uc.pt November 014 Abstract The goal of ths short paper s to provde a message-passng vewpont of the Expectaton Propagaton EP
More informationC4B Machine Learning Answers II. = σ(z) (1 σ(z)) 1 1 e z. e z = σ(1 σ) (1 + e z )
C4B Machne Learnng Answers II.(a) Show that for the logstc sgmod functon dσ(z) dz = σ(z) ( σ(z)) A. Zsserman, Hlary Term 20 Start from the defnton of σ(z) Note that Then σ(z) = σ = dσ(z) dz = + e z e z
More information3.1 ML and Empirical Distribution
67577 Intro. to Machne Learnng Fall semester, 2008/9 Lecture 3: Maxmum Lkelhood/ Maxmum Entropy Dualty Lecturer: Amnon Shashua Scrbe: Amnon Shashua 1 In the prevous lecture we defned the prncple of Maxmum
More informationMaximum Likelihood Estimation (MLE)
Maxmum Lkelhood Estmaton (MLE) Ken Kreutz-Delgado (Nuno Vasconcelos) ECE 175A Wnter 01 UCSD Statstcal Learnng Goal: Gven a relatonshp between a feature vector x and a vector y, and d data samples (x,y
More informationInner Product. Euclidean Space. Orthonormal Basis. Orthogonal
Inner Product Defnton 1 () A Eucldean space s a fnte-dmensonal vector space over the reals R, wth an nner product,. Defnton 2 (Inner Product) An nner product, on a real vector space X s a symmetrc, blnear,
More informationNeural networks. Nuno Vasconcelos ECE Department, UCSD
Neural networs Nuno Vasconcelos ECE Department, UCSD Classfcaton a classfcaton problem has two types of varables e.g. X - vector of observatons (features) n the world Y - state (class) of the world x X
More informationSTAT 3008 Applied Regression Analysis
STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,
More informationDepartment of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6
Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.
More informationPsychology 282 Lecture #24 Outline Regression Diagnostics: Outliers
Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.
More informationLecture 3: Dual problems and Kernels
Lecture 3: Dual problems and Kernels C4B Machne Learnng Hlary 211 A. Zsserman Prmal and dual forms Lnear separablty revsted Feature mappng Kernels for SVMs Kernel trck requrements radal bass functons SVM
More informationHere is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)
Secton 1.5 Correlaton In the prevous sectons, we looked at regresson and the value r was a measurement of how much of the varaton n y can be attrbuted to the lnear relatonshp between y and x. In ths secton,
More informationFall 2012 Analysis of Experimental Measurements B. Eisenstein/rev. S. Errede. . For P such independent random variables (aka degrees of freedom): 1 =
Fall Analss of Epermental Measurements B. Esensten/rev. S. Errede More on : The dstrbuton s the.d.f. for a (normalzed sum of squares of ndependent random varables, each one of whch s dstrbuted as N (,.
More information15-381: Artificial Intelligence. Regression and cross validation
15-381: Artfcal Intellgence Regresson and cross valdaton Where e are Inputs Densty Estmator Probablty Inputs Classfer Predct category Inputs Regressor Predct real no. Today Lnear regresson Gven an nput
More informationRegression Analysis. Regression Analysis
Regresson Analyss Smple Regresson Multvarate Regresson Stepwse Regresson Replcaton and Predcton Error 1 Regresson Analyss In general, we "ft" a model by mnmzng a metrc that represents the error. n mn (y
More informationThe exam is closed book, closed notes except your one-page cheat sheet.
CS 89 Fall 206 Introducton to Machne Learnng Fnal Do not open the exam before you are nstructed to do so The exam s closed book, closed notes except your one-page cheat sheet Usage of electronc devces
More informationModule 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur
Module 3 LOSSY IMAGE COMPRESSION SYSTEMS Verson ECE IIT, Kharagpur Lesson 6 Theory of Quantzaton Verson ECE IIT, Kharagpur Instructonal Objectves At the end of ths lesson, the students should be able to:
More informationSupport Vector Machines
/14/018 Separatng boundary, defned by w Support Vector Machnes CISC 5800 Professor Danel Leeds Separatng hyperplane splts class 0 and class 1 Plane s defned by lne w perpendcular to plan Is data pont x
More informationLogistic Regression. CAP 5610: Machine Learning Instructor: Guo-Jun QI
Logstc Regresson CAP 561: achne Learnng Instructor: Guo-Jun QI Bayes Classfer: A Generatve model odel the posteror dstrbuton P(Y X) Estmate class-condtonal dstrbuton P(X Y) for each Y Estmate pror dstrbuton
More informationMIMA Group. Chapter 2 Bayesian Decision Theory. School of Computer Science and Technology, Shandong University. Xin-Shun SDU
Group M D L M Chapter Bayesan Decson heory Xn-Shun Xu @ SDU School of Computer Scence and echnology, Shandong Unversty Bayesan Decson heory Bayesan decson theory s a statstcal approach to data mnng/pattern
More informationWhy Bayesian? 3. Bayes and Normal Models. State of nature: class. Decision rule. Rev. Thomas Bayes ( ) Bayes Theorem (yes, the famous one)
Why Bayesan? 3. Bayes and Normal Models Alex M. Martnez alex@ece.osu.edu Handouts Handoutsfor forece ECE874 874Sp Sp007 If all our research (n PR was to dsappear and you could only save one theory, whch
More informationChapter 14 Simple Linear Regression
Chapter 4 Smple Lnear Regresson Chapter 4 - Smple Lnear Regresson Manageral decsons often are based on the relatonshp between two or more varables. Regresson analss can be used to develop an equaton showng
More informationEEE 241: Linear Systems
EEE : Lnear Systems Summary #: Backpropagaton BACKPROPAGATION The perceptron rule as well as the Wdrow Hoff learnng were desgned to tran sngle layer networks. They suffer from the same dsadvantage: they
More informationThe Ordinary Least Squares (OLS) Estimator
The Ordnary Least Squares (OLS) Estmator 1 Regresson Analyss Regresson Analyss: a statstcal technque for nvestgatng and modelng the relatonshp between varables. Applcatons: Engneerng, the physcal and chemcal
More information1 Matrix representations of canonical matrices
1 Matrx representatons of canoncal matrces 2-d rotaton around the orgn: ( ) cos θ sn θ R 0 = sn θ cos θ 3-d rotaton around the x-axs: R x = 1 0 0 0 cos θ sn θ 0 sn θ cos θ 3-d rotaton around the y-axs:
More informationLecture 10 Support Vector Machines. Oct
Lecture 10 Support Vector Machnes Oct - 20-2008 Lnear Separators Whch of the lnear separators s optmal? Concept of Margn Recall that n Perceptron, we learned that the convergence rate of the Perceptron
More informationLecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding
Recall: man dea of lnear regresson Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 8 Lnear regresson can be used to study an
More informationLecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding
Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 008 Recall: man dea of lnear regresson Lnear regresson can be used to study
More informationLinear Classification, SVMs and Nearest Neighbors
1 CSE 473 Lecture 25 (Chapter 18) Lnear Classfcaton, SVMs and Nearest Neghbors CSE AI faculty + Chrs Bshop, Dan Klen, Stuart Russell, Andrew Moore Motvaton: Face Detecton How do we buld a classfer to dstngush
More informationY = β 0 + β 1 X 1 + β 2 X β k X k + ε
Chapter 3 Secton 3.1 Model Assumptons: Multple Regresson Model Predcton Equaton Std. Devaton of Error Correlaton Matrx Smple Lnear Regresson: 1.) Lnearty.) Constant Varance 3.) Independent Errors 4.) Normalty
More informationThe Geometry of Logit and Probit
The Geometry of Logt and Probt Ths short note s meant as a supplement to Chapters and 3 of Spatal Models of Parlamentary Votng and the notaton and reference to fgures n the text below s to those two chapters.
More information18-660: Numerical Methods for Engineering Design and Optimization
8-66: Numercal Methods for Engneerng Desgn and Optmzaton n L Department of EE arnege Mellon Unversty Pttsburgh, PA 53 Slde Overve lassfcaton Support vector machne Regularzaton Slde lassfcaton Predct categorcal
More information6 Supplementary Materials
6 Supplementar Materals 61 Proof of Theorem 31 Proof Let m Xt z 1:T : l m Xt X,z 1:t Wethenhave mxt z1:t ˆm HX Xt z 1:T mxt z1:t m HX Xt z 1:T + mxt z 1:T HX We consder each of the two terms n equaton
More information