Multiscale principal component analysis

Size: px
Start display at page:

Download "Multiscale principal component analysis"

Transcription

1 Multscale prcpal compoet aalyss A A Aduo ad A N Gorba Mathematcs Departmet Uversty of ecester ecestershre E 7RH UK E-mal: aaa78@leacu ad ag53@leacu Abstract Prcpal compoet aalyss PCA s a mportat tool eplorg data he covetoal approach to PCA leads to a soluto hch favours the structures th large varaces hs s sestve to outlers ad could obfuscate terestg uderlyg structures Oe of the equvalet deftos of PCA s that t sees the subspaces that mamze the sum of squared parse dstaces betee data proectos hs defto opes up more fleblty the aalyss of prcpal compoets hch s useful ehacg PCA I ths paper e troduce scales to PCA by mamzg oly the sum of parse dstaces betee proectos for pars of datapots th dstaces th a chose terval of values [lu] he resultg prcpal compoet decompostos Multscale PCA deped o pot lu o the plae ad for each pot e defe proectors oto prcpal compoets Cluster aalyss of these proectors reveals the structures the data at varous scales Each structure s descrbed by the egevectors at the medod pot of the cluster hch represet the structure We also use the dstorto of proectos as a crtero for choosg a approprate scale especally for data th outlers hs method as tested o both artfcal dstrbuto of data ad real data For data th multscale structures the method as able to reveal the dfferet structures of the data ad also to reduce the effect of outlers the prcpal compoet aalyss Itroducto It s ofte dffcult to etract meag from multvarate data of hgh dmeso ad hece there s a eed for feature etracto to mae aalyss easer ad to spot treds patters outlers ad other terestg relatoshp ad structures our data I 90 Pearso proposed appromatg hgh dmesoal data th les ad plaes ad hece veted the Prcpal Compoet Aalyss PCA PCA s a lear techque hch trasforms data to a e coordate system usg lear orthogoal trasformato such that the e coordates are ordered by varace he coordate th hghest varace s the frst prcpal compoet; the secod prcpal compoet s the coordate th the secod hghest varace ad so o a eample s gve fgure PCA s a poerful aalyss tool ad t s udged to be oe of the most mportat results of appled lear algebra [6] th may terestg applcatos hch clude: dmeso reducto bld source separato data vsualzato mage compresso ad th relevace may appled dscples such as quattatve face bology pharmaceutcs taoomy healthcare ad may more he prcpal compoets from PCA are lear combato of the orgal compoets ad eve though PCA s lmted tha o-lear dmeso reducto techques t s guarateed to sho geue propertes of the orgal data ad the lo dmeso are meagful [7]

2 Fgure Scatter plot of data he sold red arro ad the dashed blac arro dcate the drecto of the frst ad secod prcpal compoets respectvely color ole Hoever despte the may applcatos of PCA t s ot thout ts drabacs A eample of such drabacs s that PCA s based o the covarace matr hch s sestve to outlers I ths paper outlers are defed as data elemets th large dstace from the other data elemets a data sample Eve though outlers ca be fltered before performg PCA o the dataset hoever some cotets detfyg outlers could be cumbersome I addto to the above datasets are usually osy here e defe oses as data elemets th rather small varace ad the presece of ose data aalyss ca further obfuscate the uderlyg structures of the data beg vestgated [6] Oe of the deftos of PCA s that PCA fds subspaces les plaes or hgher dmesoal subspaces that mamze the sum of pot-to-pot squared dstaces betee the orthogoal proectos of data pots to them et the dstace fucto dst y be defed by a postve defte quadratc form Eucldea dstace for pars of obects y I clear terms PCA sees the dmesoal orthogoal proecto that mamzes dst P P here P s the proecto of vector to plae We ca observe that the mamzato problem gve above favors large parse dstaces Hece other terestg structures hch ca be revealed by smaller parse dstaces may be completely obfuscated Oe eample of such problem arses he usg PCA o data th outlers as the outlers may obfuscate the structures of the data et us cosder the data sho fgure the data s dstrbuted alog a le but th outlers sho crcles Fgure 3 shos the data proecto to the prcpal compoets th the -as as the frst prcpal compoet Hoever f the outlers ere removed the frst prcpal compoet should be close to the le o hch the data s dstrbuted as sho by the arro fgure 3 Fgures 4 ad 5 are the bplots [] of the eample gve above A bplot s useful for vsualzg the magtude ths s represeted th the les ad sg of each varable's cotrbuto to the frst to or three prcpal compoets ad ho each observato represeted as pots o the graph s represeted terms of those compoets he aes represet the prcpal compoets From the bplot belo e ca observe a sgfcat chage the cotrbuto of the varables the PCA due to the presece of the outlers here are several equvalet deftos of prcpal compoets he defto preseted above through mamzato of the sum of pot-to-pot squared dstaces betee the orthogoal proectos of data pots gves more fleblty for geeralzato ad cotrol [3] hch ca be mapulated to reveal some terestg uderlyg structures our data I addto to ths the defto above opes up the relatoshp betee PCA ad multdmesoal scalg [3]

3 Fgure Scatter plot of data dstrbuted alog a le th outlers he outlers are sho crcle Fgure 3 Data proecto to the frst prcpal compoets gve by the aes he arro dcates the drecto of the prcpal compoet f the outlers are removed Fgure 4 he bplot of the data thout outlers Fgure 5 he bplot of the data th outlers I ths paper scale as troduced to ehace the performace of PCA o datasets hat s e ll use the defto of multscale PCA mamzato of the sum of pot-to-pot squared dstaces betee the orthogoal proectos of data pots for the pars of pots th dstaces some tervals he result of ths s PCA decomposto of the data hch deped o the scale chose A further study of these PCA decompostos reveals some uderlyg structures hch could have bee obfuscated by other structures such has the presece of outlers or repeated patters as sho later We also proposed a crtero for determg the approprate scale for computg the prcpal compoets for data th outlers

4 Deftos ad Mathematcal Bacgroud I ths secto e cosder four classcal approaches to PCA hch are equvalet as gve by [] ad e also gve the ecessary mathematcal bacgroud that ll be eeded for ths paper Deftos of PCA et be a lear mafold of dmeso gve the parametrc form as v 0 av av a v here R m m a v R v s a set of orthoormal vectors R m Also let be data elemets here R ad let the data elemets be arraged as the ros of a m matr X such that the m coordates s gve by the colum of X For ths paper the coordates ll be represeted by Gree dces hle the observatos ll be represeted by at dces e s the thcoordate of the th observato For all computatos e assume that the data s cetered hs could be acheved by smple traslato of the data 0 ad v v We shall deote the proecto of data to the plae by P Defto Data appromato by les ad plaes PCA computes the sequeces m such that the sum of squared dstaces from data pots to ther orthogoal proectos o s mmal over all lear mafolds of dmeso m embedded R : MSD X m m he mea squared dstace betee a dataset X ad set of vectors y deoted by MSD X y s N defed as MSD X dst P y y Remar: Dmesoless varables ad ormalzato hs s eactly the defto gve by Pearso 90 [5] Eve though Pearso hs paper o prcpal compoet aalyss dd ot use hs defto of PCA ormalzato to ut varace t s ecessary to use the same dmeso o all aes For eample e caot summarse meters th lograms herefore ormalzato becomes mportat he the data s from dfferet dmesos; hoever the choce of ormalzato should deped o the type of data ad the problem beg solved Defto Varace mamzato For a dataset X ad for a gve v let us costruct a oe-dmesoal dstrbuto B { : v X} here deotes scalar vector product he let us defe emprcal varace of X alog v as Var B here Var s the stadard emprcal varace PCA sees to fd such that the sum of emprcal varaces of X alog v v v ould be mamal over all lear mafolds of dmeso embedded R m : Var B ma Defto 3 mea pot-to-pot squared dstace mamsato

5 PCA problem cossts fdg such sequece betee the orthogoal proectos of data pots o m dmeso embedded R : dst P P ma N that the mea pot-to-pot squared dstace s mamal over all lear mafolds of We o that all orthogoal proectos oto loer-dmesoal space lead to cotracto of all potto-pot dstaces ecept for some that do ot chage ths s equvalet to mmzato of mea squared dstace dstorto: N [ dst dst P P ] ma Defto 4 correlato cacellato: PCA sees such a orthoormal bass v v v hch the covarace matr for X s dagoal Evdetly ths bass the dstrbutos v ad v for have zero correlato Mathematcs Bacgroud As earler stated PCA sees the -dmesoal proecto that mamzes dst P P Usg the Eucldea dstace ths problem ca be stated as DX P ma here a v a R ad m Also m v R ad v v s roecer delta he proecto of a vector to a plae hch s deoted by P v v herefore the problem 3 reduces to mamzg D X v 4 hs s the same as mamzg the equato 5 belo D X v 5 he epresso the bracet gve as v v v v S ~ v 3 6 here ~ S

6 ad each elemet of S ~ s gve as ~ S Here S ~ s symmetrc postve sem-defte because for every y herefore the problem gve by 3 s reduced to ma v v ~ v Sv Subect to v v y y s postve sem-defte et m be the sorted egevalues of the matr S ~ ad e em be the correspodg egevectors a mamzer of the costraed mamzato problem 8 s e em See theorem Hece the orthogoal vectors that mamze 8 are the egevectors correspodg to the hghest egevalues If there are q dstct egevalues q of the matr S ~ such that s of multplcty ad q m e have a case called egevalue degeeracy For each th multplcty the egevectors le a dmesoal subspace orthogoal to the subspace spaed by the odegeerate egevalues For symmetrc matr these egevectors ll be learly depedet ad usg Gram-Schmdt procedure e ca fd orthogoal vectors that spa ths subspace No t s left to sho that the soluto to the problem 8 s actually the prcpal compoets et us eame the matr ~ S 9 Where μ μ μ μ ad S ~ μ 0 he remag terms are zero because the data has bee cetered ~ S cov X We o that gve dataset X th emprcal covarace matr S ad let egevalues of the matr S the correspodg egevector ~ 7 8 be the sorted e e s the prcpal compoets of the data From equato above S S therefore the egevectors of S s also the egevectors of S ~ ad ths s the prcpal compoet of X sce the multplcato of a matr by a postve

7 costat does ot chage the egevectors or ther order Hece e have sho that the soluto to mamzato problem 8 s the prcpal compoet of X No e cosder the elemets the matr S ~ From equato 7 e have S ~ 3 I matr otato the quadratc form 3 ca be rtte as ~ X X S ~ X X S 4 Where ] [ ad s the roecer delta s a symmetrc postve-sem defte matr th zero colum ad ro sum ad ths s useful for descrbg the parse relatoshp betee data elemets as sho emma hch s also avalable [7] heorem : et A be a symmetrc matr ad let the sorted egevalues be gve by ad let e e be the correspodg egevectors he e e s a mamzer of the costraed mamzato problem A ma u u u u Subect to: u u A detaled proof of ths theorem s avalable [7] emma : et be as defed above ad let R the Ad for coordate vectors e have:

8 Hece e see that the matr gve by s useful because the quadratc form assocated th t s the eghted sum of all parse squared dstaces [7] 3 Weghted PCA Defto 3 allos for some fleblty the aalyss of prcpal compoets because e have cotrol over the parse dstaces of proected data By assgg eghts to these parse dstaces e ca mapulate the resultg PCA decomposto of the data We o cosder the problem of fdg the prcpal compoet usg eghted parse dstaces of proected data hs problem s stated belo [ dst P P ] Where D X P Subect to: v v ma = s the o-egatve eght assged to the dstace betee elemet ad ad 0 for he equato 5 reduces to mamzg the equato 6 belo D X v 6 hs s the same as D X v he epresso the bracet gve as here et R ad let C v v v ~ v 5 v M 7 ~ M Equato 8 ca be rtte as ~ M 9 8

9 ] [ C R 0 ] [ C R Each elemet s gve as C R M R M 3 because C R ad herefore e ca rte equato 3 as R M M 4 et ] [ hs ca be rtte the form belo Where 0 for I matr otato the quadratc form 4 ca be rtte as ~ X X M 5 ad ~ X X M herefore the problem gve by 5 s reduced to v v M ~ ma v v Subect to v v 6 here M ~ s a symmetrc postve sem-defte matr ad from theorem the egevectors correspodg to the sorted egevalues of the matr M ~ s a mamzer of the costraed mamzato problem 6 I the case of degeerated egevalues the set m e e s ot uquely defed

10 3 Multscale PCA MPCA I ths secto e troduce the Multscale PCA MPCA algorthm to ehace the robustess of the PCA especally revealg hdde structures that may be preset dataset but hch the covetoal approach mght ot reveal MPCA compute prcpal compoets by mamzg the sum of parse dstaces betee data proecto for oly pars of datapots for hch the dstace s th the chose scale hs s acheved by assgg a eght of to the parse dstace of proectos of ay par of data pots th dstace th the chose scale ad a eght of 0 otherse l u 0 otherse 7 m I the scale terval lu l s the loer lmt of the scale ad u s the upper lmt et d be the ma mmum parse dstace greater tha zero ad d be the mamum parse dstace the data m We select the pars lu from a tragle Δ={lu: d l<u d ma } Wth ths cotrol over the parse dstaces e are able compute PCA at varous scales ad the outcome of ths s scale depedet PCA hch ca reveal terestg uderlyg structures that may be preset data For eample reducg the upper lmt of the scale hle eepg the loer lmt at 0 traslate to computg PCA by cosderg smaller dstaces ad ecludg very large dstaces hs has the effect of mmzg thout eplct ecluso the cotrbuto of certa fluetal data elemets the aalyss of the prcpal compoets 3 he Multscale PCA Algorthm Here e dscuss the Multscale PCA Algorthm Gve the data sample Cetralze the data by subtractg the mea of the varables from each observato 3 Fd the dssmlarty matr by computg the Eucldea dstace 4 Choose a approprate scale betee 0 ad the mamum dstace For easy aalyss a scale betee 0 ad could be chose ad the multpled by the mamum dstace For ths paper he usg scale betee 0 ad e call t stadard scale 5 Calculated the bary eght as gve equato 7 6 Calculate the matr as gve belo 7 Calculate the matr A Y Y here Y s the cetralzed data 8 Fd the sorted egevalues of the matr A descedg order of magtude ad proect the data oto ther correspodg egevectors hs ll be the prcpal compoets at the selected scale

11 o llustrate the result of MPCA o data e cosder some eamples 3 Multscale PCA o Data th repeated patters Eample Here e cosder a eample of a data sample th repeated uderlyg structure See fgure 6-9 Fgure 6 Scatter plot of data th repeated patter Fgure 7 he sold arro ad the dashed arros sho the drecto of the frst ad secod prcpal compoets respectvely usg MPCA at a scale of [0-08] equvalet to stadard scale [0-] hs s the same as usg PCA Fgure 8 he sold arro ad the dashed arro sho the drecto of the frst ad secod prcpal compoets respectvely usg MPCA at a scale of [0-00] equvalet to stadard scale [0-08]

12 Fgure 9 he sold arro ad the dashed arro sho the drecto of the frst ad secod prcpal compoets respectvely usg MPCA at a scale of [0-] equvalet to stadard scale [0-00] From fgure 9 e observe that the PCA reveals the er structure of the data A better ve of ths er structure ad the PCA s gve fgure 0 Fgure 0 Magfed ve of oe cluster the dataset th the sold red arro ad dashed blac arro represetg the drecto of the frst ad secod prcpal compoets respectvely Color ole As llustrated the eample above the prcpal compoets chaged as the scale chaged ad ths as able to reveal some uderlyg structures of the data Fgure oe captures the chages the frst prcpal compoet at varous scales Fgure he dagram llustrate the chage the agle of the frst prcpal compoet as the scale chaged he agle recorded here s the agle gradet betee the frst prcpal compoet usg PCA ad the frst prcpal compoet usg MPCA at a gve scale

13 4 Clusterg Aalyss o the Iterval of Scales o further study these structures e cosder clusterg aalyss o the terval of scales ad e troduce the Rato of Dstorto ths secto 4 Represetg PCA Structures et us cosder the terval of values l u here l loer lmt u upper lmt ad l u he scale l u ca be represeted as pot the plae R as sho fgure he resultg prcpal compoet decompostos MPCA deped o the pots l u o the plae Fgure hs dagram shos the stadard scale represeted as pots o the plae R We ll le to study the PCA structure at dfferet scales; therefore e eed a represetato of the PCA structure for each pot l u o the plae We ca represet the PCA structure at a pot l u by the correspodg orthoormal vectors of prcpal compoets from MPCA at that pot; hoever ths represetato s ot coveet for statstcal aalyss of prcpal compoets If for eample e cosder the case of equdstrbuto of a ormalzed vector v o m sphere the epectato E [ v] 0 hs s because of sphercal symmetry ad the Epectato s the vector the sphere hch s rotato varat ad that s 0 ad ths could be couter tutve he space of prcpal compoet bases s a space of orthoormal bases R m hs s ot a lear space but a rather complcated symmetrc mafold th group O m acto o t We propose to embed ths symmetrc space to a Eucldea space usg the PCA proector represetato ad after that apply stadard statstcal ad data mg procedures et us recall that the prcpal compoet gve by e s the same as e therefore e eed a represetato such that ths codto s satsfed he prcpal compoets are orthogoal aal frame [7] ad oe ay to represet such data s usg the tesor product P e e hch s the proector of our data oto the prcpal compoet e Sce ths product s b-lear e o that e e e e hece e have the same represetato for both a vector ad ts egatve as requred P X e e X e e X s the proecto of data X oto vectors e ad P X e e X s the data X proected oto the frst - prcpal compoet

14 m For ay m orthoormal vectors e em e e If e s oe of e th probablty the m Ee e he rotato varace gves the same result f e s equdstrbuted o ut m m sphere Fgure 3 hs dagram llustrates ho the proecto of vector y chages gve by the dashed red sphere as e moves alog the blue sold -sphere Color ole It follos that the average proecto of E[ P X ] X m herefore e represet the PCA structure of the data at ay pot l uby the sum of the proectors correspodg to MPCA at that pot hs ll be deoted by e e m he full descrpto of the prcpal compoets decompostos of data X s gve by a ordered set cortege of matrces m m If e arrage the e as colums of matr E the X EE X ad EE For m EE Ι MPCA lead to scale depedet PCA structures ad th these PCA structures represeted as defed above e ca study the structures our data further by aalyzg these proectors he PCA structures assocated th to dfferet pots o the plae s sad to be smlar f ther correspodg proectors are smlar 4 Clusterg of Scales We guess that some cases there are clear teral structures the data hch deped o scales Performg MPCA o the data leads to a cotuum of PCA structures depedg o scales used ad to reveal the structures the data e o scales th smlar PCA structures ad separate scales th dssmlar PCA structures hs leads to the dea of clusterg of scales We represet the dstace betee to pots o the scale by the dstace betee ther correspodg PCA structures Clusterg aalyss of the scales group smlar PCA structures together ad ths reveals some structures the data We descrbe each cluster by the proector correspodg to the medod pot of the cluster I a later secto e ll troduce Rato of Dstorto hch s aother crtero that ca be used to select the proectors that descrbe the clusters

15 For eample clusterg aalyses of scales for correspods to cluster aalyss of the MPCA structures he data s proected oto the frst prcpal compoets at varous scales No let each pot l u the plae be represeted by p here p l u l uu such that l u We deote the proector at a pot p by p For ay par of pots p q the space of scales e ca compute the dstace betee the assocated proectors for a gve usg varat orm We recall that the Frobeus orm of a real matr B deoted by F tracebb P P trace P P P P B therefore dstace betee proectors of ay par of pots the space of scale dst p q p q p q Ay stadard clusterg algorthm ca be used to cluster the scale order to reveal hdde structures the data but ths paper agglomeratve herarchcal clusterg as used because e ca measure dstace easly Decdg o the umber of true clusters clusterg aalyss s a classcal p q problem ad oe may at to compare varous dces A typcal eample of such s the statstc SSEt SSEa SSEb a b pseudo t SSE SSE Where SSE s the sum of square of cluster a a a b b pseudo SSE s the sum of square of cluster b SSE s the sum of square of cluster formed by og clusters a ad b a ad b are the umber of elemets clusters a ad b respectvely If a small value of the pseudo t statstc at a step of the herarchcal clusterg s folloed by a dstct large value at the step the cluster form at the step s chose as the optmal cluster It s assume that the mea vector of the to clusters beg merged at the step ca be regarded as dfferet ad should probably ot be merged et us cosder the result of the cluster aalyss of the data eample fgure 6 for e proecto oto frst prcpal compoet For llustrato purpose pots from the subset of ad U have bee selected l ad u u he result s preseted fgure 4 t t Fgure 4 hs dagram shos cluster of scales o the plae Scales belogg to the same cluster are represeted by the same symbol ad color Color ole

16 he pseudo t statstc dcates three meagful clusters hs reaffrms the result dsplayed fgure We represet each of these structures by the egevector of the medod pot of the cluster represetg t he result s gve the table able hs table shos the descrpto of each cluster Each cluster has bee descrbed by the egevector of the proector correspodg to the medod pot of the cluster Cluster Iterval correspodg to Medod pot Proector 009 Egevector e [ ] e [ 0707 e [ ] 00000] Eample We cosder MPCA of the Eergy Effcecy Dataset avalable ole at the UCI mache earg Repostory hs dataset cotas 768 samples ad 8 compoets ad used to predct dfferet outputs We perform MPCA o the data the output varables are ot cluded sce all the data are postve e ormalzed by dvdg by the mea he data proectos to the frst to prcpal compoet ad frst three prcpal compoets respectvely are sho the fgures 5 ad 6 MPCA at stadard scale of [0-0] reveals the structure gve fgures 7 ad 8 he result of the clusterg aalyss of the scale s preseted fgure 9 he dcates four meagful clusters pseudo t statstc Fgure 5 Data proecto to the frst prcpal compoets for PCA Fgure 6 Data proecto to the frst 3 prcpal compoets for PCA

17 Fgure 7 Data proecto to the frst prcpal compoets usg MPCA at stadard scale 0 0 Fgure 8 Data proecto to the frst 3 prcpal compoets usg MPCA at stadard scale 0 0 Fgure 9 hs dagram shos cluster of scales o the plae Scales belogg to the same cluster are represeted by the same symbol ad color Color ole We represet each of these structures by the egevector of the medod pot of the cluster represetg t he result s gve the table able hs table shos the descrpto of each cluster Each cluster has bee descrbed by the egevectors of the proector correspodg to the medod pot of the cluster Cluster Iterval correspodg Egevector to Medod pot 009 e [ ] e [ ] e [ e [ ] e [ ] 0790] e [ ] e [ ] e [ ]

18 43 Multscale PCA o Data th Outlers he presece of outlers our data serves to obfuscate the uderlyg structure of the data PCA MPCA s hoever effectve revealg the uderlyg structure of data th outlers By reducg the upper lmt of the scale e ca effectvely mtgate the effect of outlers the aalyss of the prcpal compoets thout eplct ecluso of these outlers Eample 3 o test the performace of scaled PCA o data th outlers data ere smulated alog o plae ad some outlers ere added to ths data hs data as embedded to a hgher dmesoal space ad e see to recover the orgal plae from the data by usg PCA ad MPCA at varous scales he agle betee the orgal drectoal vector ad the frst prcpal compoet of MPCA at varous scales s gve the apped see table A We cosder a 3-dmesoal data sample hch the elemets are dstrbuted uformly o a plae -d th the drectoal vectors gve as u [ ]; v [ ]; Wth vector u beg the frst prcpal compoet ad fe outlers ere added as ca be see fgure 9 he proecto of the data to the frst prcpal compoets s sho fgure 0; ths has bee flueced by the outlers the data MPCA at stadard scale of 0-08 hoever gves aother structure hch s foud to have captured the data qute ell as sho fgure he result of the clusterg aalyss of the scales s preseted fgure Fgure 9 Scatter plot of data 3-dmeso th a fe outlyg pots Fgure 0 Data proecto to the frst prcpal compoets usg PCA It ca be observed that the outlers have flueced the result of the PCA

19 Fgure Data proecto to the frst prcpal compoets usg MPCA at stadard scale of 0 08 he effects of the outlers have bee mtgated Fgure hs dagram shos cluster of scales o the plae Scales belogg to the same cluster are represeted by the same symbol ad color Color ole We represet each of these structures by the egevector of the medod pot of the cluster represetg t See table 3 able 3 hs table shos the descrpto of each cluster descrbed by the egevectors of the proector correspodg to the medod pot of the cluster Cluster Iterval correspodg to Egevector Medod pot Proector 04 e [ ] e [ ] 00 e [ ] e [ ] 44 Crtero for choosg scale for data th outlers I ths secto e propose a crtero for decdg a approprate scale for MPCA feature etracto especally for data th outlers As metoed earler fdg the prcpal compoets usg defto 3 s equvalet to mmzg the mea squared dstace dstorto N [ dst dst P P ] m Where the dmeso of s strctly less tha the dmeso of the data Hece e propose that a approprate scale for a gve dmeso could be determed by fdg the rato of dstorto

20 For all N N P P such that l u ma l Is the loer lmt of the scale ad u s the upper lmt he rato of dstorto troduced here ca also be used the clusterg aalyss of scales as a crtero to determe the PCA structure that descrbes the cluster 5 Dscusso ad Cocluso 5 Dscusso For eample 3 MPCA at scales l u 0 l u 0 9 l u reveals aother structure the data that has bee obfuscated by the outlers able A ad A3 the apped sho the results of the rato of dstorto for dfferet dmesos hs s also cosstet th the dfferece agle betee the orgal plae ad the prcpal compoet computed usg MPCA at these scales see table A the apped Reducg the upper lmt to a very small umber may cause MPCA to ft ose hle creasg the loer lmt oly may cause MPCA to ft outlers f such s preset the data It s mportat to ote that by usg MPCA some parse dstaces are eempted the aalyss of prcpal compoet ad the percetage of such eempted parse dstace should be ept to a reasoable umber As t ca be observed the table A ad A3 the apped as the loer lmt creased the rato of dstorto appear to mprove eve though the dfferece agle s qute large for some scales but oly because MPCA s fttg outlers herefore addto to the result of the rato of dstorto the percetage of total parse dstaces eempted the computato of the MPCA at dfferet scales especally hel 0 should be cosdered choosg a approprate scale A good scale for MPCA should be oe th mamum rato of dstorto ad least umber of eempted parse dstaces able A4 the apped shos the percetage of parse dstaces of data pots eempted the computato of MPCA at varous scales It ca be cocluded that hle reducg the upper lmt s good for ths data creasg the loer lmt maes MPCA to ft outlers 5 Cocluso Prcpal compoet aalyss of hgh dmeso data favour compoets th hgh varace hs may obfuscate hdde geometrc structures that may be preset the data he defto of PCA as the mamzato of the sum of pot-to-pot squared dstaces betee the orthogoal proectos of data pots s a very coveet defto ad allos for geeralzato I ths paper e troduced multscale PCA as mamzato of the sum of pot-to-pot squared dstaces betee the orthogoal proectos of data pots for the pars of pots th dstaces some tervals scales MPCA s developed to solve the problem of revealg hdde geometrc structures data he result of MPCA o data leads to a cotuum of PCA structures of the data hch s depedet o the tervals chose Aalysg the MPCA structures of data reveals some teral structures of the data especally for data th multscale structures o study the MPCA structures of data e represet the MPCA structure at a gve terval by the cortege of proectors correspodg to MPCA at that terval; ths represetato has good ad meagful statstcal propertes hch are dscussed o reveal uderlyg geometrc structures that may be preset the data clusterg aalyss of the PCA structures at varous scales groups together scales th smlar PCA structures ad separate scales th dssmlar PCA structures

21 For data th clear multscale structures the cluster aalyss reveals some uderlyg structures the data hch covetoal PCA caot reveal due to the fact that such structures are obfuscated by other structures of hgher varace Each meagful cluster correspods to a structure the data ad e represet each cluster by the medod pot of the cluster ad ths represetatve s used to descrbe the structure of the data for cluster We propose the Rato of Dstorto as a crtero for choosg a approprate scale for MPCA for feature etracto especally for data th outlers ad also as a crtero for choosg the PCA structure to descrbe each cluster the clusterg aalyss of scales Applcato of MPCA o artfcal ad real lfe eamples shos that ths ca be useful For data th mult-scale structures the method as able to reveal some uderlyg structure data he method as partcularly useful mtgatg the fluece of outlers o the aalyss of prcpal compoet thout havg to eclude such outlers eplctly

22 oer mt oer mt Apped able A he agle betee the orgal compoet ad the result of the frst prcpal compoet usg MPCA at varous scales for the data eample 3 Upper mt SCAE Note: he MPCA at scale 0- s the same as PCA he cell for PCA as beg mared th a grey-scale bacgroud able A he rato of dstorto of MPCA at varous scales for = hs result s for the data gve eample 3 Upper mt SCAE NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN Note: he MPCA at scale 0- s the same as PCA he cell for PCA as beg mared th a grey-scale bacgroud

23 oer mt oer mt able A3 he rato of dstorto of MPCA at varous scales for = hs result s for the data gve eample 3 Upper mt SCAE NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN Note: he MPCA at scale 0- s the same as PCA he cell for PCA as beg mared th a grey-scale bacgroud able A4 he percetage of parse dstaces ecluded at varous scales of MPCA hs result s for the data gve eample 3 Upper mt SCAE % 0% 0% 0% 0% 0% 0% 585% 347% 7353% 0 647% 3768% 3768% 3768% 3768% 3768% 3768% 43% 68% 000% 0 659% 7650% 7650% 7650% 7650% 7650% 7650% 85% 000% 000% % 9535% 9535% 9535% 9535% 9535% 9535% 000% 000% 000% % 0000% 0000% 0000% 0000% 0000% 000% 000% 000% 000% % 0000% 0000% 0000% 0000% 000% 000% 000% 000% 000% % 0000% 0000% 0000% 000% 000% 000% 000% 000% 000% % 0000% 0000% 000% 000% 000% 000% 000% 000% 000% % 0000% 000% 000% 000% 000% 000% 000% 000% 000% % 000% 000% 000% 000% 000% 000% 000% 000% 000%

24 Refereces [] Jollffe I 00 Prcpal Compoet Aalyss Secod Edto Ne Yor: Sprger [] Gorba A N ad Zovyev A Y 009 Prcpal Graphs ad Mafolds Hadboo of Research o Mache earg Applcatos ad reds: Algorthms Methods ad echques Iformato Scece ReferecePreprt arxv: v [3] Zovyev A 000 Vsualzato of Multdmesoal Data I Russa Krasoyars echcal State Uversty Press [4] Burges C J C 00 Geometrc Methods for Feature Etracto ad Dmesoal Reducto - A Guded our Data Mg ad Koledge Dscovery Hadboo Ne Yor: Sprger ed O Mamo ad Roach d Edto ISBN pp 53-8 [5] Pearso K 90 O les ad plaes of closest ft to systems of pots space Phlosophcal Magaze pp [6] Shles J 005 A tutoral o prcpal compoet aalyss Accessed May 0 03 [7] Arold R ad Jupp P E 03 Statstcs of orthogoal aal frames Bometra pp -6 do: 0093/bomet/ast07 [8] Kore Y ad Carmel 004 Robust lear dmesoalty reducto Vsualzato ad Computer Graphcs IEEE rasactos vol0 o4 pp459-70

3D Geometry for Computer Graphics. Lesson 2: PCA & SVD

3D Geometry for Computer Graphics. Lesson 2: PCA & SVD 3D Geometry for Computer Graphcs Lesso 2: PCA & SVD Last week - egedecomposto We wat to lear how the matrx A works: A 2 Last week - egedecomposto If we look at arbtrary vectors, t does t tell us much.

More information

CS 2750 Machine Learning. Lecture 8. Linear regression. CS 2750 Machine Learning. Linear regression. is a linear combination of input components x

CS 2750 Machine Learning. Lecture 8. Linear regression. CS 2750 Machine Learning. Linear regression. is a linear combination of input components x CS 75 Mache Learg Lecture 8 Lear regresso Mlos Hauskrecht mlos@cs.ptt.edu 539 Seott Square CS 75 Mache Learg Lear regresso Fucto f : X Y s a lear combato of put compoets f + + + K d d K k - parameters

More information

CS 1675 Introduction to Machine Learning Lecture 12 Support vector machines

CS 1675 Introduction to Machine Learning Lecture 12 Support vector machines CS 675 Itroducto to Mache Learg Lecture Support vector maches Mlos Hauskrecht mlos@cs.ptt.edu 539 Seott Square Mdterm eam October 9, 7 I-class eam Closed book Stud materal: Lecture otes Correspodg chapters

More information

Support vector machines II

Support vector machines II CS 75 Mache Learg Lecture Support vector maches II Mlos Hauskrecht mlos@cs.ptt.edu 539 Seott Square Learl separable classes Learl separable classes: here s a hperplae that separates trag staces th o error

More information

Chapter 9 Jordan Block Matrices

Chapter 9 Jordan Block Matrices Chapter 9 Jorda Block atrces I ths chapter we wll solve the followg problem. Gve a lear operator T fd a bass R of F such that the matrx R (T) s as smple as possble. f course smple s a matter of taste.

More information

1 Lyapunov Stability Theory

1 Lyapunov Stability Theory Lyapuov Stablty heory I ths secto we cosder proofs of stablty of equlbra of autoomous systems. hs s stadard theory for olear systems, ad oe of the most mportat tools the aalyss of olear systems. It may

More information

Binary classification: Support Vector Machines

Binary classification: Support Vector Machines CS 57 Itroducto to AI Lecture 6 Bar classfcato: Support Vector Maches Mlos Hauskrecht mlos@cs.ptt.edu 539 Seott Square CS 57 Itro to AI Supervsed learg Data: D { D, D,.., D} a set of eamples D, (,,,,,

More information

Descriptive Statistics

Descriptive Statistics Page Techcal Math II Descrptve Statstcs Descrptve Statstcs Descrptve statstcs s the body of methods used to represet ad summarze sets of data. A descrpto of how a set of measuremets (for eample, people

More information

Radial Basis Function Networks

Radial Basis Function Networks Radal Bass Fucto Netorks Radal Bass Fucto Netorks A specal types of ANN that have three layers Iput layer Hdde layer Output layer Mappg from put to hdde layer s olear Mappg from hdde to output layer s

More information

CHAPTER 4 RADICAL EXPRESSIONS

CHAPTER 4 RADICAL EXPRESSIONS 6 CHAPTER RADICAL EXPRESSIONS. The th Root of a Real Number A real umber a s called the th root of a real umber b f Thus, for example: s a square root of sce. s also a square root of sce ( ). s a cube

More information

III-16 G. Brief Review of Grand Orthogonality Theorem and impact on Representations (Γ i ) l i = h n = number of irreducible representations.

III-16 G. Brief Review of Grand Orthogonality Theorem and impact on Representations (Γ i ) l i = h n = number of irreducible representations. III- G. Bref evew of Grad Orthogoalty Theorem ad mpact o epresetatos ( ) GOT: h [ () m ] [ () m ] δδ δmm ll GOT puts great restrcto o form of rreducble represetato also o umber: l h umber of rreducble

More information

Lecture 3 Probability review (cont d)

Lecture 3 Probability review (cont d) STATS 00: Itroducto to Statstcal Iferece Autum 06 Lecture 3 Probablty revew (cot d) 3. Jot dstrbutos If radom varables X,..., X k are depedet, the ther dstrbuto may be specfed by specfyg the dvdual dstrbuto

More information

CSE 5526: Introduction to Neural Networks Linear Regression

CSE 5526: Introduction to Neural Networks Linear Regression CSE 556: Itroducto to Neural Netorks Lear Regresso Part II 1 Problem statemet Part II Problem statemet Part II 3 Lear regresso th oe varable Gve a set of N pars of data , appromate d by a lear fucto

More information

Regression and the LMS Algorithm

Regression and the LMS Algorithm CSE 556: Itroducto to Neural Netorks Regresso ad the LMS Algorthm CSE 556: Regresso 1 Problem statemet CSE 556: Regresso Lear regresso th oe varable Gve a set of N pars of data {, d }, appromate d b a

More information

( ) = ( ) ( ) Chapter 13 Asymptotic Theory and Stochastic Regressors. Stochastic regressors model

( ) = ( ) ( ) Chapter 13 Asymptotic Theory and Stochastic Regressors. Stochastic regressors model Chapter 3 Asmptotc Theor ad Stochastc Regressors The ature of eplaator varable s assumed to be o-stochastc or fed repeated samples a regresso aalss Such a assumpto s approprate for those epermets whch

More information

The Mathematical Appendix

The Mathematical Appendix The Mathematcal Appedx Defto A: If ( Λ, Ω, where ( λ λ λ whch the probablty dstrbutos,,..., Defto A. uppose that ( Λ,,..., s a expermet type, the σ-algebra o λ λ λ are defed s deoted by ( (,,...,, σ Ω.

More information

4 Inner Product Spaces

4 Inner Product Spaces 11.MH1 LINEAR ALGEBRA Summary Notes 4 Ier Product Spaces Ier product s the abstracto to geeral vector spaces of the famlar dea of the scalar product of two vectors or 3. I what follows, keep these key

More information

Introduction to local (nonparametric) density estimation. methods

Introduction to local (nonparametric) density estimation. methods Itroducto to local (oparametrc) desty estmato methods A slecture by Yu Lu for ECE 66 Sprg 014 1. Itroducto Ths slecture troduces two local desty estmato methods whch are Parze desty estmato ad k-earest

More information

QR Factorization and Singular Value Decomposition COS 323

QR Factorization and Singular Value Decomposition COS 323 QR Factorzato ad Sgular Value Decomposto COS 33 Why Yet Aother Method? How do we solve least-squares wthout currg codto-squarg effect of ormal equatos (A T A A T b) whe A s sgular, fat, or otherwse poorly-specfed?

More information

TESTS BASED ON MAXIMUM LIKELIHOOD

TESTS BASED ON MAXIMUM LIKELIHOOD ESE 5 Toy E. Smth. The Basc Example. TESTS BASED ON MAXIMUM LIKELIHOOD To llustrate the propertes of maxmum lkelhood estmates ad tests, we cosder the smplest possble case of estmatg the mea of the ormal

More information

Some Different Perspectives on Linear Least Squares

Some Different Perspectives on Linear Least Squares Soe Dfferet Perspectves o Lear Least Squares A stadard proble statstcs s to easure a respose or depedet varable, y, at fed values of oe or ore depedet varables. Soetes there ests a deterstc odel y f (,,

More information

Support vector machines

Support vector machines CS 75 Mache Learg Lecture Support vector maches Mlos Hauskrecht mlos@cs.ptt.edu 539 Seott Square CS 75 Mache Learg Outle Outle: Algorthms for lear decso boudary Support vector maches Mamum marg hyperplae.

More information

Transforms that are commonly used are separable

Transforms that are commonly used are separable Trasforms s Trasforms that are commoly used are separable Eamples: Two-dmesoal DFT DCT DST adamard We ca the use -D trasforms computg the D separable trasforms: Take -D trasform of the rows > rows ( )

More information

ESS Line Fitting

ESS Line Fitting ESS 5 014 17. Le Fttg A very commo problem data aalyss s lookg for relatoshpetwee dfferet parameters ad fttg les or surfaces to data. The smplest example s fttg a straght le ad we wll dscuss that here

More information

Lecture Notes 2. The ability to manipulate matrices is critical in economics.

Lecture Notes 2. The ability to manipulate matrices is critical in economics. Lecture Notes. Revew of Matrces he ablt to mapulate matrces s crtcal ecoomcs.. Matr a rectagular arra of umbers, parameters, or varables placed rows ad colums. Matrces are assocated wth lear equatos. lemets

More information

Summary of the lecture in Biostatistics

Summary of the lecture in Biostatistics Summary of the lecture Bostatstcs Probablty Desty Fucto For a cotuos radom varable, a probablty desty fucto s a fucto such that: 0 dx a b) b a dx A probablty desty fucto provdes a smple descrpto of the

More information

Application of Legendre Bernstein basis transformations to degree elevation and degree reduction

Application of Legendre Bernstein basis transformations to degree elevation and degree reduction Computer Aded Geometrc Desg 9 79 78 www.elsever.com/locate/cagd Applcato of Legedre Berste bass trasformatos to degree elevato ad degree reducto Byug-Gook Lee a Yubeom Park b Jaechl Yoo c a Dvso of Iteret

More information

CHAPTER VI Statistical Analysis of Experimental Data

CHAPTER VI Statistical Analysis of Experimental Data Chapter VI Statstcal Aalyss of Expermetal Data CHAPTER VI Statstcal Aalyss of Expermetal Data Measuremets do ot lead to a uque value. Ths s a result of the multtude of errors (maly radom errors) that ca

More information

Assignment 5/MATH 247/Winter Due: Friday, February 19 in class (!) (answers will be posted right after class)

Assignment 5/MATH 247/Winter Due: Friday, February 19 in class (!) (answers will be posted right after class) Assgmet 5/MATH 7/Wter 00 Due: Frday, February 9 class (!) (aswers wll be posted rght after class) As usual, there are peces of text, before the questos [], [], themselves. Recall: For the quadratc form

More information

Linear Regression Linear Regression with Shrinkage. Some slides are due to Tommi Jaakkola, MIT AI Lab

Linear Regression Linear Regression with Shrinkage. Some slides are due to Tommi Jaakkola, MIT AI Lab Lear Regresso Lear Regresso th Shrkage Some sldes are due to Tomm Jaakkola, MIT AI Lab Itroducto The goal of regresso s to make quattatve real valued predctos o the bass of a vector of features or attrbutes.

More information

PROJECTION PROBLEM FOR REGULAR POLYGONS

PROJECTION PROBLEM FOR REGULAR POLYGONS Joural of Mathematcal Sceces: Advaces ad Applcatos Volume, Number, 008, Pages 95-50 PROJECTION PROBLEM FOR REGULAR POLYGONS College of Scece Bejg Forestry Uversty Bejg 0008 P. R. Cha e-mal: sl@bjfu.edu.c

More information

Midterm Exam 1, section 1 (Solution) Thursday, February hour, 15 minutes

Midterm Exam 1, section 1 (Solution) Thursday, February hour, 15 minutes coometrcs, CON Sa Fracsco State Uversty Mchael Bar Sprg 5 Mdterm am, secto Soluto Thursday, February 6 hour, 5 mutes Name: Istructos. Ths s closed book, closed otes eam.. No calculators of ay kd are allowed..

More information

n -dimensional vectors follow naturally from the one

n -dimensional vectors follow naturally from the one B. Vectors ad sets B. Vectors Ecoomsts study ecoomc pheomea by buldg hghly stylzed models. Uderstadg ad makg use of almost all such models requres a hgh comfort level wth some key mathematcal sklls. I

More information

Supervised learning: Linear regression Logistic regression

Supervised learning: Linear regression Logistic regression CS 57 Itroducto to AI Lecture 4 Supervsed learg: Lear regresso Logstc regresso Mlos Hauskrecht mlos@cs.ptt.edu 539 Seott Square CS 57 Itro to AI Data: D { D D.. D D Supervsed learg d a set of eamples s

More information

An Introduction to. Support Vector Machine

An Introduction to. Support Vector Machine A Itroducto to Support Vector Mache Support Vector Mache (SVM) A classfer derved from statstcal learg theory by Vapk, et al. 99 SVM became famous whe, usg mages as put, t gave accuracy comparable to eural-etwork

More information

Dimensionality reduction Feature selection

Dimensionality reduction Feature selection CS 750 Mache Learg Lecture 3 Dmesoalty reducto Feature selecto Mlos Hauskrecht mlos@cs.ptt.edu 539 Seott Square CS 750 Mache Learg Dmesoalty reducto. Motvato. Classfcato problem eample: We have a put data

More information

Unsupervised Learning and Other Neural Networks

Unsupervised Learning and Other Neural Networks CSE 53 Soft Computg NOT PART OF THE FINAL Usupervsed Learg ad Other Neural Networs Itroducto Mture Destes ad Idetfablty ML Estmates Applcato to Normal Mtures Other Neural Networs Itroducto Prevously, all

More information

Objectives of Multiple Regression

Objectives of Multiple Regression Obectves of Multple Regresso Establsh the lear equato that best predcts values of a depedet varable Y usg more tha oe eplaator varable from a large set of potetal predctors {,,... k }. Fd that subset of

More information

X X X E[ ] E X E X. is the ()m n where the ( i,)th. j element is the mean of the ( i,)th., then

X X X E[ ] E X E X. is the ()m n where the ( i,)th. j element is the mean of the ( i,)th., then Secto 5 Vectors of Radom Varables Whe workg wth several radom varables,,..., to arrage them vector form x, t s ofte coveet We ca the make use of matrx algebra to help us orgaze ad mapulate large umbers

More information

Functions of Random Variables

Functions of Random Variables Fuctos of Radom Varables Chapter Fve Fuctos of Radom Varables 5. Itroducto A geeral egeerg aalyss model s show Fg. 5.. The model output (respose) cotas the performaces of a system or product, such as weght,

More information

Mean is only appropriate for interval or ratio scales, not ordinal or nominal.

Mean is only appropriate for interval or ratio scales, not ordinal or nominal. Mea Same as ordary average Sum all the data values ad dvde by the sample sze. x = ( x + x +... + x Usg summato otato, we wrte ths as x = x = x = = ) x Mea s oly approprate for terval or rato scales, ot

More information

L5 Polynomial / Spline Curves

L5 Polynomial / Spline Curves L5 Polyomal / Sple Curves Cotets Coc sectos Polyomal Curves Hermte Curves Bezer Curves B-Sples No-Uform Ratoal B-Sples (NURBS) Mapulato ad Represetato of Curves Types of Curve Equatos Implct: Descrbe a

More information

Investigating Cellular Automata

Investigating Cellular Automata Researcher: Taylor Dupuy Advsor: Aaro Wootto Semester: Fall 4 Ivestgatg Cellular Automata A Overvew of Cellular Automata: Cellular Automata are smple computer programs that geerate rows of black ad whte

More information

To use adaptive cluster sampling we must first make some definitions of the sampling universe:

To use adaptive cluster sampling we must first make some definitions of the sampling universe: 8.3 ADAPTIVE SAMPLING Most of the methods dscussed samplg theory are lmted to samplg desgs hch the selecto of the samples ca be doe before the survey, so that oe of the decsos about samplg deped ay ay

More information

Simple Linear Regression

Simple Linear Regression Statstcal Methods I (EST 75) Page 139 Smple Lear Regresso Smple regresso applcatos are used to ft a model descrbg a lear relatoshp betwee two varables. The aspects of least squares regresso ad correlato

More information

Chapter 2 - Free Vibration of Multi-Degree-of-Freedom Systems - II

Chapter 2 - Free Vibration of Multi-Degree-of-Freedom Systems - II CEE49b Chapter - Free Vbrato of Mult-Degree-of-Freedom Systems - II We ca obta a approxmate soluto to the fudametal atural frequecy through a approxmate formula developed usg eergy prcples by Lord Raylegh

More information

ENGI 3423 Simple Linear Regression Page 12-01

ENGI 3423 Simple Linear Regression Page 12-01 ENGI 343 mple Lear Regresso Page - mple Lear Regresso ometmes a expermet s set up where the expermeter has cotrol over the values of oe or more varables X ad measures the resultg values of aother varable

More information

Analysis of Variance with Weibull Data

Analysis of Variance with Weibull Data Aalyss of Varace wth Webull Data Lahaa Watthaacheewaul Abstract I statstcal data aalyss by aalyss of varace, the usual basc assumptos are that the model s addtve ad the errors are radomly, depedetly, ad

More information

Principal Components. Analysis. Basic Intuition. A Method of Self Organized Learning

Principal Components. Analysis. Basic Intuition. A Method of Self Organized Learning Prcpal Compoets Aalss A Method of Self Orgazed Learg Prcpal Compoets Aalss Stadard techque for data reducto statstcal patter matchg ad sgal processg Usupervsed learg: lear from examples wthout a teacher

More information

Kernel-based Methods and Support Vector Machines

Kernel-based Methods and Support Vector Machines Kerel-based Methods ad Support Vector Maches Larr Holder CptS 570 Mache Learg School of Electrcal Egeerg ad Computer Scece Washgto State Uverst Refereces Muller et al. A Itroducto to Kerel-Based Learg

More information

MULTIDIMENSIONAL HETEROGENEOUS VARIABLE PREDICTION BASED ON EXPERTS STATEMENTS. Gennadiy Lbov, Maxim Gerasimov

MULTIDIMENSIONAL HETEROGENEOUS VARIABLE PREDICTION BASED ON EXPERTS STATEMENTS. Gennadiy Lbov, Maxim Gerasimov Iteratoal Boo Seres "Iformato Scece ad Computg" 97 MULTIIMNSIONAL HTROGNOUS VARIABL PRICTION BAS ON PRTS STATMNTS Geady Lbov Maxm Gerasmov Abstract: I the wors [ ] we proposed a approach of formg a cosesus

More information

Non-uniform Turán-type problems

Non-uniform Turán-type problems Joural of Combatoral Theory, Seres A 111 2005 106 110 wwwelsevercomlocatecta No-uform Turá-type problems DhruvMubay 1, Y Zhao 2 Departmet of Mathematcs, Statstcs, ad Computer Scece, Uversty of Illos at

More information

Point Estimation: definition of estimators

Point Estimation: definition of estimators Pot Estmato: defto of estmators Pot estmator: ay fucto W (X,..., X ) of a data sample. The exercse of pot estmato s to use partcular fuctos of the data order to estmate certa ukow populato parameters.

More information

4. Standard Regression Model and Spatial Dependence Tests

4. Standard Regression Model and Spatial Dependence Tests 4. Stadard Regresso Model ad Spatal Depedece Tests Stadard regresso aalss fals the presece of spatal effects. I case of spatal depedeces ad/or spatal heterogeet a stadard regresso model wll be msspecfed.

More information

Bayesian Inferences for Two Parameter Weibull Distribution Kipkoech W. Cheruiyot 1, Abel Ouko 2, Emily Kirimi 3

Bayesian Inferences for Two Parameter Weibull Distribution Kipkoech W. Cheruiyot 1, Abel Ouko 2, Emily Kirimi 3 IOSR Joural of Mathematcs IOSR-JM e-issn: 78-578, p-issn: 9-765X. Volume, Issue Ver. II Ja - Feb. 05, PP 4- www.osrjourals.org Bayesa Ifereces for Two Parameter Webull Dstrbuto Kpkoech W. Cheruyot, Abel

More information

A conic cutting surface method for linear-quadraticsemidefinite

A conic cutting surface method for linear-quadraticsemidefinite A coc cuttg surface method for lear-quadratcsemdefte programmg Mohammad R. Osoorouch Calfora State Uversty Sa Marcos Sa Marcos, CA Jot wor wth Joh E. Mtchell RPI July 3, 2008 Outle: Secod-order coe: defto

More information

MOLECULAR VIBRATIONS

MOLECULAR VIBRATIONS MOLECULAR VIBRATIONS Here we wsh to vestgate molecular vbratos ad draw a smlarty betwee the theory of molecular vbratos ad Hückel theory. 1. Smple Harmoc Oscllator Recall that the eergy of a oe-dmesoal

More information

MEASURES OF DISPERSION

MEASURES OF DISPERSION MEASURES OF DISPERSION Measure of Cetral Tedecy: Measures of Cetral Tedecy ad Dsperso ) Mathematcal Average: a) Arthmetc mea (A.M.) b) Geometrc mea (G.M.) c) Harmoc mea (H.M.) ) Averages of Posto: a) Meda

More information

A scalar t is an eigenvalue of A if and only if t satisfies the characteristic equation of A: det (A ti) =0

A scalar t is an eigenvalue of A if and only if t satisfies the characteristic equation of A: det (A ti) =0 Chapter 5 a glace: Let e a lear operator whose stadard matrx s wth sze x. he, a ozero vector x s sad to e a egevector of ad f there exsts a scalar sch that (x) x x. he scalar s called a egevale of (or

More information

Econometric Methods. Review of Estimation

Econometric Methods. Review of Estimation Ecoometrc Methods Revew of Estmato Estmatg the populato mea Radom samplg Pot ad terval estmators Lear estmators Ubased estmators Lear Ubased Estmators (LUEs) Effcecy (mmum varace) ad Best Lear Ubased Estmators

More information

MATH 247/Winter Notes on the adjoint and on normal operators.

MATH 247/Winter Notes on the adjoint and on normal operators. MATH 47/Wter 00 Notes o the adjot ad o ormal operators I these otes, V s a fte dmesoal er product space over, wth gve er * product uv, T, S, T, are lear operators o V U, W are subspaces of V Whe we say

More information

Chapter 14 Logistic Regression Models

Chapter 14 Logistic Regression Models Chapter 4 Logstc Regresso Models I the lear regresso model X β + ε, there are two types of varables explaatory varables X, X,, X k ad study varable y These varables ca be measured o a cotuous scale as

More information

Beam Warming Second-Order Upwind Method

Beam Warming Second-Order Upwind Method Beam Warmg Secod-Order Upwd Method Petr Valeta Jauary 6, 015 Ths documet s a part of the assessmet work for the subject 1DRP Dfferetal Equatos o Computer lectured o FNSPE CTU Prague. Abstract Ths documet

More information

Correlation and Regression Analysis

Correlation and Regression Analysis Chapter V Correlato ad Regresso Aalss R. 5.. So far we have cosdered ol uvarate dstrbutos. Ma a tme, however, we come across problems whch volve two or more varables. Ths wll be the subject matter of the

More information

Median as a Weighted Arithmetic Mean of All Sample Observations

Median as a Weighted Arithmetic Mean of All Sample Observations Meda as a Weghted Arthmetc Mea of All Sample Observatos SK Mshra Dept. of Ecoomcs NEHU, Shllog (Ida). Itroducto: Iumerably may textbooks Statstcs explctly meto that oe of the weakesses (or propertes) of

More information

hp calculators HP 30S Statistics Averages and Standard Deviations Average and Standard Deviation Practice Finding Averages and Standard Deviations

hp calculators HP 30S Statistics Averages and Standard Deviations Average and Standard Deviation Practice Finding Averages and Standard Deviations HP 30S Statstcs Averages ad Stadard Devatos Average ad Stadard Devato Practce Fdg Averages ad Stadard Devatos HP 30S Statstcs Averages ad Stadard Devatos Average ad stadard devato The HP 30S provdes several

More information

C-1: Aerodynamics of Airfoils 1 C-2: Aerodynamics of Airfoils 2 C-3: Panel Methods C-4: Thin Airfoil Theory

C-1: Aerodynamics of Airfoils 1 C-2: Aerodynamics of Airfoils 2 C-3: Panel Methods C-4: Thin Airfoil Theory ROAD MAP... AE301 Aerodyamcs I UNIT C: 2-D Arfols C-1: Aerodyamcs of Arfols 1 C-2: Aerodyamcs of Arfols 2 C-3: Pael Methods C-4: Th Arfol Theory AE301 Aerodyamcs I Ut C-3: Lst of Subects Problem Solutos?

More information

Ordinary Least Squares Regression. Simple Regression. Algebra and Assumptions.

Ordinary Least Squares Regression. Simple Regression. Algebra and Assumptions. Ordary Least Squares egresso. Smple egresso. Algebra ad Assumptos. I ths part of the course we are gog to study a techque for aalysg the lear relatoshp betwee two varables Y ad X. We have pars of observatos

More information

Solving Constrained Flow-Shop Scheduling. Problems with Three Machines

Solving Constrained Flow-Shop Scheduling. Problems with Three Machines It J Cotemp Math Sceces, Vol 5, 2010, o 19, 921-929 Solvg Costraed Flow-Shop Schedulg Problems wth Three Maches P Pada ad P Rajedra Departmet of Mathematcs, School of Advaced Sceces, VIT Uversty, Vellore-632

More information

Simple Linear Regression

Simple Linear Regression Correlato ad Smple Lear Regresso Berl Che Departmet of Computer Scece & Iformato Egeerg Natoal Tawa Normal Uversty Referece:. W. Navd. Statstcs for Egeerg ad Scetsts. Chapter 7 (7.-7.3) & Teachg Materal

More information

BAYESIAN INFERENCES FOR TWO PARAMETER WEIBULL DISTRIBUTION

BAYESIAN INFERENCES FOR TWO PARAMETER WEIBULL DISTRIBUTION Iteratoal Joural of Mathematcs ad Statstcs Studes Vol.4, No.3, pp.5-39, Jue 06 Publshed by Europea Cetre for Research Trag ad Developmet UK (www.eajourals.org BAYESIAN INFERENCES FOR TWO PARAMETER WEIBULL

More information

Dr. Shalabh. Indian Institute of Technology Kanpur

Dr. Shalabh. Indian Institute of Technology Kanpur Aalyss of Varace ad Desg of Expermets-I MODULE -I LECTURE - SOME RESULTS ON LINEAR ALGEBRA, MATRIX THEORY AND DISTRIBUTIONS Dr. Shalabh Departmet t of Mathematcs t ad Statstcs t t Ida Isttute of Techology

More information

Chapter 3. Linear Equations and Matrices

Chapter 3. Linear Equations and Matrices Vector Spaces Physcs 8/6/05 hapter Lear Equatos ad Matrces wde varety of physcal problems volve solvg systems of smultaeous lear equatos These systems of lear equatos ca be ecoomcally descrbed ad effcetly

More information

ANALYSIS ON THE NATURE OF THE BASIC EQUATIONS IN SYNERGETIC INTER-REPRESENTATION NETWORK

ANALYSIS ON THE NATURE OF THE BASIC EQUATIONS IN SYNERGETIC INTER-REPRESENTATION NETWORK Far East Joural of Appled Mathematcs Volume, Number, 2008, Pages Ths paper s avalable ole at http://www.pphm.com 2008 Pushpa Publshg House ANALYSIS ON THE NATURE OF THE ASI EQUATIONS IN SYNERGETI INTER-REPRESENTATION

More information

A New Family of Transformations for Lifetime Data

A New Family of Transformations for Lifetime Data Proceedgs of the World Cogress o Egeerg 4 Vol I, WCE 4, July - 4, 4, Lodo, U.K. A New Famly of Trasformatos for Lfetme Data Lakhaa Watthaacheewakul Abstract A famly of trasformatos s the oe of several

More information

A COMPARATIVE STUDY OF THE METHODS OF SOLVING NON-LINEAR PROGRAMMING PROBLEM

A COMPARATIVE STUDY OF THE METHODS OF SOLVING NON-LINEAR PROGRAMMING PROBLEM DAODIL INTERNATIONAL UNIVERSITY JOURNAL O SCIENCE AND TECHNOLOGY, VOLUME, ISSUE, JANUARY 9 A COMPARATIVE STUDY O THE METHODS O SOLVING NON-LINEAR PROGRAMMING PROBLEM Bmal Chadra Das Departmet of Tetle

More information

Estimation of Stress- Strength Reliability model using finite mixture of exponential distributions

Estimation of Stress- Strength Reliability model using finite mixture of exponential distributions Iteratoal Joural of Computatoal Egeerg Research Vol, 0 Issue, Estmato of Stress- Stregth Relablty model usg fte mxture of expoetal dstrbutos K.Sadhya, T.S.Umamaheswar Departmet of Mathematcs, Lal Bhadur

More information

Singular Value Decomposition. Linear Algebra (3) Singular Value Decomposition. SVD and Eigenvectors. Solving LEs with SVD

Singular Value Decomposition. Linear Algebra (3) Singular Value Decomposition. SVD and Eigenvectors. Solving LEs with SVD Sgular Value Decomosto Lear Algera (3) m Cootes Ay m x matrx wth m ca e decomosed as follows Dagoal matrx A UWV m x x Orthogoal colums U U I w1 0 0 w W M M 0 0 x Orthoormal (Pure rotato) VV V V L 0 L 0

More information

Lecture 7. Confidence Intervals and Hypothesis Tests in the Simple CLR Model

Lecture 7. Confidence Intervals and Hypothesis Tests in the Simple CLR Model Lecture 7. Cofdece Itervals ad Hypothess Tests the Smple CLR Model I lecture 6 we troduced the Classcal Lear Regresso (CLR) model that s the radom expermet of whch the data Y,,, K, are the outcomes. The

More information

( q Modal Analysis. Eigenvectors = Mode Shapes? Eigenproblem (cont) = x x 2 u 2. u 1. x 1 (4.55) vector and M and K are matrices.

( q Modal Analysis. Eigenvectors = Mode Shapes? Eigenproblem (cont) = x x 2 u 2. u 1. x 1 (4.55) vector and M and K are matrices. 4.3 - Modal Aalyss Physcal coordates are ot always the easest to work Egevectors provde a coveet trasformato to modal coordates Modal coordates are lear combato of physcal coordates Say we have physcal

More information

ECONOMETRIC THEORY. MODULE VIII Lecture - 26 Heteroskedasticity

ECONOMETRIC THEORY. MODULE VIII Lecture - 26 Heteroskedasticity ECONOMETRIC THEORY MODULE VIII Lecture - 6 Heteroskedastcty Dr. Shalabh Departmet of Mathematcs ad Statstcs Ida Isttute of Techology Kapur . Breusch Paga test Ths test ca be appled whe the replcated data

More information

1 Mixed Quantum State. 2 Density Matrix. CS Density Matrices, von Neumann Entropy 3/7/07 Spring 2007 Lecture 13. ψ = α x x. ρ = p i ψ i ψ i.

1 Mixed Quantum State. 2 Density Matrix. CS Density Matrices, von Neumann Entropy 3/7/07 Spring 2007 Lecture 13. ψ = α x x. ρ = p i ψ i ψ i. CS 94- Desty Matrces, vo Neuma Etropy 3/7/07 Sprg 007 Lecture 3 I ths lecture, we wll dscuss the bascs of quatum formato theory I partcular, we wll dscuss mxed quatum states, desty matrces, vo Neuma etropy

More information

Generalized Minimum Perpendicular Distance Square Method of Estimation

Generalized Minimum Perpendicular Distance Square Method of Estimation Appled Mathematcs,, 3, 945-949 http://dx.do.org/.436/am..366 Publshed Ole December (http://.scrp.org/joural/am) Geeralzed Mmum Perpedcular Dstace Square Method of Estmato Rezaul Karm, Morshed Alam, M.

More information

Lecture 07: Poles and Zeros

Lecture 07: Poles and Zeros Lecture 07: Poles ad Zeros Defto of poles ad zeros The trasfer fucto provdes a bass for determg mportat system respose characterstcs wthout solvg the complete dfferetal equato. As defed, the trasfer fucto

More information

A Study of the Reproducibility of Measurements with HUR Leg Extension/Curl Research Line

A Study of the Reproducibility of Measurements with HUR Leg Extension/Curl Research Line HUR Techcal Report 000--9 verso.05 / Frak Borg (borgbros@ett.f) A Study of the Reproducblty of Measuremets wth HUR Leg Eteso/Curl Research Le A mportat property of measuremets s that the results should

More information

Analysis of Lagrange Interpolation Formula

Analysis of Lagrange Interpolation Formula P IJISET - Iteratoal Joural of Iovatve Scece, Egeerg & Techology, Vol. Issue, December 4. www.jset.com ISS 348 7968 Aalyss of Lagrage Iterpolato Formula Vjay Dahya PDepartmet of MathematcsMaharaja Surajmal

More information

Multiple Linear Regression Analysis

Multiple Linear Regression Analysis LINEA EGESSION ANALYSIS MODULE III Lecture - 4 Multple Lear egresso Aalyss Dr. Shalabh Departmet of Mathematcs ad Statstcs Ida Isttute of Techology Kapur Cofdece terval estmato The cofdece tervals multple

More information

A tighter lower bound on the circuit size of the hardest Boolean functions

A tighter lower bound on the circuit size of the hardest Boolean functions Electroc Colloquum o Computatoal Complexty, Report No. 86 2011) A tghter lower boud o the crcut sze of the hardest Boolea fuctos Masak Yamamoto Abstract I [IPL2005], Fradse ad Mlterse mproved bouds o the

More information

Section l h l Stem=Tens. 8l Leaf=Ones. 8h l 03. 9h 58

Section l h l Stem=Tens. 8l Leaf=Ones. 8h l 03. 9h 58 Secto.. 6l 34 6h 667899 7l 44 7h Stem=Tes 8l 344 Leaf=Oes 8h 5557899 9l 3 9h 58 Ths dsplay brgs out the gap the data: There are o scores the hgh 7's. 6. a. beams cylders 9 5 8 88533 6 6 98877643 7 488

More information

x y exp λ'. x exp λ 2. x exp 1.

x y exp λ'. x exp λ 2. x exp 1. egecosmcd Egevalue-egevector of the secod dervatve operator d /d hs leads to Fourer seres (se, cose, Legedre, Bessel, Chebyshev, etc hs s a eample of a systematc way of geeratg a set of mutually orthogoal

More information

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #1

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #1 STA 08 Appled Lear Models: Regresso Aalyss Sprg 0 Soluto for Homework #. Let Y the dollar cost per year, X the umber of vsts per year. The the mathematcal relato betwee X ad Y s: Y 300 + X. Ths s a fuctoal

More information

Regresso What s a Model? 1. Ofte Descrbe Relatoshp betwee Varables 2. Types - Determstc Models (o radomess) - Probablstc Models (wth radomess) EPI 809/Sprg 2008 9 Determstc Models 1. Hypothesze

More information

Johns Hopkins University Department of Biostatistics Math Review for Introductory Courses

Johns Hopkins University Department of Biostatistics Math Review for Introductory Courses Johs Hopks Uverst Departmet of Bostatstcs Math Revew for Itroductor Courses Ratoale Bostatstcs courses wll rel o some fudametal mathematcal relatoshps, fuctos ad otato. The purpose of ths Math Revew s

More information

Lecture 12: Multilayer perceptrons II

Lecture 12: Multilayer perceptrons II Lecture : Multlayer perceptros II Bayes dscrmats ad MLPs he role of hdde uts A eample Itroducto to Patter Recoto Rcardo Guterrez-Osua Wrht State Uversty Bayes dscrmats ad MLPs ( As we have see throuhout

More information

Lecture 1 Review of Fundamental Statistical Concepts

Lecture 1 Review of Fundamental Statistical Concepts Lecture Revew of Fudametal Statstcal Cocepts Measures of Cetral Tedecy ad Dsperso A word about otato for ths class: Idvduals a populato are desgated, where the dex rages from to N, ad N s the total umber

More information

Generalization of the Dissimilarity Measure of Fuzzy Sets

Generalization of the Dissimilarity Measure of Fuzzy Sets Iteratoal Mathematcal Forum 2 2007 o. 68 3395-3400 Geeralzato of the Dssmlarty Measure of Fuzzy Sets Faramarz Faghh Boformatcs Laboratory Naobotechology Research Ceter vesa Research Isttute CECR Tehra

More information

Unimodality Tests for Global Optimization of Single Variable Functions Using Statistical Methods

Unimodality Tests for Global Optimization of Single Variable Functions Using Statistical Methods Malaysa Umodalty Joural Tests of Mathematcal for Global Optmzato Sceces (): of 05 Sgle - 5 Varable (007) Fuctos Usg Statstcal Methods Umodalty Tests for Global Optmzato of Sgle Varable Fuctos Usg Statstcal

More information

Lattices. Mathematical background

Lattices. Mathematical background Lattces Mathematcal backgroud Lattces : -dmesoal Eucldea space. That s, { T x } x x = (,, ) :,. T T If x= ( x,, x), y = ( y,, y), the xy, = xy (er product of xad y) x = /2 xx, (Eucldea legth or orm of

More information

Statistics. Correlational. Dr. Ayman Eldeib. Simple Linear Regression and Correlation. SBE 304: Linear Regression & Correlation 1/3/2018

Statistics. Correlational. Dr. Ayman Eldeib. Simple Linear Regression and Correlation. SBE 304: Linear Regression & Correlation 1/3/2018 /3/08 Sstems & Bomedcal Egeerg Departmet SBE 304: Bo-Statstcs Smple Lear Regresso ad Correlato Dr. Ama Eldeb Fall 07 Descrptve Orgasg, summarsg & descrbg data Statstcs Correlatoal Relatoshps Iferetal Geeralsg

More information

Chapter 4 (Part 1): Non-Parametric Classification (Sections ) Pattern Classification 4.3) Announcements

Chapter 4 (Part 1): Non-Parametric Classification (Sections ) Pattern Classification 4.3) Announcements Aoucemets No-Parametrc Desty Estmato Techques HW assged Most of ths lecture was o the blacboard. These sldes cover the same materal as preseted DHS Bometrcs CSE 90-a Lecture 7 CSE90a Fall 06 CSE90a Fall

More information