Fast Universal Background Model (UBM) Training on GPUs using Compute Unified Device Architecture (CUDA)

Size: px
Start display at page:

Download "Fast Universal Background Model (UBM) Training on GPUs using Compute Unified Device Architecture (CUDA)"

Transcription

1 Inernaonal Journal of Elecrcal & Compuer Scences IJECS-IJENS Vol: No: 04 5 Fas Unversal Background Model (UBM) ranng on GPUs usng Compue Unfed Devce Archecure (CUDA) M. Azhar, C. Ergün. Compuer Engneerng Deparmen, Easern Mederranean Unversy Absrac Unversal Background Modelng (UBM) s an alernave hypoheszed modelng ha s used exensvely n Speaker Verfcaon (SV) sysems. ranng he background models from large speech daa requres a sgnfcan amoun of memory and compuaonal load. In hs paper a parallel mplemenaon of speaker verfcaon sysem based on Gaussan Mxure Modelng Unversal Background Modelng (GMM UBM) desgned for many-core archecure of NVIDIA s Graphcs Processng Uns (GPU) usng CUDA sngle nsrucon mulple hreads (SIM) model s presened. CUDA mplemenaon of hese algorhms s desgned n such a way ha he speed of compuaon of he algorhm ncreases wh number of GPU cores. In hs expermen 30 mes speedup for k-means cluserng and 6 mes speedup for Expecaon Maxmzaon (EM) was acheved for an npu of abou 350K frames of 6 dmensons and 04 mxures on GeForce GX 570 (NVIDIA Ferm Seres) wh 480 cores when compared o a sngle hreaded mplemenaon on he radonal CPU. Index erm Compue Unfed Devce Archecure (CUDA), Expecaon Maxmzaon (EM), Speaker Verfcaon, Unversal Background Model (UBM) I. INRODUCION UNIVERSAL BACKGROUND MODELING s an approach o approxmae he mposer model n speaker verfcaon sysems. In hs approach, pooled ranng daa from a large number of speakers s ulzed o consruc a large mxure model. he lkelhood of background speakers used o form reference. he UBM model parameers are also raned by esmang parameers λ of he GMM ha maches he dsrbuon of he ranng feaure vecors usng ranng speech from a speaker. he mos popular, well-esablshed and robus mehod s he Maxmum Lkelhood (ML) whch eravely refnes he GMM parameers. One mporan propery of ML esmaon s ha for a large enough se of ranng, he model esmae converges o he rue model parameers as he daa lengh grows. Snce here s no closedform for compung he ML esmae, an erave echnque known as Expecaon Maxmzaon s used []. M. A. s wh Compuer Engneerng Deparmen, Easern Mederranean Unversy, Famagusa, Mersn 0 urkey (mohammad.azhar@emu.edu.r). C. E. s wh Compuer Engneerng Deparmen, Easern Mederranean Unversy, Famagusa, Mersn 0 urkey (cem.ergun@emu.edu.r). Opmzaon of he SV sysems can help o decrease developmen me. I should be kep n mnd ha he mxure based modelng echnques can be easly opmzed usng parallel mplemenaon. he exsence of a massvely parallel compung echnology such as CUDA has changed he face of compung n he pas years. Currenly havng some of he mos effcen rangs on Green500 ls [] ndcaes ha oday s GPUs are more sued for scenfc compung wh less power consumpon. In hs paper a research on parallelzng UBM parameer esmaon of SV s performed. In general, a man SV sysem ncludes fron end processng (nose cancellaon, feaure exracon and selecon), background model generaon, adapaon and esng phases. In hs paper he focus wll be on background model parameers generaon for Mel-Frequency Cepsrum Coeffcens (MFCC) daa and parallelzaon of uns such as MFCC, k-means and Expecaon Maxmzaon algorhm. he conrbuon s on wo dfferen archecures of NVIDIA GPUs namely GX 85 (G00 Archecure) and GX 570 (Ferm Archecure). he speedup resuls of he laer wh he radonal CPU archecure, boh n sngle hreaded and mul-core mplemenaons wll be compared. I wll be observed how he use of even naïve GPU mplemenaon can resul n faser calculaons and how dfferen hardware archecures can affec he speed. Also wll be explaned how he echnques n GPU sofware mplemenaon drecly arge opmaly of he performance. GPUs have a very hgh floang pon calculaon hroughpu (+ Flop/s peak performance as opposed o a Core 7 CPU - heorecal peak performance of 09 GFlops/s) [3]. In hs paper, he naure of he problem was nroduced wh double precson floang pons. he menoned GPUs perform double precson operaons heorecally 75 GFlops and 86 GFLops for GX85 and GX570 respecvely. Consderng he sze of he problem (abou 350K samples of 6 dmensons) he soluon s no memory-bound and comes down o he compuaon lm. For he me beng, shared memory approaches were no sepped no as requres a more programmacally complex mplemenaon and suffers from ssues such as memory coalescng [4]. he res of he paper s organzed n he followng manner. In secon II, he MFCC, k-means and EM algorhms are descrbed o generae UBM parameers. In secon III, he archecure of CUDA Ferm IJECS-IJENS Augus 0 IJENS

2 Inernaonal Journal of Elecrcal & Compuer Scences IJECS-IJENS Vol: No: wll be presened. In secon IV he desgn of laer algorhms s dscussed boh for CPU Parallel and CUDA. he expermenaon and resuls of dfferen archecure models and analyss of performance booss are presened n secon V. Secon VI concludes he paper and presens some fuure work. II. FROND-END MODULE AND UBM GENERAION A. Mel-Frequency Cepsrum Coeffcens Mel-frequency Cepsrum coeffcens are well known feaures used o descrbe speech sgnal. hey are based on he evdence ha he nformaon carred by low-frequency componens of he speech sgnal s phonecally more mporan for humans han he nformaon carred by hgh frequency componens [5]. he Mel scale reflecs hs by usng a nonlnear warpng of he frequency scale,.e., by reducng he frequency resoluon as he frequency ncreases. he process of exracng MFCC from connuous speech s llusraed n Fg. : where s[n] s he wndowed speech segmen of lengh N. he power specrum s compued by akng he absolue value of he DF. hs s represened as he second block n Fg. hen he power specrum s passed hrough each Mel fler of he fler bank for calculang he oupu power of each fler. he fnal processng sep s o apply Dscree Cosne ransform (DC) o he logarhm of he fler-bank coeffcens, yeldng he MFCC parameers as C m N f N f log m e j cos j N f j m =,, D () where D s he number of MFCC parameers calculaed for each frame. In (), can be seen ha C 0 represens he average log energy of he specrum. Snce s preferred o use a feaure se whch s no suscepble o varyng background nose and loudness of speech, C 0 s omed, resulng n a D- dmensonal MFCC feaure vecor as, DF. Wndowed speech segmen Magnude specra of band flers x C, C,... C, D B. k-means Cluserng k-means Cluserng s a mehod ha helps o accelerae convergence [6]. In hs mehod wh a se of observaons (x, x,, x n ), each of whch a vecor of d dmensons are clusered no k ses (k n) S = {S, S,, S k } such ha he sum of square n each cluser s mnmzed: arg s mn k x js (3) x (4) where µ s he mean of pon n S. j Energy DC MFCC feaure vecor f (khz) Fg.. Block dagram of he MFCC compuaon process [5] In order o calculae he MFCC of a speech segmen, he frs block of he fgure obans he Dscree Fourer ransform (DF) of he segmen. A hs sep, he lengh of hs sequence s ncreased from N o N by nserng zeros o mprove he resoluon n he frequency doman (zero paddng). hen he segmen s ransformed no he frequency doman as C. Expecaon Maxmzaon Algorhm Expecaon-maxmzaon (EM) algorhm s a mehod ha eravely esmaes he lkelhood [7]. I s smlar o he k- means cluserng algorhm for Gaussan mxures snce hey boh look for he cener of clusers and refnemen s done eravely. For a gven se of observaon vecors X = { x, x,, x } and assumng vecors are ndependen and dencally dsrbued (d), he bes fng model λ s he one ha maxmzes, p( X ) p( ). (5) hs s a non-lnear problem; herefore λ canno be drecly calculaed. he EM algorhm consss of he followng seps: S k N n0 s n e kn j N k =,, N () IJECS-IJENS Augus 0 IJENS

3 Inernaonal Journal of Elecrcal & Compuer Scences IJECS-IJENS Vol: No: Alg.. Expecaon Maxmzaon Procedure In each EM eraon, he ML esmaes for he means, varances and weghs (a pror mxure probably) for a parcular speaker model are compued as follows: Mxure Wegh: m Means:, ) (6) x, ) x (7) x, ) Varances: where. Choose an nal model.. Fnd a new model so ha p(x ) > p(x ). 3. Repea sep unl he dfference 4. p(x ) - p(x ) has reached a convergence hreshold or you have reached he maxmum number of eraons. x, ) x (8) x, ) m, and are updaed values of m, and respecvely, and, and x refer o arbrary elemens of he vecors n, and x respecvely and, x s he shorhand for dalog. he poseror probably for he h acousc class s gven by, m b ( x ) x, ) (9) m b ( x ) M k k k One of he parameers ha should be seleced before ranng a Gaussan mxure speaker model s he order M of he mxure densy. he model parameers mus also be nalzed pror o he EM algorhm. hese selecons are expermenally deermned for a gven ask. In hs expermen, he order of mxure densy (M) s 04. he EM algorhm s guaraneed o fnd a locally opmal maxmum lkelhood model regardless of he sarng pon. However may have several local maxma and dfferen sarng models can lead o dfferen local maxma [8]. Alernavely, k-means algorhm can be used o perform a cluserng of he ranng daa no M acousc classes [9]. he M nal esmaes for he mxure means and he covarance marces can be seleced as he mean and he covarance of each cluser. III. COMPUE UNIFIED DEVICE ARCHIECURE CUDA [0] s he parallel compung archecure developed by NVIDIA and s he compung engne n NVIDIA graphcal processng uns (GPUs). hs archecure s avalable hrough dfferen programmng languages and suppors oher applcaon programmng nerfaces, such as CUDA FORRAN, OpenCL, and DrecCompue. CUDA helps o solve many complex compuaonal problems n a more effcen way han on a CPU. he CUDA parallel programmng model s desgned o overcome he challenge of developng applcaon sofware ha ransparenly scales s parallelsm whle mananng a low learnng curve for programmers famlar wh sandard programmng languages such as C. CUDA has several advanages over radonal general purpose compuaon on GPUs []. One of hem s ha he GPU code can access dfferen addresses n memory. Anoher advanage s avalably of shared memory whch s a locally accessble memory ha s shared beween a group of hreads. hs helps o reduce he global memory accesses and as a resul provdng a hgher bandwdh. However, here are some lmaons. One s ha he daa ransfer beween he devce and he hos memory s slower ha whn-devce memory ransfers. hreads are acvaed n groups of 3, wh housands of hem runnng n oal. Alhough [] dscusses ha hs may no always be he case, ncludng [3] and curren mplemenaon n whch a small number of hreads per block group were used o ge beer resuls. Anoher lmaon s ha CUDA echnology s only avalable for NVIDIA GPUs. I only suppors round-o-neares mode of IEEE 754 [4] sandard for double precson calculaons. On CUDA archecure, he problems are dvded no subproblems and each sub-problem no fner peces ha can cooperavely run n parallel by all hreads whn he block. Each block of hreads can be scheduled on any of he avalable processor cores, n any order, concurrenly or sequenally, so ha a compled CUDA program can execue on any number of processor cores as llusraed by Fg. : Fg.. Auomac Scalably [0] Blocks are organzed no a one-dmensonal, wodmensonal, or hree-dmensonal grd of hread blocks as IJECS-IJENS Augus 0 IJENS

4 Inernaonal Journal of Elecrcal & Compuer Scences IJECS-IJENS Vol: No: llusraed by Fg. 3. he number of hread blocks n a grd s usually dcaed by he sze of he daa beng processed or he number of processors n he sysem, whch can grealy exceed. redundancy of he calculaons.. Cores = 4;. N = 40; // Wnowsze 3. RangeSze = N / Cores; 4. For core I (concurrenly) 5. For k from [RangeSze * I] o [RangeSze * ( I + ) ] 6. Inalze Real and Imagnary varables 7. For n from 0 o * N 8. Calculae Real & Imagnary 0. Calculae DF Alg.. CPU DF Parallel In hs algorhm, he me complexy s O N. On CPU Parallelsm, he complexy s only dvded by a number much smaller han N. herefore he complexy remans he same. he CUDA DF mplemenaon n naure also follows he CPU model wh he excepon ha each eraon of loop k s run on a separae hread. In hs case he complexy s reduced O N. here are also memory ransfers beween hos and o CUDA devce as shown n Alg. 3. Fg. 3. Grd of hread Blocks [0] Each runnng block s dvded no Sngle Insrucon Mulple hreads groups called warps. he sze of hese warps are equal. he runnng warps are scheduled n a mely manner (me-slced). he hread scheduler swches beween warps o maxmze he use of he mulprocessors compuaonal resources. A. MFCC IV. IMPLEMENAION AND DESIGN Due o he naure of speech daa, s very normal o have unvoced sound segmens. Here, Voce Acvaon Daa (VAD) s used o dscard slence perods [5]. he oher fronend operaons lke segmenaon and wndowng are also appled o each frame n order o produce he correspondng feaure se. he complexy of MFCC exracon s manly domnaed by Dscree Fourer ransform (DF) of each O where N s s he wndow wndow segmen whch s sze. In hs sudy only he DF sep of MFCC s parallelzed as shown n Fg.. he CPU DF pseudo-code for wndow sze of 40 s as llusraed n Alg.. In order o have load balancng, he execuon of he loop a lne 5 of Alg. was spl no I number of cores. Each core s responsble for execung operaons of a range of wndows sze frames. In curren case an Inel Core 7-90 [3] CPU was used o perform he parallel ask on 4 cores. Noe ha he cosne and sne values are pre-compued o decrease he N s. Allocae & ransfer Sne and Cosne ables from hos o devce memory.. Allocae & ransfer Curren Wndow Frames from hos o devce memory. 3. Allocae space for DF oupu on devce memory 4. Perform DF on CUDA wh grd sze of 6 and block sze of 6 5. Copy he oupu from devce o hos memory Alg. 3. CUDA DF Parallel B. k-means Cluserng he k-means cluserng algorhm pseudo-code n a nushell s as shown n Alg. 4. Snce nal selecon of mean s random, mean vecors of sze M mxures by D dmensons were chosen from he feaure vecors.. Random nalzaon of mean vecors.. = number of frames; 3. M= order of mxures 4. D= number of dmensons; 5. For each frame 6. For each mxure M 7. For each feaure D 8. Calculae mnmum dsance 9. Calculae average of seleced feaure vecors belongng o he same cenrod. Alg. 4. k-means Pseudo-Code In parallel mplemenaon s advsed o gve more work load o each processor o maxmze he ulzaon. In he case of k-means cluserng, he major loop ha s parallelzable s loop whch ncludes nner loop M and D. he compuaon complexy of k-means cluserng s On n dk log. Snce k and d are fxed, he problem wll come down o n number of enes o be clusered [5]. In boh CPU and GPU approach IJECS-IJENS Augus 0 IJENS

5 Inernaonal Journal of Elecrcal & Compuer Scences IJECS-IJENS Vol: No: he mos ouer loop was parallelzed whch n hs case s, gven he fac ha he dsances can be calculaed concurrenly. For CUDA devce mplemenaon he calculaons were performed for each feaure vecor smulaneously. As a resul he complexy wll be O n dk log n where n n. Addonally, here are memory ransfers from and o devce. he order of CUDA acons for k-means algorhm s shown below: Alg. 5. k-means CUDA Noe ha all of -dmensonal jagged arrays are frs convered no -dmensonal arrays for devce memory allocaon. C. Expecaon Maxmzaon As explaned n secon II. C., he frs sep of expecaon maxmzaon s o nalze parameers mxure weghs (6), means (7) and varances (8). he compuaon complexy for background speaker model s drecly exraced from Alg. 6 as ON MD where N s he number of feaure vecors each havng dmenson of D and each speaker havng M mxures. In order o preven redundan calculaons, frs he common par of hose parameers s calculaed whch s:. Random nalzaon of mean vecors.. = number of frames; 3. M= order of mxures 4. D= number of dmensons; 5. ransfer nalzed random mean vecors o devce. 6. ransfer feaure vecors o devce. 7. Allocae label memory on devce. 8. Allocae mean average memory on devce. 9. Allocae dsance memory on devce. 0. Perform k-means on CUDA devce wh grd sze of /0+ and block sze of 0.. ransfer label memory back o hos.. ransfer mean-average memory back o hos. 3. Calculae average of seleced feaure vecors belongng o he same cenrod., ) (0) where s he number of mxures. Followng he (9), he denomnaor of (9) s calculaed, so laer can be appled for he poseror probably. hen s connued wh calculang poseror probably, sum of means and sum of varances. hen (6), (7) and (8) were appled o calculae he updaed values m, and. hs concludes one eraon of EM Algorhm. In hs mplemenaon he eraons are repeaed 50 mes. he pseudo-code for EM algorhm s llusraed n Alg. 6.. Inalze mean, varance and wegh values.. = Number of feaure vecors. 3. M = Order of mxures 4. D = Number of dmensons n a feaure vecor 5. For eraon beween 0 and 50 { 6. For k beween 0 and M 7. Calculae deermnan of sgma[k] 8. For beween 0 and 9. For k beween 0 and M 0. For l beween 0 and D. prepare sum.. calculae sum_p_denomnaor 3. For k beween 0 and M 4. For l beween 0 and D 5. Inalze Sum_mean and Sum_varance 6. For beween 0 and 7. Inalze Sum 8. For l beween 0 and D 9. Calculae Sum 0. Calculae Sum_p. For l beween 0 and D. Calculae Sum_mean 3. Calculae Sum_varance 4. For l beween 0 and D 5. Calculae Mean and Varance 6. Calculae k } Alg. 6. Expecaon Maxmzaon Pseudo-Code for CPU he menonable pons of parallelzaon n Alg. 6 s where loops of sze exss. Snce he convergence eraons are dependen and have o be execued sequenally, he mos domnan parallelzable loop s. In case of CPU parallelsm, a smlar mehod s appled as seen n Alg. o dsrbue he load. In a nushell he followng asks are performed n CPU Parallel mode:. Inalze mean, varance and wegh values.. = Number of feaure vecors. (around 350K) 3. M = Order of mxures (04) 4. D = Number of dmensons n a feaure vecor (6) 5. For eraon beween 0 and 50 { 6. Parallel For k beween 0 and M 7. Calculae deermnan sgma[k]; 8. Parallel For beween 0 and 9. Calculae sum_p_denomnaor; // Updang mean, wegh and varance 0. For k beween 0 and M. For l beween 0 and D. sum_mean, sum_var nalzaon 3. Parallel For beween 0 and 4. Calculae sum_mean and sum_var 5. For l beween 0 and D 6. Updae mean,varance 7. Updae wegh } Alg. 7. EM Pseudo-Code for Parallel CPU he D loops are lef sequenal because n he small loops he cos of creang mulple hreads s more han he calculaon cos n a sngle hread. On he oher hand he K loop s no parallelzed for he fac ha he nner loop s bg IJECS-IJENS Augus 0 IJENS

6 Inernaonal Journal of Elecrcal & Compuer Scences IJECS-IJENS Vol: No: enough o cover he cos of hread acvaon. In CUDA Parallel mode he procedures were slced no smaller groups o handle he number of hreads beer. Snce he CUDA code s SIM all hreads run dencal code only on dfferen pars of he memory. For example n he case of nalzng sum of means and sum of varances s only needed o have D hreads whle for updang hem hreads are acvaed. hs calls for ask separaon. I can be undersood from Alg. 7 ha for EM algorhm here are more memory ransfers/allocaons are nvolved. I s also known ha he CUDA code wll run on devce as kernels. Kernels are C exenson funcons of CUDA echnology. hey can be lunched from he hos code o run on many CUDA hreads. he mporance of grd sze and block sze wll be dscussed n secon V. he followng s he EM algorhm desgned o run on CUDA devce:. Inalze mean, varance and wegh values.. = Number of feaure vecors. (around 350K) 3. M = Order of mxures (04) 4. D = Number of dmensons n a feaure vecor (6) 5. Allocae memory on devce and ransfer he nalzed mean, varance and wegh. 6. ransfer feaure vecors o devce memory. 7. For eraon beween 0 and 50 { 8. Calculae Deermnan of Sgma on blocks of sze M/; 9. Calculae Sum p denomnaor on / 0 + blocks of sze 0; 0. For k beween 0 and M. Inalze SumMean and SumVarance on block of sze D;. Calculae SumMean and SumVarance on / 3 + blocks of sze 3; 3. Updae Mean & Varance on block of sze D; 4. Updae Wegh on block of sze ; } 5. ransfer mean, wegh and varance from devce o hos memory Alg. 8. EM Pseudo-Code for Parallel CUDA he complexy of hs EM algorhm s reduced o O MD. V. EXPERIMENS he parallel algorhms were mplemened usng CUDA verson 3.. he expermens were performed on a PC wh GX 85 and GX 570 GPUs and an Inel Core 7-90 CPU. he CPU has 4 cores (8 hyper-hreads) runnng a.66 GHz. he man memory s 3 GB (DDR3-600) wh he peak bandwdh of.8 GB/sec. he specfcaon of used GPUs can be seen n able I. ABLE I SPECIFICAIONS OF NVIDIA GPUS GPU Cores DRAM Processor Clock Memory Bandwdh GX GB.47 GHz 59 GB/s GX GB.46 GHz 5 GB/s he resuls of calculaons were presened whle omng he fle read/wre operaons o concenrae on algorhm s speedup. he resuls were performed on around daa sze of 350,000 feaure vecors belongng o 9 female speakers. he able II shows he acual mes aken o perform MFCC and k- means algorhm n seconds. ABLE II IME OF MFCC AND K-MEANS IN SECOND Algorhm CPU CPU CUDA CUDA Parallel (GX 85) (GX 570) MFCC k-means I can be observed ha he expermened MFCC algorhm has poor performance compared o radonal CPU algorhm. he reasons for hs are he followng: - he only parallelzed funcon whn MFCC was dscree flourer ransform algorhm. Alhough he complexy s reduced n heory, gven he sze of he problem for a sngle DF run, s no feasble o dsrbue he subproblem no oo many hreads. For hs expermen he sze of ouer loop was 40 and hence 40 hreads were acvaed, bu because of he low occupancy level he me needed o read/wre no devce global memory affeced he performance. - he oher reason s low hroughpu of memory ransfer beween hos and devce. he DF algorhm s called almos 350,000 mes and each me copes he wndow frames from hos o devce and copes he resuls of DF back o hos memory. In k-means algorhm a consderable performance mprovemen can be seen when daa-parallelsm s compared o sngle hreaded CPU. However he me aken on boh GPU archecures s he same. One reason for hs can be lack of use of shared memory o benef from he archecure specfc advanages. hs may also ndcae ha curren algorhm suffers from memory paron campng [6]. Global memory accesses go hrough parons. Successve 56-bye regons of global memory are assgned o successve parons. he problem of paron campng s when global memory accesses a an nsan use a subse of parons. hs may hde he rue poenal of speedup scalably across cores. For opmal performance GPU accesses should be dsrbued evenly among parons. he resuls of EM algorhm are measured n pars o show he execuon mes more clearly. able III shows he EM me n seconds. ABLE III IME OF EXPECAION MAXIMIZAION IN SECONDS Algorhm CPU CPU Parallel GX 85 GX 570 SUM_P Per EM Ieraon Updang Mean,Var & Wegh Per EM Ieraon EM oal per eraon Agan, faser calculaons on more cores and furher on many cores of GPUs can be seen. able IV shows he speedup of all algorhms on dfferen sysems. he speedups are IJECS-IJENS Augus 0 IJENS

7 Inernaonal Journal of Elecrcal & Compuer Scences IJECS-IJENS Vol: No: calculaed usng / where s he sngle hreaded CPU me and s he arge parallel model me. Addonally, Fg. 4 shows he graph correspondng o able IV. ABLE IV SPEEDUP OF MFCC, K-MEANS AND EM ALGORIHMS Sage CPU CPU Parallel GX 85 GX 570 MFCC k-means EM I s observed ha he ncrease n he number of cores wll decrease he calculaon me. However, he echnque of choosng he correc grd and block szes should be aken no consderaon. oo lle hreads n a block may race agans he occupancy facor of he acve blocks [0]. oo many hreads n a block may preven he use of shared memory snce here s a lm of 6KB on GX 85 and 48KB on GX 570 models per block. In hs sudy, a naïve GPU code ha uses he global memory was provded. he number of hreads used per CUDA block vares from o 3. ha s because curren mplemenaons performed beer n low block dmenson szes. Accordng o [] n some cases such as [3] a beer performance may be acheved n very low block szes. Fg. 4. Speedup scalably VI. CONCLUSION In hs paper, a CUDA based mplemenaon of DF, k- means and EM algorhms were nroduced. Also, he CPU parallelsm of he menoned algorhms for beer comparson was presened. he resuls demonsrae he advanage of parallelzaon, specfcally usng GPUs o speed up calculaon of k-means up o 30 mes and EM up o 7.9 mes of a sngle hreaded CPU. In he fuure, he effecs of usng shared memory on he performance wll be suded. Addonally, runnng he algorhms concurrenly on more han one GPU wll be presened o show how GPUs can benef from cross devce memory access [0]. Addonally, he advanage of concurren kernels and mul sreams wll be expermened. Furhermore, a comparson beween he performance hs of sngle-precson and double-precson floang pon calculaons wll be observed. REFERENCES [] N. M. Lard, and D.B. Rubn A. P. Dempser, "Maxmum Lkelhood from Incomplee Daa va EM Algorhm," Journal of he Royal Sascal Socey, vol. 39, pp. -38, 977. [] Green500. (0, June) Green500. [Onlne]. hp:// [3] Inel. (0, June) Prevous Generaon Inel Core 7 Processor. [Onlne]. hp:// x.hm [4] N. Kumar, S. Saoor, and I. Buck, "Fas Parallel Expecaon Maxmzaon for Gaussan Mxure Models on GPUs Usng CUDA," n Hgh Performance Compung and Communcaon, h IEEE Inernaonal Conference on, 009, pp [5] R. W. Schafer and L. R. abner, "Dgal Repesenaons of Speech Sgnals," IEEE Proceedngs, vol. 63, pp , Aprl 975. [6] Rchard O. Duda, Peer E. Har, and Davd G. Srok, Paern Classfcaon, nd ed.: John Wley and Sons, 00. [7] Wkpeda. (0, June) Expecaon-maxmzaon algorhm. [Onlne]. hp://en.wkpeda.org/wk/expecaon_maxmzaon_algorhm [8] D.A. Rose, R.C. Reynolds, "Robus ex-ndependen speaker denfcaon usng Gaussan mxure speaker models," Speech and Audo Processng, IEEE ransacons on, vol. 3, no., p. 7, Jan 995. [9] A. M. Kondoz, Dgal Speech: Codng for Low B Rae Communcaon Sysems, nd ed.: Wley, 004. [0] NVIDIA. (0, March) NVIDIA CUDA Programmng Gude - Verson 4.0. [Onlne]. hp://developer.nvda.com/cuda-oolk-40 [] Wkpeda. (0, June) CUDA. [Onlne]. hp://en.wkpeda.org/wk/cuda [] V. Volkov. (0, June) Beer Performance a Lower Occupancy, GC On-Demand: GC 00. [Onlne]. hp:// [3] Anas Mohd Nazlee and Noohul Basheer Zan Al Fawnzu Azmad Hussn, "ranformaon of CPU-based Applcaons o Leverage on Graphcs Processors usng CUDA," IJECS: Inernaonal Journal of Elecrcal & Compuer Scences, vol. 0, no., pp , February 00. [4] "IEEE Sandard for Floang-Pon Arhmec," IEEE Sd , pp. -58, Augus 008. [5] "Speech Codec Deecor Based Speaker Verfcaon Sysem n a Mul- Coder Envronmen," docoral dsseraon, Compuer Engneerng Deparmen, Easern Mederranean Unversy, Famagusa, Mersn 0 urkey, 004. C. Ergün,. [6] NVIDIA. (009, March) Opmzng CUDA, Ausralan Naonal Unversy: CUDA uoral. [Onlne]. hp://cs.anu.edu.au/fles/sysems/gpuwksp/pdfs/04_opmzngcud A_full.pdf IJECS-IJENS Augus 0 IJENS

Clustering (Bishop ch 9)

Clustering (Bishop ch 9) Cluserng (Bshop ch 9) Reference: Daa Mnng by Margare Dunham (a slde source) 1 Cluserng Cluserng s unsupervsed learnng, here are no class labels Wan o fnd groups of smlar nsances Ofen use a dsance measure

More information

Robustness Experiments with Two Variance Components

Robustness Experiments with Two Variance Components Naonal Insue of Sandards and Technology (NIST) Informaon Technology Laboraory (ITL) Sascal Engneerng Dvson (SED) Robusness Expermens wh Two Varance Componens by Ana Ivelsse Avlés avles@ns.gov Conference

More information

Outline. Probabilistic Model Learning. Probabilistic Model Learning. Probabilistic Model for Time-series Data: Hidden Markov Model

Outline. Probabilistic Model Learning. Probabilistic Model Learning. Probabilistic Model for Time-series Data: Hidden Markov Model Probablsc Model for Tme-seres Daa: Hdden Markov Model Hrosh Mamsuka Bonformacs Cener Kyoo Unversy Oulne Three Problems for probablsc models n machne learnng. Compung lkelhood 2. Learnng 3. Parsng (predcon

More information

Variants of Pegasos. December 11, 2009

Variants of Pegasos. December 11, 2009 Inroducon Varans of Pegasos SooWoong Ryu bshboy@sanford.edu December, 009 Youngsoo Cho yc344@sanford.edu Developng a new SVM algorhm s ongong research opc. Among many exng SVM algorhms, we wll focus on

More information

CHAPTER 10: LINEAR DISCRIMINATION

CHAPTER 10: LINEAR DISCRIMINATION CHAPER : LINEAR DISCRIMINAION Dscrmnan-based Classfcaon 3 In classfcaon h K classes (C,C,, C k ) We defned dscrmnan funcon g j (), j=,,,k hen gven an es eample, e chose (predced) s class label as C f g

More information

Math 128b Project. Jude Yuen

Math 128b Project. Jude Yuen Mah 8b Proec Jude Yuen . Inroducon Le { Z } be a sequence of observed ndependen vecor varables. If he elemens of Z have a on normal dsrbuon hen { Z } has a mean vecor Z and a varancecovarance marx z. Geomercally

More information

Cubic Bezier Homotopy Function for Solving Exponential Equations

Cubic Bezier Homotopy Function for Solving Exponential Equations Penerb Journal of Advanced Research n Compung and Applcaons ISSN (onlne: 46-97 Vol. 4, No.. Pages -8, 6 omoopy Funcon for Solvng Eponenal Equaons S. S. Raml *,,. Mohamad Nor,a, N. S. Saharzan,b and M.

More information

Dynamic Team Decision Theory. EECS 558 Project Shrutivandana Sharma and David Shuman December 10, 2005

Dynamic Team Decision Theory. EECS 558 Project Shrutivandana Sharma and David Shuman December 10, 2005 Dynamc Team Decson Theory EECS 558 Proec Shruvandana Sharma and Davd Shuman December 0, 005 Oulne Inroducon o Team Decson Theory Decomposon of he Dynamc Team Decson Problem Equvalence of Sac and Dynamc

More information

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 4

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 4 CS434a/54a: Paern Recognon Prof. Olga Veksler Lecure 4 Oulne Normal Random Varable Properes Dscrmnan funcons Why Normal Random Varables? Analycally racable Works well when observaon comes form a corruped

More information

Single-loop System Reliability-Based Design & Topology Optimization (SRBDO/SRBTO): A Matrix-based System Reliability (MSR) Method

Single-loop System Reliability-Based Design & Topology Optimization (SRBDO/SRBTO): A Matrix-based System Reliability (MSR) Method 10 h US Naonal Congress on Compuaonal Mechancs Columbus, Oho 16-19, 2009 Sngle-loop Sysem Relably-Based Desgn & Topology Opmzaon (SRBDO/SRBTO): A Marx-based Sysem Relably (MSR) Mehod Tam Nguyen, Junho

More information

WiH Wei He

WiH Wei He Sysem Idenfcaon of onlnear Sae-Space Space Baery odels WH We He wehe@calce.umd.edu Advsor: Dr. Chaochao Chen Deparmen of echancal Engneerng Unversy of aryland, College Par 1 Unversy of aryland Bacground

More information

Fall 2010 Graduate Course on Dynamic Learning

Fall 2010 Graduate Course on Dynamic Learning Fall 200 Graduae Course on Dynamc Learnng Chaper 4: Parcle Flers Sepember 27, 200 Byoung-Tak Zhang School of Compuer Scence and Engneerng & Cognve Scence and Bran Scence Programs Seoul aonal Unversy hp://b.snu.ac.kr/~bzhang/

More information

V.Abramov - FURTHER ANALYSIS OF CONFIDENCE INTERVALS FOR LARGE CLIENT/SERVER COMPUTER NETWORKS

V.Abramov - FURTHER ANALYSIS OF CONFIDENCE INTERVALS FOR LARGE CLIENT/SERVER COMPUTER NETWORKS R&RATA # Vol.) 8, March FURTHER AALYSIS OF COFIDECE ITERVALS FOR LARGE CLIET/SERVER COMPUTER ETWORKS Vyacheslav Abramov School of Mahemacal Scences, Monash Unversy, Buldng 8, Level 4, Clayon Campus, Wellngon

More information

A New Method for Computing EM Algorithm Parameters in Speaker Identification Using Gaussian Mixture Models

A New Method for Computing EM Algorithm Parameters in Speaker Identification Using Gaussian Mixture Models 0 IACSI Hong Kong Conferences IPCSI vol. 9 (0) (0) IACSI Press, Sngaore A New ehod for Comung E Algorhm Parameers n Seaker Idenfcaon Usng Gaussan xure odels ohsen Bazyar +, Ahmad Keshavarz, and Khaoon

More information

Boosted LMS-based Piecewise Linear Adaptive Filters

Boosted LMS-based Piecewise Linear Adaptive Filters 016 4h European Sgnal Processng Conference EUSIPCO) Boosed LMS-based Pecewse Lnear Adapve Flers Darush Kar and Iman Marvan Deparmen of Elecrcal and Elecroncs Engneerng Blken Unversy, Ankara, Turkey {kar,

More information

Robust and Accurate Cancer Classification with Gene Expression Profiling

Robust and Accurate Cancer Classification with Gene Expression Profiling Robus and Accurae Cancer Classfcaon wh Gene Expresson Proflng (Compuaonal ysems Bology, 2005) Auhor: Hafeng L, Keshu Zhang, ao Jang Oulne Background LDA (lnear dscrmnan analyss) and small sample sze problem

More information

( ) [ ] MAP Decision Rule

( ) [ ] MAP Decision Rule Announcemens Bayes Decson Theory wh Normal Dsrbuons HW0 due oday HW o be assgned soon Proec descrpon posed Bomercs CSE 90 Lecure 4 CSE90, Sprng 04 CSE90, Sprng 04 Key Probables 4 ω class label X feaure

More information

FTCS Solution to the Heat Equation

FTCS Solution to the Heat Equation FTCS Soluon o he Hea Equaon ME 448/548 Noes Gerald Reckenwald Porland Sae Unversy Deparmen of Mechancal Engneerng gerry@pdxedu ME 448/548: FTCS Soluon o he Hea Equaon Overvew Use he forward fne d erence

More information

Solution in semi infinite diffusion couples (error function analysis)

Solution in semi infinite diffusion couples (error function analysis) Soluon n sem nfne dffuson couples (error funcon analyss) Le us consder now he sem nfne dffuson couple of wo blocks wh concenraon of and I means ha, n a A- bnary sysem, s bondng beween wo blocks made of

More information

On One Analytic Method of. Constructing Program Controls

On One Analytic Method of. Constructing Program Controls Appled Mahemacal Scences, Vol. 9, 05, no. 8, 409-407 HIKARI Ld, www.m-hkar.com hp://dx.do.org/0.988/ams.05.54349 On One Analyc Mehod of Consrucng Program Conrols A. N. Kvko, S. V. Chsyakov and Yu. E. Balyna

More information

Department of Economics University of Toronto

Department of Economics University of Toronto Deparmen of Economcs Unversy of Torono ECO408F M.A. Economercs Lecure Noes on Heeroskedascy Heeroskedascy o Ths lecure nvolves lookng a modfcaons we need o make o deal wh he regresson model when some of

More information

In the complete model, these slopes are ANALYSIS OF VARIANCE FOR THE COMPLETE TWO-WAY MODEL. (! i+1 -! i ) + [(!") i+1,q - [(!

In the complete model, these slopes are ANALYSIS OF VARIANCE FOR THE COMPLETE TWO-WAY MODEL. (! i+1 -! i ) + [(!) i+1,q - [(! ANALYSIS OF VARIANCE FOR THE COMPLETE TWO-WAY MODEL The frs hng o es n wo-way ANOVA: Is here neracon? "No neracon" means: The man effecs model would f. Ths n urn means: In he neracon plo (wh A on he horzonal

More information

Lecture 6: Learning for Control (Generalised Linear Regression)

Lecture 6: Learning for Control (Generalised Linear Regression) Lecure 6: Learnng for Conrol (Generalsed Lnear Regresson) Conens: Lnear Mehods for Regresson Leas Squares, Gauss Markov heorem Recursve Leas Squares Lecure 6: RLSC - Prof. Sehu Vjayakumar Lnear Regresson

More information

John Geweke a and Gianni Amisano b a Departments of Economics and Statistics, University of Iowa, USA b European Central Bank, Frankfurt, Germany

John Geweke a and Gianni Amisano b a Departments of Economics and Statistics, University of Iowa, USA b European Central Bank, Frankfurt, Germany Herarchcal Markov Normal Mxure models wh Applcaons o Fnancal Asse Reurns Appendx: Proofs of Theorems and Condonal Poseror Dsrbuons John Geweke a and Gann Amsano b a Deparmens of Economcs and Sascs, Unversy

More information

ECE 366 Honors Section Fall 2009 Project Description

ECE 366 Honors Section Fall 2009 Project Description ECE 366 Honors Secon Fall 2009 Projec Descrpon Inroducon: Muscal genres are caegorcal labels creaed by humans o characerze dfferen ypes of musc. A muscal genre s characerzed by he common characerscs shared

More information

Econ107 Applied Econometrics Topic 5: Specification: Choosing Independent Variables (Studenmund, Chapter 6)

Econ107 Applied Econometrics Topic 5: Specification: Choosing Independent Variables (Studenmund, Chapter 6) Econ7 Appled Economercs Topc 5: Specfcaon: Choosng Independen Varables (Sudenmund, Chaper 6 Specfcaon errors ha we wll deal wh: wrong ndependen varable; wrong funconal form. Ths lecure deals wh wrong ndependen

More information

Lecture VI Regression

Lecture VI Regression Lecure VI Regresson (Lnear Mehods for Regresson) Conens: Lnear Mehods for Regresson Leas Squares, Gauss Markov heorem Recursve Leas Squares Lecure VI: MLSC - Dr. Sehu Vjayakumar Lnear Regresson Model M

More information

TSS = SST + SSE An orthogonal partition of the total SS

TSS = SST + SSE An orthogonal partition of the total SS ANOVA: Topc 4. Orhogonal conrass [ST&D p. 183] H 0 : µ 1 = µ =... = µ H 1 : The mean of a leas one reamen group s dfferen To es hs hypohess, a basc ANOVA allocaes he varaon among reamen means (SST) equally

More information

Advanced Machine Learning & Perception

Advanced Machine Learning & Perception Advanced Machne Learnng & Percepon Insrucor: Tony Jebara SVM Feaure & Kernel Selecon SVM Eensons Feaure Selecon (Flerng and Wrappng) SVM Feaure Selecon SVM Kernel Selecon SVM Eensons Classfcaon Feaure/Kernel

More information

Computing Relevance, Similarity: The Vector Space Model

Computing Relevance, Similarity: The Vector Space Model Compung Relevance, Smlary: The Vecor Space Model Based on Larson and Hears s sldes a UC-Bereley hp://.sms.bereley.edu/courses/s0/f00/ aabase Managemen Sysems, R. Ramarshnan ocumen Vecors v ocumens are

More information

CS 536: Machine Learning. Nonparametric Density Estimation Unsupervised Learning - Clustering

CS 536: Machine Learning. Nonparametric Density Estimation Unsupervised Learning - Clustering CS 536: Machne Learnng Nonparamerc Densy Esmaon Unsupervsed Learnng - Cluserng Fall 2005 Ahmed Elgammal Dep of Compuer Scence Rugers Unversy CS 536 Densy Esmaon - Cluserng - 1 Oulnes Densy esmaon Nonparamerc

More information

Hidden Markov Models

Hidden Markov Models 11-755 Machne Learnng for Sgnal Processng Hdden Markov Models Class 15. 12 Oc 2010 1 Admnsrva HW2 due Tuesday Is everyone on he projecs page? Where are your projec proposals? 2 Recap: Wha s an HMM Probablsc

More information

Phone Recognition on TIMIT Database Using Mel-Frequency Cepstral Coefficients and Support Vector Machine Classifier

Phone Recognition on TIMIT Database Using Mel-Frequency Cepstral Coefficients and Support Vector Machine Classifier IJISET - Inernaonal Journal of Innovave Scence, Engneerng & Technology, Vol. Issue, January 05. www.jse.com Phone Recognon on TIMIT Daabase Usng Mel-Frequency Cepsral Coeffcens and Suppor Vecor Machne

More information

Machine Learning Linear Regression

Machine Learning Linear Regression Machne Learnng Lnear Regresson Lesson 3 Lnear Regresson Bascs of Regresson Leas Squares esmaon Polynomal Regresson Bass funcons Regresson model Regularzed Regresson Sascal Regresson Mamum Lkelhood (ML)

More information

Linear Response Theory: The connection between QFT and experiments

Linear Response Theory: The connection between QFT and experiments Phys540.nb 39 3 Lnear Response Theory: The connecon beween QFT and expermens 3.1. Basc conceps and deas Q: ow do we measure he conducvy of a meal? A: we frs nroduce a weak elecrc feld E, and hen measure

More information

Bayes rule for a classification problem INF Discriminant functions for the normal density. Euclidean distance. Mahalanobis distance

Bayes rule for a classification problem INF Discriminant functions for the normal density. Euclidean distance. Mahalanobis distance INF 43 3.. Repeon Anne Solberg (anne@f.uo.no Bayes rule for a classfcaon problem Suppose we have J, =,...J classes. s he class label for a pxel, and x s he observed feaure vecor. We can use Bayes rule

More information

Lecture 11 SVM cont

Lecture 11 SVM cont Lecure SVM con. 0 008 Wha we have done so far We have esalshed ha we wan o fnd a lnear decson oundary whose margn s he larges We know how o measure he margn of a lnear decson oundary Tha s: he mnmum geomerc

More information

CS 268: Packet Scheduling

CS 268: Packet Scheduling Pace Schedulng Decde when and wha pace o send on oupu ln - Usually mplemened a oupu nerface CS 68: Pace Schedulng flow Ion Soca March 9, 004 Classfer flow flow n Buffer managemen Scheduler soca@cs.bereley.edu

More information

HEAT CONDUCTION PROBLEM IN A TWO-LAYERED HOLLOW CYLINDER BY USING THE GREEN S FUNCTION METHOD

HEAT CONDUCTION PROBLEM IN A TWO-LAYERED HOLLOW CYLINDER BY USING THE GREEN S FUNCTION METHOD Journal of Appled Mahemacs and Compuaonal Mechancs 3, (), 45-5 HEAT CONDUCTION PROBLEM IN A TWO-LAYERED HOLLOW CYLINDER BY USING THE GREEN S FUNCTION METHOD Sansław Kukla, Urszula Sedlecka Insue of Mahemacs,

More information

EEL 6266 Power System Operation and Control. Chapter 5 Unit Commitment

EEL 6266 Power System Operation and Control. Chapter 5 Unit Commitment EEL 6266 Power Sysem Operaon and Conrol Chaper 5 Un Commmen Dynamc programmng chef advanage over enumeraon schemes s he reducon n he dmensonaly of he problem n a src prory order scheme, here are only N

More information

Chapter 6 DETECTION AND ESTIMATION: Model of digital communication system. Fundamental issues in digital communications are

Chapter 6 DETECTION AND ESTIMATION: Model of digital communication system. Fundamental issues in digital communications are Chaper 6 DEECIO AD EIMAIO: Fundamenal ssues n dgal communcaons are. Deecon and. Esmaon Deecon heory: I deals wh he desgn and evaluaon of decson makng processor ha observes he receved sgnal and guesses

More information

GENERATING CERTAIN QUINTIC IRREDUCIBLE POLYNOMIALS OVER FINITE FIELDS. Youngwoo Ahn and Kitae Kim

GENERATING CERTAIN QUINTIC IRREDUCIBLE POLYNOMIALS OVER FINITE FIELDS. Youngwoo Ahn and Kitae Kim Korean J. Mah. 19 (2011), No. 3, pp. 263 272 GENERATING CERTAIN QUINTIC IRREDUCIBLE POLYNOMIALS OVER FINITE FIELDS Youngwoo Ahn and Kae Km Absrac. In he paper [1], an explc correspondence beween ceran

More information

. The geometric multiplicity is dim[ker( λi. number of linearly independent eigenvectors associated with this eigenvalue.

. The geometric multiplicity is dim[ker( λi. number of linearly independent eigenvectors associated with this eigenvalue. Lnear Algebra Lecure # Noes We connue wh he dscusson of egenvalues, egenvecors, and dagonalzably of marces We wan o know, n parcular wha condons wll assure ha a marx can be dagonalzed and wha he obsrucons

More information

Dual Approximate Dynamic Programming for Large Scale Hydro Valleys

Dual Approximate Dynamic Programming for Large Scale Hydro Valleys Dual Approxmae Dynamc Programmng for Large Scale Hydro Valleys Perre Carpener and Jean-Phlppe Chanceler 1 ENSTA ParsTech and ENPC ParsTech CMM Workshop, January 2016 1 Jon work wh J.-C. Alas, suppored

More information

Performance Analysis for a Network having Standby Redundant Unit with Waiting in Repair

Performance Analysis for a Network having Standby Redundant Unit with Waiting in Repair TECHNI Inernaonal Journal of Compung Scence Communcaon Technologes VOL.5 NO. July 22 (ISSN 974-3375 erformance nalyss for a Nework havng Sby edundan Un wh ang n epar Jendra Sngh 2 abns orwal 2 Deparmen

More information

Approximate Analytic Solution of (2+1) - Dimensional Zakharov-Kuznetsov(Zk) Equations Using Homotopy

Approximate Analytic Solution of (2+1) - Dimensional Zakharov-Kuznetsov(Zk) Equations Using Homotopy Arcle Inernaonal Journal of Modern Mahemacal Scences, 4, (): - Inernaonal Journal of Modern Mahemacal Scences Journal homepage: www.modernscenfcpress.com/journals/jmms.aspx ISSN: 66-86X Florda, USA Approxmae

More information

( ) () we define the interaction representation by the unitary transformation () = ()

( ) () we define the interaction representation by the unitary transformation () = () Hgher Order Perurbaon Theory Mchael Fowler 3/7/6 The neracon Represenaon Recall ha n he frs par of hs course sequence, we dscussed he chrödnger and Hesenberg represenaons of quanum mechancs here n he chrödnger

More information

Time-interval analysis of β decay. V. Horvat and J. C. Hardy

Time-interval analysis of β decay. V. Horvat and J. C. Hardy Tme-nerval analyss of β decay V. Horva and J. C. Hardy Work on he even analyss of β decay [1] connued and resuled n he developmen of a novel mehod of bea-decay me-nerval analyss ha produces hghly accurae

More information

New M-Estimator Objective Function. in Simultaneous Equations Model. (A Comparative Study)

New M-Estimator Objective Function. in Simultaneous Equations Model. (A Comparative Study) Inernaonal Mahemacal Forum, Vol. 8, 3, no., 7 - HIKARI Ld, www.m-hkar.com hp://dx.do.org/.988/mf.3.3488 New M-Esmaor Objecve Funcon n Smulaneous Equaons Model (A Comparave Sudy) Ahmed H. Youssef Professor

More information

Introduction ( Week 1-2) Course introduction A brief introduction to molecular biology A brief introduction to sequence comparison Part I: Algorithms

Introduction ( Week 1-2) Course introduction A brief introduction to molecular biology A brief introduction to sequence comparison Part I: Algorithms Course organzaon Inroducon Wee -2) Course nroducon A bref nroducon o molecular bology A bref nroducon o sequence comparson Par I: Algorhms for Sequence Analyss Wee 3-8) Chaper -3, Models and heores» Probably

More information

. The geometric multiplicity is dim[ker( λi. A )], i.e. the number of linearly independent eigenvectors associated with this eigenvalue.

. The geometric multiplicity is dim[ker( λi. A )], i.e. the number of linearly independent eigenvectors associated with this eigenvalue. Mah E-b Lecure #0 Noes We connue wh he dscusson of egenvalues, egenvecors, and dagonalzably of marces We wan o know, n parcular wha condons wll assure ha a marx can be dagonalzed and wha he obsrucons are

More information

An introduction to Support Vector Machine

An introduction to Support Vector Machine An nroducon o Suppor Vecor Machne 報告者 : 黃立德 References: Smon Haykn, "Neural Neworks: a comprehensve foundaon, second edon, 999, Chaper 2,6 Nello Chrsann, John Shawe-Tayer, An Inroducon o Suppor Vecor Machnes,

More information

Efficient Asynchronous Channel Hopping Design for Cognitive Radio Networks

Efficient Asynchronous Channel Hopping Design for Cognitive Radio Networks Effcen Asynchronous Channel Hoppng Desgn for Cognve Rado Neworks Chh-Mn Chao, Chen-Yu Hsu, and Yun-ng Lng Absrac In a cognve rado nework (CRN), a necessary condon for nodes o communcae wh each oher s ha

More information

January Examinations 2012

January Examinations 2012 Page of 5 EC79 January Examnaons No. of Pages: 5 No. of Quesons: 8 Subjec ECONOMICS (POSTGRADUATE) Tle of Paper EC79 QUANTITATIVE METHODS FOR BUSINESS AND FINANCE Tme Allowed Two Hours ( hours) Insrucons

More information

Volatility Interpolation

Volatility Interpolation Volaly Inerpolaon Prelmnary Verson March 00 Jesper Andreasen and Bran Huge Danse Mares, Copenhagen wan.daddy@danseban.com brno@danseban.com Elecronc copy avalable a: hp://ssrn.com/absrac=69497 Inro Local

More information

UNIVERSITAT AUTÒNOMA DE BARCELONA MARCH 2017 EXAMINATION

UNIVERSITAT AUTÒNOMA DE BARCELONA MARCH 2017 EXAMINATION INTERNATIONAL TRADE T. J. KEHOE UNIVERSITAT AUTÒNOMA DE BARCELONA MARCH 27 EXAMINATION Please answer wo of he hree quesons. You can consul class noes, workng papers, and arcles whle you are workng on he

More information

Detection of Waving Hands from Images Using Time Series of Intensity Values

Detection of Waving Hands from Images Using Time Series of Intensity Values Deecon of Wavng Hands from Images Usng Tme eres of Inensy Values Koa IRIE, Kazunor UMEDA Chuo Unversy, Tokyo, Japan re@sensor.mech.chuo-u.ac.jp, umeda@mech.chuo-u.ac.jp Absrac Ths paper proposes a mehod

More information

Learning Objectives. Self Organization Map. Hamming Distance(1/5) Introduction. Hamming Distance(3/5) Hamming Distance(2/5) 15/04/2015

Learning Objectives. Self Organization Map. Hamming Distance(1/5) Introduction. Hamming Distance(3/5) Hamming Distance(2/5) 15/04/2015 /4/ Learnng Objecves Self Organzaon Map Learnng whou Exaples. Inroducon. MAXNET 3. Cluserng 4. Feaure Map. Self-organzng Feaure Map 6. Concluson 38 Inroducon. Learnng whou exaples. Daa are npu o he syse

More information

Including the ordinary differential of distance with time as velocity makes a system of ordinary differential equations.

Including the ordinary differential of distance with time as velocity makes a system of ordinary differential equations. Soluons o Ordnary Derenal Equaons An ordnary derenal equaon has only one ndependen varable. A sysem o ordnary derenal equaons consss o several derenal equaons each wh he same ndependen varable. An eample

More information

DEEP UNFOLDING FOR MULTICHANNEL SOURCE SEPARATION SUPPLEMENTARY MATERIAL

DEEP UNFOLDING FOR MULTICHANNEL SOURCE SEPARATION SUPPLEMENTARY MATERIAL DEEP UNFOLDING FOR MULTICHANNEL SOURCE SEPARATION SUPPLEMENTARY MATERIAL Sco Wsdom, John Hershey 2, Jonahan Le Roux 2, and Shnj Waanabe 2 Deparmen o Elecrcal Engneerng, Unversy o Washngon, Seale, WA, USA

More information

This document is downloaded from DR-NTU, Nanyang Technological University Library, Singapore.

This document is downloaded from DR-NTU, Nanyang Technological University Library, Singapore. Ths documen s downloaded from DR-NTU, Nanyang Technologcal Unversy Lbrary, Sngapore. Tle A smplfed verb machng algorhm for word paron n vsual speech processng( Acceped verson ) Auhor(s) Foo, Say We; Yong,

More information

Li An-Ping. Beijing , P.R.China

Li An-Ping. Beijing , P.R.China A New Type of Cpher: DICING_csb L An-Png Bejng 100085, P.R.Chna apl0001@sna.com Absrac: In hs paper, we wll propose a new ype of cpher named DICING_csb, whch s derved from our prevous sream cpher DICING.

More information

Kernel-Based Bayesian Filtering for Object Tracking

Kernel-Based Bayesian Filtering for Object Tracking Kernel-Based Bayesan Flerng for Objec Trackng Bohyung Han Yng Zhu Dorn Comancu Larry Davs Dep. of Compuer Scence Real-Tme Vson and Modelng Inegraed Daa and Sysems Unversy of Maryland Semens Corporae Research

More information

5th International Conference on Advanced Design and Manufacturing Engineering (ICADME 2015)

5th International Conference on Advanced Design and Manufacturing Engineering (ICADME 2015) 5h Inernaonal onference on Advanced Desgn and Manufacurng Engneerng (IADME 5 The Falure Rae Expermenal Sudy of Specal N Machne Tool hunshan He, a, *, La Pan,b and Bng Hu 3,c,,3 ollege of Mechancal and

More information

Sampling Procedure of the Sum of two Binary Markov Process Realizations

Sampling Procedure of the Sum of two Binary Markov Process Realizations Samplng Procedure of he Sum of wo Bnary Markov Process Realzaons YURY GORITSKIY Dep. of Mahemacal Modelng of Moscow Power Insue (Techncal Unversy), Moscow, RUSSIA, E-mal: gorsky@yandex.ru VLADIMIR KAZAKOV

More information

Bernoulli process with 282 ky periodicity is detected in the R-N reversals of the earth s magnetic field

Bernoulli process with 282 ky periodicity is detected in the R-N reversals of the earth s magnetic field Submed o: Suden Essay Awards n Magnecs Bernoull process wh 8 ky perodcy s deeced n he R-N reversals of he earh s magnec feld Jozsef Gara Deparmen of Earh Scences Florda Inernaonal Unversy Unversy Park,

More information

Using Fuzzy Pattern Recognition to Detect Unknown Malicious Executables Code

Using Fuzzy Pattern Recognition to Detect Unknown Malicious Executables Code Usng Fuzzy Paern Recognon o Deec Unknown Malcous Execuables Code Boyun Zhang,, Janpng Yn, and Jngbo Hao School of Compuer Scence, Naonal Unversy of Defense Technology, Changsha 40073, Chna hnxzby@yahoo.com.cn

More information

Online Supplement for Dynamic Multi-Technology. Production-Inventory Problem with Emissions Trading

Online Supplement for Dynamic Multi-Technology. Production-Inventory Problem with Emissions Trading Onlne Supplemen for Dynamc Mul-Technology Producon-Invenory Problem wh Emssons Tradng by We Zhang Zhongsheng Hua Yu Xa and Baofeng Huo Proof of Lemma For any ( qr ) Θ s easy o verfy ha he lnear programmng

More information

Attribute Reduction Algorithm Based on Discernibility Matrix with Algebraic Method GAO Jing1,a, Ma Hui1, Han Zhidong2,b

Attribute Reduction Algorithm Based on Discernibility Matrix with Algebraic Method GAO Jing1,a, Ma Hui1, Han Zhidong2,b Inernaonal Indusral Informacs and Compuer Engneerng Conference (IIICEC 05) Arbue educon Algorhm Based on Dscernbly Marx wh Algebrac Mehod GAO Jng,a, Ma Hu, Han Zhdong,b Informaon School, Capal Unversy

More information

Chapter 6: AC Circuits

Chapter 6: AC Circuits Chaper 6: AC Crcus Chaper 6: Oulne Phasors and he AC Seady Sae AC Crcus A sable, lnear crcu operang n he seady sae wh snusodal excaon (.e., snusodal seady sae. Complee response forced response naural response.

More information

Joint Channel Estimation and Resource Allocation for MIMO Systems Part I: Single-User Analysis

Joint Channel Estimation and Resource Allocation for MIMO Systems Part I: Single-User Analysis 624 IEEE RANSACIONS ON WIRELESS COUNICAIONS, VOL. 9, NO. 2, FEBRUARY 200 Jon Channel Esmaon and Resource Allocaon for IO Sysems Par I: Sngle-User Analyss Alkan Soysal, ember, IEEE, and Sennur Ulukus, ember,

More information

Appendix to Online Clustering with Experts

Appendix to Online Clustering with Experts A Appendx o Onlne Cluserng wh Expers Furher dscusson of expermens. Here we furher dscuss expermenal resuls repored n he paper. Ineresngly, we observe ha OCE (and n parcular Learn- ) racks he bes exper

More information

[ ] 2. [ ]3 + (Δx i + Δx i 1 ) / 2. Δx i-1 Δx i Δx i+1. TPG4160 Reservoir Simulation 2018 Lecture note 3. page 1 of 5

[ ] 2. [ ]3 + (Δx i + Δx i 1 ) / 2. Δx i-1 Δx i Δx i+1. TPG4160 Reservoir Simulation 2018 Lecture note 3. page 1 of 5 TPG460 Reservor Smulaon 08 page of 5 DISCRETIZATIO OF THE FOW EQUATIOS As we already have seen, fne dfference appromaons of he paral dervaves appearng n he flow equaons may be obaned from Taylor seres

More information

Dynamic Team Decision Theory

Dynamic Team Decision Theory Dynamc Team Decson Theory EECS 558 Proec Repor Shruvandana Sharma and Davd Shuman December, 005 I. Inroducon Whle he sochasc conrol problem feaures one decson maker acng over me, many complex conrolled

More information

Notes on the stability of dynamic systems and the use of Eigen Values.

Notes on the stability of dynamic systems and the use of Eigen Values. Noes on he sabl of dnamc ssems and he use of Egen Values. Source: Macro II course noes, Dr. Davd Bessler s Tme Seres course noes, zarads (999) Ineremporal Macroeconomcs chaper 4 & Techncal ppend, and Hamlon

More information

Machine Learning 2nd Edition

Machine Learning 2nd Edition INTRODUCTION TO Lecure Sldes for Machne Learnng nd Edon ETHEM ALPAYDIN, modfed by Leonardo Bobadlla and some pars from hp://www.cs.au.ac.l/~aparzn/machnelearnng/ The MIT Press, 00 alpaydn@boun.edu.r hp://www.cmpe.boun.edu.r/~ehem/mle

More information

Hidden Markov Models Following a lecture by Andrew W. Moore Carnegie Mellon University

Hidden Markov Models Following a lecture by Andrew W. Moore Carnegie Mellon University Hdden Markov Models Followng a lecure by Andrew W. Moore Carnege Mellon Unversy www.cs.cmu.edu/~awm/uorals A Markov Sysem Has N saes, called s, s 2.. s N s 2 There are dscree meseps, 0,, s s 3 N 3 0 Hdden

More information

A Novel Efficient Stopping Criterion for BICM-ID System

A Novel Efficient Stopping Criterion for BICM-ID System A Novel Effcen Soppng Creron for BICM-ID Sysem Xao Yng, L Janpng Communcaon Unversy of Chna Absrac Ths paper devses a novel effcen soppng creron for b-nerleaved coded modulaon wh erave decodng (BICM-ID)

More information

CS286.2 Lecture 14: Quantum de Finetti Theorems II

CS286.2 Lecture 14: Quantum de Finetti Theorems II CS286.2 Lecure 14: Quanum de Fne Theorems II Scrbe: Mara Okounkova 1 Saemen of he heorem Recall he las saemen of he quanum de Fne heorem from he prevous lecure. Theorem 1 Quanum de Fne). Le ρ Dens C 2

More information

The Analysis of the Thickness-predictive Model Based on the SVM Xiu-ming Zhao1,a,Yan Wang2,band Zhimin Bi3,c

The Analysis of the Thickness-predictive Model Based on the SVM Xiu-ming Zhao1,a,Yan Wang2,band Zhimin Bi3,c h Naonal Conference on Elecrcal, Elecroncs and Compuer Engneerng (NCEECE The Analyss of he Thcknesspredcve Model Based on he SVM Xumng Zhao,a,Yan Wang,band Zhmn B,c School of Conrol Scence and Engneerng,

More information

Neural Networks-Based Time Series Prediction Using Long and Short Term Dependence in the Learning Process

Neural Networks-Based Time Series Prediction Using Long and Short Term Dependence in the Learning Process Neural Neworks-Based Tme Seres Predcon Usng Long and Shor Term Dependence n he Learnng Process J. Puchea, D. Paño and B. Kuchen, Absrac In hs work a feedforward neural neworksbased nonlnear auoregresson

More information

Comb Filters. Comb Filters

Comb Filters. Comb Filters The smple flers dscussed so far are characered eher by a sngle passband and/or a sngle sopband There are applcaons where flers wh mulple passbands and sopbands are requred Thecomb fler s an example of

More information

Chapter Lagrangian Interpolation

Chapter Lagrangian Interpolation Chaper 5.4 agrangan Inerpolaon Afer readng hs chaper you should be able o:. dere agrangan mehod of nerpolaon. sole problems usng agrangan mehod of nerpolaon and. use agrangan nerpolans o fnd deraes and

More information

Let s treat the problem of the response of a system to an applied external force. Again,

Let s treat the problem of the response of a system to an applied external force. Again, Page 33 QUANTUM LNEAR RESPONSE FUNCTON Le s rea he problem of he response of a sysem o an appled exernal force. Agan, H() H f () A H + V () Exernal agen acng on nernal varable Hamlonan for equlbrum sysem

More information

Existence and Uniqueness Results for Random Impulsive Integro-Differential Equation

Existence and Uniqueness Results for Random Impulsive Integro-Differential Equation Global Journal of Pure and Appled Mahemacs. ISSN 973-768 Volume 4, Number 6 (8), pp. 89-87 Research Inda Publcaons hp://www.rpublcaon.com Exsence and Unqueness Resuls for Random Impulsve Inegro-Dfferenal

More information

Predicting and Preventing Emerging Outbreaks of Crime

Predicting and Preventing Emerging Outbreaks of Crime Predcng and Prevenng Emergng Oubreaks of Crme Danel B. Nell Even and Paern Deecon Laboraory H.J. Henz III College, Carnege Mellon Unversy nell@cs.cmu.edu Jon work wh Seh Flaxman, Amru Nagasunder, Wl Gorr

More information

doi: info:doi/ /

doi: info:doi/ / do: nfo:do/0.063/.322393 nernaonal Conference on Power Conrol and Opmzaon, Bal, ndonesa, -3, June 2009 A COLOR FEATURES-BASED METHOD FOR OBJECT TRACKNG EMPLOYNG A PARTCLE FLTER ALGORTHM Bud Sugand, Hyoungseop

More information

Chapter 4. Neural Networks Based on Competition

Chapter 4. Neural Networks Based on Competition Chaper 4. Neural Neworks Based on Compeon Compeon s mporan for NN Compeon beween neurons has been observed n bologcal nerve sysems Compeon s mporan n solvng many problems To classfy an npu paern _1 no

More information

GPU Implementation of Parallel Support Vector Machine Algorithm with Applications to Intruder Detection

GPU Implementation of Parallel Support Vector Machine Algorithm with Applications to Intruder Detection JOURNAL OF COMPUTERS, VOL. 9, NO. 5, MAY 204 7 GPU Implemenaon of Parallel Suppor Vecor Machne Algorhm wh Applcaons o Inruder Deecon Xueqn Zhang, Yfeng Zhang Eas Chna Unversy of Scence and Technology,

More information

( t) Outline of program: BGC1: Survival and event history analysis Oslo, March-May Recapitulation. The additive regression model

( t) Outline of program: BGC1: Survival and event history analysis Oslo, March-May Recapitulation. The additive regression model BGC1: Survval and even hsory analyss Oslo, March-May 212 Monday May 7h and Tuesday May 8h The addve regresson model Ørnulf Borgan Deparmen of Mahemacs Unversy of Oslo Oulne of program: Recapulaon Counng

More information

Graduate Macroeconomics 2 Problem set 5. - Solutions

Graduate Macroeconomics 2 Problem set 5. - Solutions Graduae Macroeconomcs 2 Problem se. - Soluons Queson 1 To answer hs queson we need he frms frs order condons and he equaon ha deermnes he number of frms n equlbrum. The frms frs order condons are: F K

More information

Video-Based Face Recognition Using Adaptive Hidden Markov Models

Video-Based Face Recognition Using Adaptive Hidden Markov Models Vdeo-Based Face Recognon Usng Adapve Hdden Markov Models Xaomng Lu and suhan Chen Elecrcal and Compuer Engneerng, Carnege Mellon Unversy, Psburgh, PA, 523, U.S.A. xaomng@andrew.cmu.edu suhan@cmu.edu Absrac

More information

ELASTIC MODULUS ESTIMATION OF CHOPPED CARBON FIBER TAPE REINFORCED THERMOPLASTICS USING THE MONTE CARLO SIMULATION

ELASTIC MODULUS ESTIMATION OF CHOPPED CARBON FIBER TAPE REINFORCED THERMOPLASTICS USING THE MONTE CARLO SIMULATION THE 19 TH INTERNATIONAL ONFERENE ON OMPOSITE MATERIALS ELASTI MODULUS ESTIMATION OF HOPPED ARBON FIBER TAPE REINFORED THERMOPLASTIS USING THE MONTE ARLO SIMULATION Y. Sao 1*, J. Takahash 1, T. Masuo 1,

More information

Analysis And Evaluation of Econometric Time Series Models: Dynamic Transfer Function Approach

Analysis And Evaluation of Econometric Time Series Models: Dynamic Transfer Function Approach 1 Appeared n Proceedng of he 62 h Annual Sesson of he SLAAS (2006) pp 96. Analyss And Evaluaon of Economerc Tme Seres Models: Dynamc Transfer Funcon Approach T.M.J.A.COORAY Deparmen of Mahemacs Unversy

More information

Tight results for Next Fit and Worst Fit with resource augmentation

Tight results for Next Fit and Worst Fit with resource augmentation Tgh resuls for Nex F and Wors F wh resource augmenaon Joan Boyar Leah Epsen Asaf Levn Asrac I s well known ha he wo smple algorhms for he classc n packng prolem, NF and WF oh have an approxmaon rao of

More information

A Deterministic Algorithm for Summarizing Asynchronous Streams over a Sliding Window

A Deterministic Algorithm for Summarizing Asynchronous Streams over a Sliding Window A Deermnsc Algorhm for Summarzng Asynchronous Sreams over a Sldng ndow Cosas Busch Rensselaer Polyechnc Insue Srkana Trhapura Iowa Sae Unversy Oulne of Talk Inroducon Algorhm Analyss Tme C Daa sream: 3

More information

Pattern Classification (III) & Pattern Verification

Pattern Classification (III) & Pattern Verification Preare by Prof. Hu Jang CSE638 --4 CSE638 3. Seech & Language Processng o.5 Paern Classfcaon III & Paern Verfcaon Prof. Hu Jang Dearmen of Comuer Scence an Engneerng York Unversy Moel Parameer Esmaon Maxmum

More information

ABSTRACT KEYWORDS. Bonus-malus systems, frequency component, severity component. 1. INTRODUCTION

ABSTRACT KEYWORDS. Bonus-malus systems, frequency component, severity component. 1. INTRODUCTION EERAIED BU-MAU YTEM ITH A FREQUECY AD A EVERITY CMET A IDIVIDUA BAI I AUTMBIE IURACE* BY RAHIM MAHMUDVAD AD HEI HAAI ABTRACT Frangos and Vronos (2001) proposed an opmal bonus-malus sysems wh a frequency

More information

Reactive Methods to Solve the Berth AllocationProblem with Stochastic Arrival and Handling Times

Reactive Methods to Solve the Berth AllocationProblem with Stochastic Arrival and Handling Times Reacve Mehods o Solve he Berh AllocaonProblem wh Sochasc Arrval and Handlng Tmes Nsh Umang* Mchel Berlare* * TRANSP-OR, Ecole Polyechnque Fédérale de Lausanne Frs Workshop on Large Scale Opmzaon November

More information

P R = P 0. The system is shown on the next figure:

P R = P 0. The system is shown on the next figure: TPG460 Reservor Smulaon 08 page of INTRODUCTION TO RESERVOIR SIMULATION Analycal and numercal soluons of smple one-dmensonal, one-phase flow equaons As an nroducon o reservor smulaon, we wll revew he smples

More information