Performance Prediction of Parallel Low-level Image Processing Operations
Zoltan Juhasz
Dept. of Information Technology and Automation, University of Veszprem
Veszprem, P.O.Box 158, H-8201, Hungary

Abstract. An analytical performance prediction method is presented for predicting the performance of transputer based low-level image processing operations. The method is based on the fixed-sized speedup formula and uses a system overhead function to represent the communication and computation overhead as well as the hardware and implementation dependent (e.g. topology) characteristics of the parallel system. To determine the effect of communication on performance, first the properties of the transputer link are investigated to create a simple link model, then the communication patterns of the data-parallel algorithms are analysed to derive the overhead function. Using this overhead function, the parallel execution time, speedup and scalability of a given algorithm can be predicted in advance, without system building, helping developers to design efficient and scalable transputer-based parallel programs.

1. Introduction

With the rapid advancement of hardware technology, massively parallel computers are becoming more and more widely used. Unfortunately, the utilisation of these machines, i.e. the actual delivered computational performance on real problems, is often only a fraction of the machine's advertised peak performance [1]. This is mainly due to the inefficient parallelisation of the problems. Finding the best parallel algorithm for a given architecture is a complex task. Unlike sequential programs, where the performance can be easily determined by the processor speed and the number of required machine instructions, the performance of parallel algorithms depends on several other factors, such as grain size, communication speed, interconnection topology, process allocation, etc., which makes the design phase inherently more difficult. An efficient performance prediction method enables the programmer to investigate the effect of parameter and algorithm changes on the performance.
It also helps to determine what hardware platform is best suited to a given parallel algorithm and provides information to hardware architects on what system parameters need to be changed to achieve improved performance.

Parallel software is expected to be scalable, i.e. the program should run on a variety of architectures (from a few-processor small scale system to a massively parallel machine consisting of thousands of processors) without any modification. This property enables cost-effective prototyping on a small system, and improves maintainability and portability.
A good performance prediction method should be simple, accurate and quick to be incorporated into the design phase of the development, otherwise experimenting becomes too costly. The most frequently followed method is the measure-and-modify approach. In this iterative process, the execution time of the implemented algorithm is measured and, if necessary, changes are made to (hopefully) improve performance. This method, although simple, does not provide insight into the details of the implementation and it requires the use of the full-scale parallel system. Simulation methods are sometimes used to predict the performance of parallel programs but they are very time-consuming and for large problems they become too complex to be feasible [2]. Analytical methods use models of the architecture and the algorithm, and express the execution time in a closed form expression [1][3-5]. They allow examining the effect of system parameter changes and predicting performance using only a few-processor system.

Accurate prediction is especially important in image processing applications, where often severe time constraints must be met. For achieving fast execution, many processors must be employed, therefore it is vital to know the performance before purchasing or building an expensive massively parallel computer. Furthermore, as image processing algorithms are applied on fixed size images, problems cannot be scaled up to achieve higher performance on larger machines. Therefore it is important to analyse the given implementation of the algorithm in the initial development phase to produce an efficient and scalable parallel algorithm.

This paper presents an analytical performance prediction method that can be used to predict the execution time and other performance metrics of low-level image processing operations running on a p processor parallel system. The assumed architectures are a one-dimensional and a two-dimensional array of transputers, connected in a static nearest-neighbour pattern. The most important part of the method is the calculation of the overhead function, as it is the overhead that determines the achievable speedup. Because of the array topology, communication has a major effect on the overhead. The communication overhead, therefore, is examined carefully.
The method is illustrated on three image processing operations, the addition, convolution and histogram calculation, which represent three different classes of algorithms, the operations with no, local and global communication, respectively. The developed analytical model provides an accurate and fast prediction of performance. It requires only a few system parameters that can be obtained from hardware specification or from measurements on a two-transputer system. The method can help programmers in performance debugging to detect critical parts of an implementation and system designers to predict performance as well as the effects of changing hardware parameters.

The organisation of the paper is the following. Section 2 develops the analytical performance prediction method and describes how it can be used to improve and analyse performance. In Section 3 the transputer communication medium, the link, is analysed for its performance in order to build a simple data transfer model describing link behaviour. The effect of protocol choice, data size and message startup time on the communication latency is examined. Section 4 investigates the different communication patterns found in data-parallel, low-level image processing algorithms and determines the communication overhead. Combining the hardware-level link model, the communication and computation overhead formulas, the system overhead functions for the selected operations are derived. Section 5 presents and compares the predicted and experimental performance results. The accuracy of the analytical method is shown. Critical parts in the algorithm-machine combination are highlighted and improvements are made in order to achieve better performance. Finally, in Section 6 conclusions are made and directions for future work are highlighted.
2. Developing the performance model

The main motivation for developing the following analytical model was to provide a tool for the programmer to predict the performance of a given implementation of a transputer-based message-passing parallel program prior to implementation. The model should provide an accurate prediction of execution time and speedup and make it possible to study the effects of system parameter changes.

Several performance metrics are used for performance prediction. Our model is based on the problem size bound speedup and is somewhat influenced by the method of isoefficiency analysis [3]. Unlike isoefficiency analysis, however, our method uses exact prediction of execution time and speedup, not only asymptotic analysis. We define the parallel execution time of an algorithm as

$T_p = T_1 / p + T_o(p)$

where $T_1$ is the execution time of the serial version of the algorithm on one processor, $p$ is the number of processors in the system and $T_o(p)$ is the overhead function that represents communication overhead, synchronisation delays and the execution of unparallelisable parts of the algorithm. Speedup is defined as $S(p) = T_1 / T_p$. Substituting the expression for parallel execution time into the speedup formula, we arrive at

$S(p) = \frac{p}{1 + p\,T_o(p)/T_1} = \frac{p}{1 + p\,T_o(p)/(W t_c)}$ (1)

where $W$ is the problem size (the number of operations to be performed) and $t_c$ is the unit cost of an operation. The focus of the analysis is on the term $T_o(p)/(W t_c)$, since this term will determine the overall performance. To achieve ideal performance and scalability, this term should be inversely proportional to $p$. If this is not possible, the overhead should be constant or, in the worst case, a slowly increasing function of $p$.

Once the overhead function is known, the required performance metric can be calculated. The value of $T_1$ can be obtained from a trial run or can be calculated from processor specifications. The model can be thought of as a layered system [6], where the hardware layer is represented by $t_c$ and by the communication system parameters in $T_o(p)$. The parallel and application layers are represented by the overhead function, which expresses the communication pattern (consequently the parallel decomposition) of the algorithm.

3. Analysing link performance

To describe the effect of communication on the overhead accurately, it is necessary to examine the transputer link and determine how the choice of protocol and data size affects communication latency and bandwidth. Transputer links can operate at 5, 10 or 20 MBit/sec link speed. Given the 11 bit per byte (2 start bits, 8 data bits, 1 stop bit) serial transmission method of the links [7], the ideal transfer rate is $v / 11$ MByte/sec, where $v$ denotes the speed of the given link. Table 1 lists the ideal transfer rate values for uni-directional and bi-directional data transfer at different link speeds as well as the real transfer rates that were obtained by Inmos's experiment [7]. The Inmos measurement was carried out using the transputer's internal memory, therefore communication time in applications using external memory is increased by the corresponding external memory access time. Note that the bi-directional rates are only [...] times the uni-directional ones.
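The ideal rate quoted above follows directly from the 11 bits that each data byte occupies on the wire. The short sketch below works this out for the three link speeds named in the text; the function names are illustrative only and are not taken from the paper.

```python
BITS_PER_BYTE_ON_LINK = 11  # 2 start bits + 8 data bits + 1 stop bit


def ideal_rate_mbyte_per_sec(link_speed_mbit: float) -> float:
    """Ideal uni-directional transfer rate v / 11 in MByte/sec."""
    return link_speed_mbit / BITS_PER_BYTE_ON_LINK


def ideal_byte_time_usec(link_speed_mbit: float) -> float:
    """Ideal time to move one byte over the link, 11 / v, in microseconds."""
    return BITS_PER_BYTE_ON_LINK / link_speed_mbit


if __name__ == "__main__":
    for v in (5, 10, 20):
        rate = ideal_rate_mbyte_per_sec(v)
        byte_time = ideal_byte_time_usec(v)
        print(f"{v:2d} MBit/sec: {rate:.3f} MByte/sec, {byte_time:.2f} usec per byte")
```

These are upper bounds only: acknowledge handling and message startup time, examined in the following subsections, reduce the rate that is actually delivered.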
Table 1. Ideal and actual link data transfer rates [MByte/sec] for each link speed [MBit/sec], uni-directional and bi-directional, ideal and measured by Inmos (table values not recoverable)

In a transputer program the physical link remains hidden to the programmer; communication is carried out via named channels. Depending on how processes are mapped onto processors, these channels can be internal or external channels. The way data is transferred over a channel is defined by protocols. Occam and the other parallel languages for the transputer offer a choice of protocols to be used for the communication [8].

In an ideal system the transfer time is expected to be proportional to the data size $N$, and to be independent of the transmission method and protocol used. In real systems, such as transputer ones, this is not the case. The effect of the two most frequently used protocols, the CHAN OF primitive.type and the CHAN OF counted array, on data transfer is examined. For simplicity only the CHAN OF BYTE and the CHAN OF INT16::[]BYTE protocols are discussed here, since the other types can be derived from these two. The link data transfer time is measured on a two-transputer system, where the two transputers are connected by a direct, $v = 20$ MBit/sec, link.

3.1 The CHAN OF BYTE protocol

The ideal transmission time, $T_w'$, of a one word (one word = one byte) message over a link can be calculated as $T_w' = 11 / v$. Given a 20 MBit/sec link, $T_w'$ results in 0.55 µsec. In real systems, however, the byte protocol based transmission time is larger due to some internal processing and the presence of the acknowledge signal. Furthermore, there is the extra startup time that must be added to the transmission time each time a message (in this case a byte) is sent over the link. Therefore the transfer time, $T_{transfer}$, of $N$ bytes of data is not $T_{transfer} = N T_w'$, but rather

$T_{transfer} = N (T_w + T_s) = N T_{Byte}$,

where $T_w$ is the real transmission time of one byte over the link and $T_s$ is the message startup time. The startup time $T_s$ can be estimated from the assembly equivalent of the data transfer code or simply from measuring the time of sending $N$ bytes of data over the link.
Table 2 includes the measured transfer times obtained by transmitting data of four different sizes, 4, 16, 64 and 256 KBytes, over an external and an internal channel. The measurements were performed using a 20 MBit/sec link. From the measured data, $T_{Byte}$ becomes [...] µsec. Table 2 also lists the transfer rate values corresponding to the measured data transfer times. As expected, sending data over a channel in individual bytes is very inefficient (transfer rate = [...] MByte/sec) due to the large startup time. It is shown that although the internal channel is implemented by data transfer in memory, the transfer rate is approximately equal to that of the external channel.

Table 2. Data transfer times [msec] and bandwidth results [MByte/sec] per image size, for the external and the internal channel (table values not recoverable)
Table 3. Unit transfer time values, $T_{type}$ [msec], for the primitive types BYTE, INT16, INT, REAL32 and REAL64 (only one entry, 0.01 msec listed for an INT type, survives)

If instead of the primitive type BYTE other types, such as INT16, INT, REAL32 or REAL64, are used, the transmission time $T_{type}$ of a primitive data type is increased in proportion to the size of the new type in bytes. Table 3 lists the values of $T_{type}$ for the different primitive types.

3.2 The CHAN OF INT16::[]BYTE protocol

It is expected that the counted array protocol, where data is sent over the link in packets, is more efficient since the number of communication initiations (i.e. startups) is reduced to the number of packets. Let $k$ denote the size of a data packet. Then we can define the packet transfer time function, $t_p(k)$, which gives the time of transmitting one packet over the link as

$t_p(k) = (k + 2) T_w + T_s$ (2)

where the extra 2 bytes in the packet length are due to the INT16 tag that precedes the data packet. From this it follows that the total time, $T_{transfer}$, of transmitting $N$ bytes of data in packets of length $k$ is given as

$T_{transfer} = \frac{N}{k} [(k + 2) T_w + T_s]$ (3)

or, by using Eq. (2), as

$T_{transfer} = \frac{N}{k} t_p(k)$ (4)

Again, $T_w$ and $T_s$ are estimated from measured data. Table 4 lists the measured transfer time results obtained by transmitting [...] bytes of data in packets of varying length. Parameters $T_w$ and $T_s$ can be calculated from these data using least squares curve fitting. This results in $T_w$ = [...] µsec and $T_s$ = [...] µsec (at 20 MBit/sec link speed). It can be seen that for small packet sizes the counted array protocol is also very inefficient, due to the already mentioned startup time and because the extra 2 bytes of the INT16 prefix become a large overhead. However, for large packet sizes this method becomes very efficient, almost achieving the best transfer rate given by Inmos. Figure 1 shows the plot of the measured transfer time values as a function of the packet length, as well as Eq. (3) fitted onto these data points.

Table 4. Transfer times [msec] for sending N bytes of data in packets of varying size, up to 16,384 bytes (table values not recoverable)
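The least squares fit of $T_w$ and $T_s$ mentioned above can be sketched as follows. Eq. (3) is linear in the two unknowns, so the fit reduces to a 2x2 system of normal equations. Since the measured values are not available here, the sample timings are generated from the model itself with hypothetical parameters (0.6 µsec and 5.0 µsec are assumptions, not the paper's results); the function names are likewise illustrative.

```python
def packet_time(k, t_w, t_s):
    """Eq. (2): time of one packet of k data bytes plus the 2-byte INT16 tag."""
    return (k + 2) * t_w + t_s


def transfer_time(n, k, t_w, t_s):
    """Eq. (3)/(4): time of sending n bytes in packets of length k."""
    return (n / k) * packet_time(k, t_w, t_s)


def fit_link_parameters(n, samples):
    """Least squares estimate of (T_w, T_s) from (k, measured_time) pairs.

    Eq. (3) is rewritten as T = T_w * a + T_s * b with
    a = n * (k + 2) / k and b = n / k, then the normal equations are solved.
    """
    saa = sab = sbb = say = sby = 0.0
    for k, t in samples:
        a = n * (k + 2) / k
        b = n / k
        saa += a * a
        sab += a * b
        sbb += b * b
        say += a * t
        sby += b * t
    det = saa * sbb - sab * sab
    t_w = (say * sbb - sby * sab) / det
    t_s = (saa * sby - sab * say) / det
    return t_w, t_s


if __name__ == "__main__":
    N = 65536                      # hypothetical data size in bytes
    TRUE_T_W, TRUE_T_S = 0.6, 5.0  # hypothetical parameters in usec
    packet_sizes = [1, 4, 16, 64, 256, 1024, 4096, 16384]
    samples = [(k, transfer_time(N, k, TRUE_T_W, TRUE_T_S)) for k in packet_sizes]
    t_w, t_s = fit_link_parameters(N, samples)
    print(f"fitted T_w = {t_w:.3f} usec, T_s = {t_s:.3f} usec")
```

With real measurements the samples would carry noise and the fit would only approximate the underlying parameters; at least two distinct packet lengths are needed for the system to be solvable.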
Figure 1. Data transfer time as a function of packet size (measured data and the fitted curve $t(k) = (N/k)[(k+2)T_w + T_s]$; x-axis: packet length $k$, y-axis: time [msec])

The measured transfer rate varies from [...] to [...] MByte/sec, depending on the packet length used for the data transfer. Table 4 and Fig. 1 indicate that the packet based transfer method is efficient for packet lengths larger than 64. For large packets the startup time becomes negligible compared to the time of transferring the data in the packet. Again, using other primitive types, such as INT16, INT, REAL32 or REAL64, the transfer time will be altered by the size of the new type expressed in bytes.

Table 5. Unit transfer time, $T_w$ [msec], and startup time, $T_s$ [msec], values for the primitive types BYTE, INT16, INT, REAL32 and REAL64 (table values not recoverable)

4. Overhead Calculation

This section examines the parallel implementations of the selected image processing operations, addition, convolution and histogram calculation, in order to determine their communication patterns and, based on this, to develop efficient communication methods that minimise the communication overhead. Finally an overhead model is defined for each case, which will be used in the performance prediction.

4.1. Addition: No communication

The addition operation is the ideal one for parallel implementation. Each processor holds one $N/p$ segment of the image of size $N$ in its local memory. Since the addition is a strict point operation, each processor can calculate the new pixel value independently of the others, therefore there is no overhead involved in the operation, thus the overhead function is $T_o = 0$.
4.2. Convolution: Nearest-neighbour communication

Since the convolution operation depends on a small neighbourhood of the given pixel, when performing calculations on the edges of these segments, the processor requires those pixels which belong to a neighbour processor's segment. The common practice is to store a larger, overlapped segment on each processor that includes the pixels of the boundary region of the neighbour segment. This way the complete convolution can be performed locally on each processor; only the border values must be exchanged. This border update operation (border swap) must be performed before the convolution is calculated.

One-dimensional array implementation

Figure 2 illustrates a segment of a $p$ processor one-dimensional array with its border areas. The image size is $N \times N$, the segment size is $N/p \times N$. The swap process can be carried out in two different ways. One method is to swap the border areas one after another (sequential swap). The other method is to carry out all swaps at the same time by making use of the bi-directional communication property of the transputer links (parallel swap).

Figure 2. Image segment and its borders on a 1-D array (border areas A, B, C and D of width b)

Sequential swap

The swap of one $b \times N$ size area is generally carried out by sending $b$ packets with length $k = N$ to the neighbour processor. Thus, the swap time of one area is $b\,t_p(N)$. If each processor performs the exchange of the four areas sequentially, due to the synchronisation effect, the total time will be proportional to the length of the array. Furthermore, since each link must accommodate two swap operations, the total swap time is

$T_{swap} = 2 (p - 1)\, b\, t_p(N)$ (5)

The complexity of the sequential swap operation is $\Theta(p)$. However, we can greatly improve the performance of the swap if we use the links in parallel, each performing a bi-directional transfer as described next.

Parallel swap

In the parallel implementation of the swap operation the top (A, B) and bottom (C, D) areas can be exchanged simultaneously. Furthermore, each link can accommodate data transmission in both directions, thus the send and receive operations can be performed at the same time. As the transfer time in the bi-directional mode is approximately 1.4 times that of the uni-directional one (see Table 1), the total parallel swap time is given as

$T_{swap} = 1.4\, b\, t_p(N)$ (6)

Using this method, the swap operation has become a constant time operation and is independent of the network size, $p$.

Two-dimensional array implementation

When a two-dimensional processor array is used, there are four borders to consider and the segment sizes are also slightly different from the one-dimensional array case, as shown in Figure 3. Since the sequential implementation of the swap operation is very inefficient, here we consider the 2-D extension of the parallel swap only. The segment is of size $N/\sqrt{p} \times N/\sqrt{p}$, the size of the outer border areas is $b \times (N/\sqrt{p} + 2b)$ and the size of the inner borders is $b \times N/\sqrt{p}$. To simplify the implementation of the swap operation we will use the $b \times (N/\sqrt{p} + 2b)$ border size for the inner areas as well.

Figure 3. Image segment and its borders on a 2-D processor array (border areas A to H of width b)

The transputer links would allow the parallel exchange of all four boundaries simultaneously, but because of the corners of the outer border areas, which belong to diagonally neighbouring processors, the swap must be performed in two steps: a horizontal and a vertical swap. Using a packet of length $k = N/\sqrt{p} + 2b$ for the exchange, the total two-dimensional swap time is given by

$T_{swap} = 2.8\, b\, t_p(N/\sqrt{p} + 2b)$ (7)

Now the swap operation is a decreasing function of $p$. It follows from Eq. (2) that $T_{swap}$ has a lower bound of $2.8\,t_p(3)$, therefore $T_{swap} \geq$ [...] msec.

4.3. Histogram calculation: global communication

During histogram calculation the processors calculate the local histogram of their local image segments in parallel. These local results are sent to a master process to calculate the global histogram. This global histogram is then sent back (broadcast) to each node to make further histogram-based computation possible. The algorithm is thus made up of calculation, gather and broadcast steps.
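Returning briefly to the border swap of Section 4.2, the three swap-time formulas, Eqs. (5) to (7), can be compared numerically with a small sketch. The link parameters below are hypothetical placeholders (the measured values are not available in this text), and the function names are illustrative.

```python
import math

T_W = 0.6  # hypothetical per-byte transmission time, usec
T_S = 5.0  # hypothetical message startup time, usec


def packet_time(k, t_w=T_W, t_s=T_S):
    """Eq. (2): transfer time of one packet of k bytes."""
    return (k + 2) * t_w + t_s


def swap_sequential_1d(p, n, b):
    """Eq. (5): sequential border swap on a p-processor 1-D array."""
    return 2 * (p - 1) * b * packet_time(n)


def swap_parallel_1d(n, b):
    """Eq. (6): parallel, bi-directional border swap on a 1-D array."""
    return 1.4 * b * packet_time(n)


def swap_parallel_2d(p, n, b):
    """Eq. (7): two-step parallel swap on a sqrt(p) x sqrt(p) array."""
    k = n / math.sqrt(p) + 2 * b
    return 2.8 * b * packet_time(k)


if __name__ == "__main__":
    n, b = 256, 1  # 256 x 256 image, 3 x 3 convolution mask
    for p in (4, 16, 64):
        print(p,
              round(swap_sequential_1d(p, n, b), 1),
              round(swap_parallel_1d(n, b), 1),
              round(swap_parallel_2d(p, n, b), 1))
```

Under these assumptions the sequential swap grows linearly with p, the 1-D parallel swap stays constant, and the 2-D swap shrinks as the segments get smaller, which is the qualitative behaviour the text describes.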
One-dimensional array implementation

In gather and broadcast based operations data must travel along the links of the array to reach its destination. The duration of this travel is proportional to the diameter of the network. In a one-dimensional array this is $d = p - 1$. Assuming a 256 gray level image, the total overhead of the histogram calculation on a 1-D array is

$T_o = (p - 1)\,(2\,t_{INT}(256) + 256\,t_c)$ (8)

where $t_{INT}(256)$ is the packet transfer function using the INT data type and $t_c$ is the cost of the addition operation.

Two-dimensional array implementation

A square array produces a similar result. The only difference is in the diameter of the network, $d = 2(\sqrt{p} - 1)$. Thus, the total overhead in the 2-D case is

$T_o = 2(\sqrt{p} - 1)\,(2\,t_{INT}(256) + 256\,t_c)$ (9)

5. Prediction Results

5.1. Execution time and speedup prediction

Using the overhead functions developed above and the serial execution time values, $T_1$, listed in Table 6, the parallel execution time, speedup and efficiency can be predicted for an arbitrary size network. To preserve space, only the execution time results for a 1-D array are shown below; speedup and efficiency can be derived from this information.

Convolution

Figure 4 illustrates the predicted and measured execution time for the convolution operation implemented on a one-dimensional transputer array. When using the sequential swap method, the overhead increases with $p$ and communication time will dominate execution time for $p$ [...] $N/2$; consequently the execution time will increase and speedup will start to decrease. The parallel swap method produces a constant time overhead function, therefore the execution time in this case will decrease monotonically for $p$ up to $N$, resulting in very good scalability.

Table 6. Serial execution times, $T_1$ [msec], for different image sizes (columns: addition, conv 3×3, conv 5×5, conv 7×7, histogram; table values not recoverable)
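The prediction step described in Section 5.1 can be sketched for the 1-D histogram case: the overhead of Eq. (8) is inserted into $T_p = T_1/p + T_o(p)$, and speedup follows from Eq. (1). All numeric values below are hypothetical stand-ins for the Table 6 and link measurements, chosen only to make the shape of the curve visible; the names are illustrative.

```python
T1 = 400.0       # hypothetical serial execution time, msec
T_INT_256 = 0.8  # hypothetical t_INT(256), one 256-entry INT packet, msec
T_C = 0.001      # hypothetical cost of one addition, msec


def histogram_overhead_1d(p, t_int_256=T_INT_256, t_c=T_C):
    """Eq. (8): T_o = (p - 1) * (2 * t_INT(256) + 256 * t_c)."""
    return (p - 1) * (2.0 * t_int_256 + 256.0 * t_c)


def parallel_time(t1, p, overhead):
    """T_p = T_1 / p + T_o(p)."""
    return t1 / p + overhead


def speedup(t1, p, overhead):
    """S(p) = T_1 / T_p."""
    return t1 / parallel_time(t1, p, overhead)


def speedup_closed_form(t1, p, overhead):
    """Eq. (1): S(p) = p / (1 + p * T_o(p) / T_1)."""
    return p / (1.0 + p * overhead / t1)


def best_processor_count(max_p, t1=T1):
    """Processor count with the smallest predicted execution time."""
    return min(range(1, max_p + 1),
               key=lambda p: parallel_time(t1, p, histogram_overhead_1d(p)))


if __name__ == "__main__":
    for p in (1, 2, 4, 8, 16, 32):
        t_o = histogram_overhead_1d(p)
        print(p, round(parallel_time(T1, p, t_o), 2), round(speedup(T1, p, t_o), 2))
    print("fastest configuration:", best_processor_count(64), "processors")
```

Because the overhead grows linearly with p while the computation term shrinks as 1/p, the predicted execution time passes through a minimum and then rises again, which is the balance point referred to in the discussion of the histogram results.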
Figure 4. Predicted and measured execution time for convolution on a 1-D array (log time [msec] against log processors; curves for the 3×3, b=1, the 5×5, b=2 and the 7×7, b=3 masks: predicted with sequential swap, predicted with parallel swap, and measured)

The two-dimensional array implementation of the convolution operation displays even better performance, as the swap operation is a decreasing function of $p$. Therefore the execution time is monotonically decreasing until [...] $N$ and consequently the scalability of the algorithm is ideal. The accuracy of the prediction method is shown in Table 7. The largest difference is [...] %, while the average difference is less than 0.16 %.

Table 7. The error of execution time prediction for the convolution operation, in % (table values not recoverable)

Histogram

The predicted and measured execution times of the histogram operation are shown in Figure 5 for a one-dimensional array. The effect of the increasing overhead function (Eq. 8) is obvious. Communication time will very soon start to dominate and, after reaching the balance point, increase execution time. It can also be seen that the implementation is sensitive to the problem size. For small image sizes (e.g. [...]) there is almost no benefit from using parallelism, while for large images ([...] or larger) we can gain a performance increase up to [...] processors.
Figure 5. Predicted and measured execution time for the histogram operation on a 1-D array (log time [msec] against log processors; prediction data and measured data)

The accuracy of the prediction method is shown in Table 8. The largest difference is [...] %, while the average difference is less than 0.75 %. The accuracy of our analytical method is notable when compared to results of other performance prediction methods [1][6].

Table 8. The error of execution time prediction for the histogram operation, in % (table values not recoverable)

5.2. Detecting performance-critical points, improving the implementation

The described analytical model enables the programmer or system designer to investigate the effects of algorithmic and/or system parameter changes on the performance. This can be very useful in performance tuning or in the architecture selection process. Our model depends on the following parameters: $t_c$ (implicit in $T_1$), the message startup time $T_s$ and the communication speed $v$ (implicit in $T_w$). The different problem decompositions are expressed in the communication pattern, thus in the communication overhead. The effect of changes in these parameters can be examined and easily visualised with an appropriate plotting program. While in a strict transputer system the communication and processor speed parameters are constants, these parameters make it possible to compare the performance of a given parallel algorithm on different message passing architectures (e.g. the C40 processors or workstation clusters, etc.).
The model also helps to detect the critical points of the algorithms that reduce performance. By analysing the overhead function, the source of the performance limiting terms can be identified and, by changing the problem decomposition, the communication strategy or the protocol, the effect of these terms can be weakened or eliminated.

Convolution

We saw that the convolution operation using the parallel swap technique performs ideally on both the one-dimensional and the two-dimensional arrays. This result was naturally expected, as only nearest-neighbour communication was used. The swap operations used the most straightforward communication strategy: data was sent in packets of length equal to one row of a segment. This resulted in the overhead functions of Eq. (6) and Eq. (7). From these equations it can be seen that if the full amount of data is sent in one packet, instead of $b$ smaller packets, the frequency of communication can be reduced, therefore the startup time will have to be accounted for only once. This results in the following overhead functions:

$T_{swap} = 1.4\, t_p(bN)$

for the linear processor array and

$T_{swap} = 2.8\, t_p(b\,(N/\sqrt{p} + 2b))$

for the two-dimensional array, and consequently in improved performance.

Histogram

We have seen in Section 5.1 that the histogram operation becomes inefficient as $p$ increases due to the global communication. Next we show that it is possible to use more efficient methods for communication and that this will result in improved performance.

The dominating factors in the overhead functions, Eq. (8) and Eq. (9), are the gather and broadcast parts. Since the broadcast operation sends the same data to all processors, this time can be reduced by using a pipelining technique during the transmission [9]. If we can divide the data $N$ into smaller packets of length $k$ and denote the distance to be travelled by the message with $m$, then the broadcast time is given as

$T_{broadcast} = \left(\frac{N}{k} + m - 1\right) t_p(k)$ (10)

If $k = 1$, then Eq. (10) becomes $(N + m - 1)\,t_p(1)$. In this case a byte buffer and individual byte transfer can be used instead of the counted array protocol, resulting in

$T_{broadcast} = (N + m - 1)\, T_{Byte}$ (11)

Eq. (10) is minimised at different values of $k$ for different distances.
It is shown in [10] that the optimal broadcast method giving the shortest communication time is described by the following function:

$T_{broadcast}(m) = \begin{cases} (256/32 + m - 1)\, t_{INT}(32) & \text{if } 1 < m \leq 3 \\ (256/16 + m - 1)\, t_{INT}(16) & \text{if } 3 < m \leq 7 \\ (256/8 + m - 1)\, t_{INT}(8) & \text{if } 7 < m \leq 63 \\ (256/4 + m - 1)\, t_{INT}(4) & \text{if } 63 < m \leq 127 \\ (256 + m - 1)\, T_{INT} & \text{if } 127 < m \end{cases}$ (12)
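The way the best packet length in Eq. (10) shifts with the distance $m$ can be explored with a brute-force search over candidate packet lengths. The sketch below uses hypothetical link parameters, so the break points it produces will generally differ from those of Eq. (12), which rest on the measured INT transfer times; the function names are illustrative.

```python
T_W = 0.6  # hypothetical per-unit transmission time, usec
T_S = 5.0  # hypothetical message startup time, usec


def packet_time(k, t_w=T_W, t_s=T_S):
    """Eq. (2): transfer time of one packet of length k."""
    return (k + 2) * t_w + t_s


def broadcast_time(n, k, m, t_w=T_W, t_s=T_S):
    """Eq. (10): pipelined broadcast of n units in packets of k over m hops."""
    return (n / k + m - 1) * packet_time(k, t_w, t_s)


def candidate_lengths(n):
    """Power-of-two packet lengths that divide n evenly."""
    lengths = []
    k = 1
    while k <= n:
        if n % k == 0:
            lengths.append(k)
        k *= 2
    return lengths


def best_packet_length(n, m, t_w=T_W, t_s=T_S):
    """Packet length among the candidates that minimises Eq. (10)."""
    return min(candidate_lengths(n),
               key=lambda k: broadcast_time(n, k, m, t_w, t_s))


if __name__ == "__main__":
    n = 256  # one histogram of 256 entries
    for m in (2, 4, 8, 64, 128):
        k = best_packet_length(n, m)
        print(f"m = {m:3d}: best k = {k:3d}, "
              f"time = {broadcast_time(n, k, m):.1f} usec")
```

With k equal to the full message the formula reduces to plain store-and-forward, m packet times; shorter packets let successive hops overlap, at the price of paying the startup time more often, so the preferred packet length shrinks as the distance grows.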
Using this improved broadcast method and a different communication and process allocation strategy, the performance of the histogram calculation can be improved. The basic idea of the suggested modification is the following. In the traditional implementation of the gather process each processor waits for its neighbours, and due to this synchronisation the histogram process becomes sequential. Instead, we try to partition the network in such a way that the combining processes can be executed in parallel. With this partitioning the combining of the local histograms can be performed in $\log_2 p$ time steps instead of the $p - 1$ steps of the traditional method [10]. Applying Eq. (8) and (9) to these reduced diameter arrays, the total overhead of the histogram operation becomes the following for a 1-D and a 2-D array, respectively:

$T_o = (t_{INT}(256) + 256\,t_c) \log_2 p + T_{broadcast}([...])$ (13)

$T_o = (t_{INT}(256) + 256\,t_c) \log_2 p + T_{broadcast}([...])$ (14)

The performance achieved with this improved histogram algorithm is compared to the traditional and the ideal tree-connected network implementation in Figure 6. It is shown that the performance has been improved with these modifications and the range of efficient parallel operation has been extended. Unfortunately, the overhead function is still proportional to $p$, therefore the communication time will again limit the achievable speedup. Obviously, if the given transputer system can be reconfigured dynamically into a binary tree topology, the best performance can be achieved. In this paper, however, a static topology has been assumed.

Figure 6. Comparison of the performance of different histogram algorithms (log time [msec] against log processors; traditional histogram, improved histogram, histogram on tree topology)
6. Conclusions and Future Work

This paper presented an analytical performance prediction method for transputer based low-level, parallel image processing operations. The selected operations represent the algorithm classes with no, local and global communication. The analytical model is based on the fixed size speedup formula and uses a global overhead function to express the effects of computation, communication overhead, synchronisation delays and hardware parameters.

Since communication is a dominant factor of the overhead, the effect of communication on the performance is examined. Two frequently used communication protocols have been analysed and link data transfer models have been derived for both. These models are used to represent hardware parameters in the overhead function.

The near-neighbour communication class convolution and the gather-broadcast class histogram algorithms have been examined on a one-dimensional and on a two-dimensional transputer array, and after analysing their communication patterns the overhead function for each operation has been developed.

Experimental data was obtained to verify the validity of the link transfer models and the overhead functions. All measurements have been carried out on a 16-node transputer system equipped with T800 processors.

The overhead functions were used to predict the execution time of the given algorithm-machine combination. The speedup and scalability metrics can be derived from the parallel execution time. The effect of system parameter changes on performance can be studied with the model. It has been shown that by identifying performance critical parts of the algorithm, it is often possible to improve performance by a change in the communication strategy.

The method provides accurate prediction and can effectively help programmers and system designers to design high-performance, efficient and scalable algorithms and systems. Future work would include the generalisation of the method for other algorithms and different hardware architectures including the T9000 processor, parallel DSP systems and workstation clusters.

Acknowledgement

This work was supported in part by the Hungarian National Science Foundation (OTKA) under Grant F [...].

References

[1] Mark J. Clement and Michael J. Quinn, Analytical Performance Prediction on Multicomputers, in Proceedings Supercomputing '93.
[2] E. Gelenbe, Multiprocessor Performance, John Wiley & Sons.
[3] A. Y. Grama, A. Gupta and V. Kumar, Isoefficiency: Measuring the Scalability of Parallel Algorithms and Architectures, IEEE Parallel and Distributed Technology, Vol. 1, No. 3, 12-21, August.
[4] X. Zhang, Y. Yan, Q. Ma, Measuring and Analyzing Parallel Computing Scalability, in Proc. Int. Conf. on Parallel Processing, CRC Press, August.
[5] Mark E. Crovella and Thomas J. LeBlanc, The Search for Lost Cycles: A New Approach to Parallel Program Performance Evaluation, Technical Report 479, The University of Rochester, December.
[6] M. J. Zemerly et al., Characterising Computational Kernels to Predict Performance of Parallel Systems, in A. De Gloria et al. (Eds.), Transputer Applications and Systems '94, IOS Press.
[7] Inmos Ltd., The Transputer Databook, Prentice Hall.
[8] Inmos Ltd., occam 2 Reference Manual, Prentice Hall.
[9] P. Bertsekas, J. N. Tsitsiklis, Parallel and Distributed Computation, Prentice Hall.
[10] Z. Juhasz, Efficient Communication Methods for Minimising Overhead in Parallel Image Processing Algorithms, in Proc. 2nd Austrian-Hungarian Workshop on Transputer Applications, 1994.
More informationEinstein Equations for Tetrad Fields
Apiron, Vol 13, No, Octobr 006 6 Einstin Equations for Ttrad Filds Ali Rıza ŞAHİN, R T L Istanbul (Turky) Evry mtric tnsor can b xprssd by th innr product of ttrad filds W prov that Einstin quations for
More informationA Prey-Predator Model with an Alternative Food for the Predator, Harvesting of Both the Species and with A Gestation Period for Interaction
Int. J. Opn Problms Compt. Math., Vol., o., Jun 008 A Pry-Prdator Modl with an Altrnativ Food for th Prdator, Harvsting of Both th Spcis and with A Gstation Priod for Intraction K. L. arayan and. CH. P.
More informationu x v x dx u x v x v x u x dx d u x v x u x v x dx u x v x dx Integration by Parts Formula
7. Intgration by Parts Each drivativ formula givs ris to a corrsponding intgral formula, as w v sn many tims. Th drivativ product rul yilds a vry usful intgration tchniqu calld intgration by parts. Starting
More information4.2 Design of Sections for Flexure
4. Dsign of Sctions for Flxur This sction covrs th following topics Prliminary Dsign Final Dsign for Typ 1 Mmbrs Spcial Cas Calculation of Momnt Dmand For simply supportd prstrssd bams, th maximum momnt
More informationHigher order derivatives
Robrto s Nots on Diffrntial Calculus Chaptr 4: Basic diffrntiation ruls Sction 7 Highr ordr drivativs What you nd to know alrady: Basic diffrntiation ruls. What you can larn hr: How to rpat th procss of
More informationWhat are those βs anyway? Understanding Design Matrix & Odds ratios
Ral paramtr stimat WILD 750 - Wildlif Population Analysis of 6 What ar thos βs anyway? Undrsting Dsign Matrix & Odds ratios Rfrncs Hosmr D.W.. Lmshow. 000. Applid logistic rgrssion. John Wily & ons Inc.
More informationComputing and Communications -- Network Coding
89 90 98 00 Computing and Communications -- Ntwork Coding Dr. Zhiyong Chn Institut of Wirlss Communications Tchnology Shanghai Jiao Tong Univrsity China Lctur 5- Nov. 05 0 Classical Information Thory Sourc
More informationDETERMINATION OF THE DISTORTION COEFFICIENT OF A 500 MPA FREE-DEFORMATION PISTON GAUGE USING A CONTROLLED-CLEARANCE ONE UP TO 200 MPA
XX IMEKO World Congrss Mtrology for Grn Growth Stmbr 9 14, 2012, Busan, Rublic of Kora DETERMINATION OF THE DISTORTION COEFFICIENT OF A 500 MPA FREE-DEFORMATION PISTON GAUGE USING A CONTROLLED-CLEARANCE
More informationThat is, we start with a general matrix: And end with a simpler matrix:
DIAGON ALIZATION OF THE STR ESS TEN SOR INTRO DUCTIO N By th us of Cauchy s thorm w ar abl to rduc th numbr of strss componnts in th strss tnsor to only nin valus. An additional simplification of th strss
More informationLearning Spherical Convolution for Fast Features from 360 Imagery
Larning Sphrical Convolution for Fast Faturs from 36 Imagry Anonymous Author(s) 3 4 5 6 7 8 9 3 4 5 6 7 8 9 3 4 5 6 7 8 9 3 3 3 33 34 35 In this fil w provid additional dtails to supplmnt th main papr
More informationPARTICLE MOTION IN UNIFORM GRAVITATIONAL and ELECTRIC FIELDS
VISUAL PHYSICS ONLINE MODULE 6 ELECTROMAGNETISM PARTICLE MOTION IN UNIFORM GRAVITATIONAL and ELECTRIC FIELDS A fram of rfrnc Obsrvr Origin O(,, ) Cartsian coordinat as (X, Y, Z) Unit vctors iˆˆj k ˆ Scif
More informationDifference -Analytical Method of The One-Dimensional Convection-Diffusion Equation
Diffrnc -Analytical Mthod of Th On-Dimnsional Convction-Diffusion Equation Dalabav Umurdin Dpartmnt mathmatic modlling, Univrsity of orld Economy and Diplomacy, Uzbistan Abstract. An analytical diffrncing
More informationINCOMPLETE KLOOSTERMAN SUMS AND MULTIPLICATIVE INVERSES IN SHORT INTERVALS. xy 1 (mod p), (x, y) I (j)
INCOMPLETE KLOOSTERMAN SUMS AND MULTIPLICATIVE INVERSES IN SHORT INTERVALS T D BROWNING AND A HAYNES Abstract W invstigat th solubility of th congrunc xy (mod ), whr is a rim and x, y ar rstrictd to li
More informationSymmetric centrosymmetric matrix vector multiplication
Linar Algbra and its Applications 320 (2000) 193 198 www.lsvir.com/locat/laa Symmtric cntrosymmtric matrix vctor multiplication A. Mlman 1 Dpartmnt of Mathmatics, Univrsity of San Francisco, San Francisco,
More informationThe pn junction: 2 Current vs Voltage (IV) characteristics
Th pn junction: Currnt vs Voltag (V) charactristics Considr a pn junction in quilibrium with no applid xtrnal voltag: o th V E F E F V p-typ Dpltion rgion n-typ Elctron movmnt across th junction: 1. n
More information2008 AP Calculus BC Multiple Choice Exam
008 AP Multipl Choic Eam Nam 008 AP Calculus BC Multipl Choic Eam Sction No Calculator Activ AP Calculus 008 BC Multipl Choic. At tim t 0, a particl moving in th -plan is th acclration vctor of th particl
More informationWHAT LIES BETWEEN + AND (and beyond)? H.P.Williams
Working Par LSEOR 10-119 ISSN 2041-4668 (Onlin) WHAT LIES BETWEEN + AND (and byond)? HPWilliams London School of Economics hwilliams@lsacuk First ublishd in Grat Britain in 2010 by th Orational Rsarch
More informationMCB137: Physical Biology of the Cell Spring 2017 Homework 6: Ligand binding and the MWC model of allostery (Due 3/23/17)
MCB37: Physical Biology of th Cll Spring 207 Homwork 6: Ligand binding and th MWC modl of allostry (Du 3/23/7) Hrnan G. Garcia March 2, 207 Simpl rprssion In class, w drivd a mathmatical modl of how simpl
More informationThe Matrix Exponential
Th Matrix Exponntial (with xrciss) by D. Klain Vrsion 207.0.05 Corrctions and commnts ar wlcom. Th Matrix Exponntial For ach n n complx matrix A, dfin th xponntial of A to b th matrix A A k I + A + k!
More information3-2-1 ANN Architecture
ARTIFICIAL NEURAL NETWORKS (ANNs) Profssor Tom Fomby Dpartmnt of Economics Soutrn Mtodist Univrsity Marc 008 Artificial Nural Ntworks (raftr ANNs) can b usd for itr prdiction or classification problms.
More informationSER/BER in a Fading Channel
SER/BER in a Fading Channl Major points for a fading channl: * SNR is a R.V. or R.P. * SER(BER) dpnds on th SNR conditional SER(BER). * Two prformanc masurs: outag probability and avrag SER(BER). * Ovrall,
More informationu 3 = u 3 (x 1, x 2, x 3 )
Lctur 23: Curvilinar Coordinats (RHB 8.0 It is oftn convnint to work with variabls othr than th Cartsian coordinats x i ( = x, y, z. For xampl in Lctur 5 w mt sphrical polar and cylindrical polar coordinats.
More informationHomotopy perturbation technique
Comput. Mthods Appl. Mch. Engrg. 178 (1999) 257±262 www.lsvir.com/locat/cma Homotopy prturbation tchniqu Ji-Huan H 1 Shanghai Univrsity, Shanghai Institut of Applid Mathmatics and Mchanics, Shanghai 272,
More informationEstimation over Communication Networks: Performance Bounds and Achievability Results
Estimation ovr Communication Ntworks: Prformanc Bounds and Achivability Rsults A. F. Dana, V. Guta, J. P. Hsanha, B. Hassibi R. M. Murray Abstract This ar considrs th roblm of stimation ovr communication
More informationThe Matrix Exponential
Th Matrix Exponntial (with xrciss) by Dan Klain Vrsion 28928 Corrctions and commnts ar wlcom Th Matrix Exponntial For ach n n complx matrix A, dfin th xponntial of A to b th matrix () A A k I + A + k!
More informationTechnical Manual. S-Curve Tool
Tchnical Manual for S-Curv Tool Vrsion 1.0 (as of 09/1/1) Sonsord by: Naval Cntr for Cost Analysis (NCCA) Dvlod by: Tchnomics, Inc. 01 1 th Strt South, Suit 61 Arlington, VA 0 Points of Contact: Bruc Parkr,
More informationExam 1. It is important that you clearly show your work and mark the final answer clearly, closed book, closed notes, no calculator.
Exam N a m : _ S O L U T I O N P U I D : I n s t r u c t i o n s : It is important that you clarly show your work and mark th final answr clarly, closd book, closd nots, no calculator. T i m : h o u r
More informationA COMPARISON OF SELECTED GBN ARQ SCHEMES FOR VARIABLE-ERROR-RATE CHANNEL USING QAM
Radionginring A Comarison of Slctd GBN ARQ Schms for Variabl-Error-Rat Channl Using QAM 43 Vol., No. 3, Stmbr 22. AEXOVÁ,. KOŠÚT, J. OEC, K. KOTUIAKOVÁ A COMARISON OF SEECTED GBN ARQ SCEMES FOR VARIABE-ERROR-RATE
More informationProblem Set 6 Solutions
6.04/18.06J Mathmatics for Computr Scinc March 15, 005 Srini Dvadas and Eric Lhman Problm St 6 Solutions Du: Monday, March 8 at 9 PM in Room 3-044 Problm 1. Sammy th Shark is a financial srvic providr
More informationAnswer Homework 5 PHA5127 Fall 1999 Jeff Stark
Answr omwork 5 PA527 Fall 999 Jff Stark A patint is bing tratd with Drug X in a clinical stting. Upon admiion, an IV bolus dos of 000mg was givn which yildd an initial concntration of 5.56 µg/ml. A fw
More informationEstimation of apparent fraction defective: A mathematical approach
Availabl onlin at www.plagiarsarchlibrary.com Plagia Rsarch Library Advancs in Applid Scinc Rsarch, 011, (): 84-89 ISSN: 0976-8610 CODEN (USA): AASRFC Estimation of apparnt fraction dfctiv: A mathmatical
More information1 Input-Output Stability
Inut-Outut Stability Inut-outut stability analysis allows us to analyz th stability of a givn syst without knowing th intrnal stat x of th syst. Bfor going forward, w hav to introduc so inut-outut athatical
More informationph People Grade Level: basic Duration: minutes Setting: classroom or field site
ph Popl Adaptd from: Whr Ar th Frogs? in Projct WET: Curriculum & Activity Guid. Bozman: Th Watrcours and th Council for Environmntal Education, 1995. ph Grad Lvl: basic Duration: 10 15 minuts Stting:
More informationThe Frequency Response of a Quarter-Wave Matching Network
4/1/29 Th Frquncy Rsons o a Quartr 1/9 Th Frquncy Rsons o a Quartr-Wav Matchg Ntwork Q: You hav onc aga rovidd us with conusg and rhas uslss ormation. Th quartr-wav matchg ntwork has an xact SFG o: a Τ
More informationSupplementary Materials
6 Supplmntary Matrials APPENDIX A PHYSICAL INTERPRETATION OF FUEL-RATE-SPEED FUNCTION A truck running on a road with grad/slop θ positiv if moving up and ngativ if moving down facs thr rsistancs: arodynamic
More informationA Sub-Optimal Log-Domain Decoding Algorithm for Non-Binary LDPC Codes
Procdings of th 9th WSEAS Intrnational Confrnc on APPLICATIONS of COMPUTER ENGINEERING A Sub-Optimal Log-Domain Dcoding Algorithm for Non-Binary LDPC Cods CHIRAG DADLANI and RANJAN BOSE Dpartmnt of Elctrical
More informationExtraction of Doping Density Distributions from C-V Curves
Extraction of Doping Dnsity Distributions from C-V Curvs Hartmut F.-W. Sadrozinski SCIPP, Univ. California Santa Cruz, Santa Cruz, CA 9564 USA 1. Connction btwn C, N, V Start with Poisson quation d V =
More informationStrain-softening in continuum damage models: Investigation of MAT_058
9th Euroan LS-DYNA Confrnc 2013 Strain-softning in continuum damag modls: Invstigation of MAT_058 Karla Simon Gmkow, Rad Vignjvic School of Enginring, Cranfild Univrsity, Cranfild, Bdfordshir, MK43 0AL,
More informationPhysical Organization
Lctur usbasd symmtric multiprocssors (SM s): combin both aspcts Compilr support? rchitctural support? Static and dynamic locality of rfrnc ar critical for high prformanc M I M ccss to local mmory is usually
More informationElements of Statistical Thermodynamics
24 Elmnts of Statistical Thrmodynamics Statistical thrmodynamics is a branch of knowldg that has its own postulats and tchniqus. W do not attmpt to giv hr vn an introduction to th fild. In this chaptr,
More informationFree Software Offer and Software Diffusion: The Monopolist Case
Association for Information Systms AIS Elctronic Library (AISL) ICIS 3 Procdings Intrnational Confrnc on Information Systms (ICIS) Dcmbr 3 Fr Softwar Offr and Softwar Diffusion: h Monoolist Cas Zhngrui
More informationDISTRIBUTION OF DIFFERENCE BETWEEN INVERSES OF CONSECUTIVE INTEGERS MODULO P
DISTRIBUTION OF DIFFERENCE BETWEEN INVERSES OF CONSECUTIVE INTEGERS MODULO P Tsz Ho Chan Dartmnt of Mathmatics, Cas Wstrn Rsrv Univrsity, Clvland, OH 4406, USA txc50@cwru.du Rcivd: /9/03, Rvisd: /9/04,
More informationCOHORT MBA. Exponential function. MATH review (part2) by Lucian Mitroiu. The LOG and EXP functions. Properties: e e. lim.
MTH rviw part b Lucian Mitroiu Th LOG and EXP functions Th ponntial function p : R, dfind as Proprtis: lim > lim p Eponntial function Y 8 6 - -8-6 - - X Th natural logarithm function ln in US- log: function
More informationFourier Transforms and the Wave Equation. Key Mathematics: More Fourier transform theory, especially as applied to solving the wave equation.
Lur 7 Fourir Transforms and th Wav Euation Ovrviw and Motivation: W first discuss a fw faturs of th Fourir transform (FT), and thn w solv th initial-valu problm for th wav uation using th Fourir transform
More informationForces. Quantum ElectroDynamics. α = = We have now:
W hav now: Forcs Considrd th gnral proprtis of forcs mdiatd by xchang (Yukawa potntial); Examind consrvation laws which ar obyd by (som) forcs. W will nxt look at thr forcs in mor dtail: Elctromagntic
More information1 Minimum Cut Problem
CS 6 Lctur 6 Min Cut and argr s Algorithm Scribs: Png Hui How (05), Virginia Dat: May 4, 06 Minimum Cut Problm Today, w introduc th minimum cut problm. This problm has many motivations, on of which coms
More information3 Finite Element Parametric Geometry
3 Finit Elmnt Paramtric Gomtry 3. Introduction Th intgral of a matrix is th matrix containing th intgral of ach and vry on of its original componnts. Practical finit lmnt analysis rquirs intgrating matrics,
More informationECE602 Exam 1 April 5, You must show ALL of your work for full credit.
ECE62 Exam April 5, 27 Nam: Solution Scor: / This xam is closd-book. You must show ALL of your work for full crdit. Plas rad th qustions carfully. Plas chck your answrs carfully. Calculators may NOT b
More informationChapter 6 Folding. Folding
Chaptr 6 Folding Wintr 1 Mokhtar Abolaz Folding Th folding transformation is usd to systmatically dtrmin th control circuits in DSP architctur whr multipl algorithm oprations ar tim-multiplxd to a singl
More informationKoch Fractal Boundary Single feed Circularly Polarized Microstrip Antenna
1 Journal of Microwavs, Optolctronics and Elctromagntic Applications, Vol. 6, No. 2, Dcmbr 2007 406 Koch Fractal Boundary Singl fd Circularly Polarizd Microstrip Antnna P. Nagswara Rao and N. V. S.N Sarma
More informationHomework #3. 1 x. dx. It therefore follows that a sum of the
Danil Cannon CS 62 / Luan March 5, 2009 Homwork # 1. Th natural logarithm is dfind by ln n = n 1 dx. It thrfor follows that a sum of th 1 x sam addnd ovr th sam intrval should b both asymptotically uppr-
More informationNetwork Congestion Games
Ntwork Congstion Gams Assistant Profssor Tas A&M Univrsity Collg Station, TX TX Dallas Collg Station Austin Houston Bst rout dpnds on othrs Ntwork Congstion Gams Travl tim incrass with congstion Highway
More informationDerangements and Applications
2 3 47 6 23 Journal of Intgr Squncs, Vol. 6 (2003), Articl 03..2 Drangmnts and Applications Mhdi Hassani Dpartmnt of Mathmatics Institut for Advancd Studis in Basic Scincs Zanjan, Iran mhassani@iasbs.ac.ir
More informationMachine Detector Interface Workshop: ILC-SLAC, January 6-8, 2005.
Intrnational Linar Collidr Machin Dtctor Intrfac Workshop: ILCSLAC, January 68, 2005. Prsntd by Brtt Parkr, BNLSMD Mssag: Tools ar now availabl to optimiz IR layout with compact suprconducting quadrupols
More informationFirst derivative analysis
Robrto s Nots on Dirntial Calculus Chaptr 8: Graphical analysis Sction First drivativ analysis What you nd to know alrady: How to us drivativs to idntiy th critical valus o a unction and its trm points
More informationAnalysis of the Monochloroacetic Acid Crystallization Process by Entropic Modeling
A ublication of 2071 VOL 32, 2013 CHEMICAL ENGINEERING TRANSACTIONS Chif Editors: Sauro Pirucci, Jiří J Klmš Coyright 2013, AIDIC Srvizi Srl, ISBN 978-88-95608-23-5; ISSN 1974-9791 Th Italian Association
More informationcycle that does not cross any edges (including its own), then it has at least
W prov th following thorm: Thorm If a K n is drawn in th plan in such a way that it has a hamiltonian cycl that dos not cross any dgs (including its own, thn it has at last n ( 4 48 π + O(n crossings Th
More informationAddition of angular momentum
Addition of angular momntum April, 0 Oftn w nd to combin diffrnt sourcs of angular momntum to charactriz th total angular momntum of a systm, or to divid th total angular momntum into parts to valuat th
More informationDesign Guidelines for Quartz Crystal Oscillators. R 1 Motional Resistance L 1 Motional Inductance C 1 Motional Capacitance C 0 Shunt Capacitance
TECHNICAL NTE 30 Dsign Guidlins for Quartz Crystal scillators Introduction A CMS Pirc oscillator circuit is wll known and is widly usd for its xcllnt frquncy stability and th wid rang of frquncis ovr which
More information4.4 Design of Sections for Flexure (Part III)
4.4 Dsign of Sctions for Flxur (Part ) This sction covrs th following topics. Choic of Sctions Dtrmination of Limiting Zon Post-tnsioning in Stags 4.4.1 Choic of Sctions Th typ of sction is slctd asd on
More informationMor Tutorial at www.dumblittldoctor.com Work th problms without a calculator, but us a calculator to chck rsults. And try diffrntiating your answrs in part III as a usful chck. I. Applications of Intgration
More informationCO-ORDINATION OF FAST NUMERICAL RELAYS AND CURRENT TRANSFORMERS OVERDIMENSIONING FACTORS AND INFLUENCING PARAMETERS
CO-ORDINATION OF FAST NUMERICAL RELAYS AND CURRENT TRANSFORMERS OVERDIMENSIONING FACTORS AND INFLUENCING PARAMETERS Stig Holst ABB Automation Products Swdn Bapuji S Palki ABB Utilitis India This papr rports
More informationUnfired pressure vessels- Part 3: Design
Unfird prssur vssls- Part 3: Dsign Analysis prformd by: Analysis prformd by: Analysis vrsion: According to procdur: Calculation cas: Unfird prssur vssls EDMS Rfrnc: EF EN 13445-3 V1 Introduction: This
More informationAlpha and beta decay equation practice
Alpha and bta dcay quation practic Introduction Alpha and bta particls may b rprsntd in quations in svral diffrnt ways. Diffrnt xam boards hav thir own prfrnc. For xampl: Alpha Bta α β alpha bta Dspit
More informationThe Transfer Function. The Transfer Function. The Transfer Function. The Transfer Function. The Transfer Function. The Transfer Function
A gnraliation of th frquncy rsons function Th convolution sum scrition of an LTI iscrt-tim systm with an imuls rsons h[n] is givn by h y [ n] [ ] x[ n ] Taing th -transforms of both sis w gt n n h n n
More informationEEO 401 Digital Signal Processing Prof. Mark Fowler
EEO 401 Digital Signal Procssing Prof. Mark Fowlr Dtails of th ot St #19 Rading Assignmnt: Sct. 7.1.2, 7.1.3, & 7.2 of Proakis & Manolakis Dfinition of th So Givn signal data points x[n] for n = 0,, -1
More informationSliding Mode Flow Rate Observer Design
Sliding Mod Flow Rat Obsrvr Dsign Song Liu and Bin Yao School of Mchanical Enginring, Purdu Univrsity, Wst Lafaytt, IN797, USA liu(byao)@purdudu Abstract Dynamic flow rat information is ndd in a lot of
More informationCollisions between electrons and ions
DRAFT 1 Collisions btwn lctrons and ions Flix I. Parra Rudolf Pirls Cntr for Thortical Physics, Unirsity of Oxford, Oxford OX1 NP, UK This rsion is of 8 May 217 1. Introduction Th Fokkr-Planck collision
More informationChapter 6: Polarization and Crystal Optics
Chaptr 6: Polarization and Crystal Optics * P6-1. Cascadd Wav Rtardrs. Show that two cascadd quartr-wav rtardrs with paralll fast axs ar quivalnt to a half-wav rtardr. What is th rsult if th fast axs ar
More informationCS 361 Meeting 12 10/3/18
CS 36 Mting 2 /3/8 Announcmnts. Homwork 4 is du Friday. If Friday is Mountain Day, homwork should b turnd in at my offic or th dpartmnt offic bfor 4. 2. Homwork 5 will b availabl ovr th wknd. 3. Our midtrm
More informationE hf. hf c. 2 2 h 2 2 m v f ' f 2f ' f cos c
EXPERIMENT 9: COMPTON EFFECT Rlatd Topics Intractions of photons with lctrons, consrvation of momntum and nrgy, inlastic and lastic scattring, intraction cross sction, Compton wavlngth. Principl Whn photons
More information5.80 Small-Molecule Spectroscopy and Dynamics
MIT OpnCoursWar http://ocw.mit.du 5.80 Small-Molcul Spctroscopy and Dynamics Fall 008 For information about citing ths matrials or our Trms of Us, visit: http://ocw.mit.du/trms. Lctur # 3 Supplmnt Contnts
More informationPrinciples of Humidity Dalton s law
Principls of Humidity Dalton s law Air is a mixtur of diffrnt gass. Th main gas componnts ar: Gas componnt volum [%] wight [%] Nitrogn N 2 78,03 75,47 Oxygn O 2 20,99 23,20 Argon Ar 0,93 1,28 Carbon dioxid
More information2/12/2013. Overview. 12-Power Transmission Text: Conservation of Complex Power. Introduction. Power Transmission-Short Line
//03 Ovrviw -owr Transmission Txt: 4.6-4.0 ECEGR 45 owr ystms Consrvation of Complx owr hort in owr Transmission owr Transmission isualization Radial in Mdium and ong in owr Transmission oltag Collaps
More informationAddition of angular momentum
Addition of angular momntum April, 07 Oftn w nd to combin diffrnt sourcs of angular momntum to charactriz th total angular momntum of a systm, or to divid th total angular momntum into parts to valuat
More informationINFLUENCE OF GROUND SUBSIDENCE IN THE DAMAGE TO MEXICO CITY S PRIMARY WATER SYSTEM DUE TO THE 1985 EARTHQUAKE
13 th World Confrnc on Earthquak Enginring Vancouvr, B.C., Canada August 1-6, 2004 Papr No. 2165 INFLUENCE OF GROUND SUBSIDENCE IN THE DAMAGE TO MEXICO CITY S PRIMARY WATER SYSTEM DUE TO THE 1985 EARTHQUAKE
More informationMath 34A. Final Review
Math A Final Rviw 1) Us th graph of y10 to find approimat valus: a) 50 0. b) y (0.65) solution for part a) first writ an quation: 50 0. now tak th logarithm of both sids: log() log(50 0. ) pand th right
More informationEvaluating Reliability Systems by Using Weibull & New Weibull Extension Distributions Mushtak A.K. Shiker
Evaluating Rliability Systms by Using Wibull & Nw Wibull Extnsion Distributions Mushtak A.K. Shikr مشتاق عبذ الغني شخير Univrsity of Babylon, Collg of Education (Ibn Hayan), Dpt. of Mathmatics Abstract
More informationLINEAR DELAY DIFFERENTIAL EQUATION WITH A POSITIVE AND A NEGATIVE TERM
Elctronic Journal of Diffrntial Equations, Vol. 2003(2003), No. 92, pp. 1 6. ISSN: 1072-6691. URL: http://jd.math.swt.du or http://jd.math.unt.du ftp jd.math.swt.du (login: ftp) LINEAR DELAY DIFFERENTIAL
More informationDealing with quantitative data and problem solving life is a story problem! Attacking Quantitative Problems
Daling with quantitati data and problm soling lif is a story problm! A larg portion of scinc inols quantitati data that has both alu and units. Units can sa your butt! Nd handl on mtric prfixs Dimnsional
More informationde/dx Effectively all charged particles except electrons
de/dx Lt s nxt turn our attntion to how chargd particls los nrgy in mattr To start with w ll considr only havy chargd particls lik muons, pions, protons, alphas, havy ions, Effctivly all chargd particls
More informationPropositional Logic. Combinatorial Problem Solving (CPS) Albert Oliveras Enric Rodríguez-Carbonell. May 17, 2018
Propositional Logic Combinatorial Problm Solving (CPS) Albrt Olivras Enric Rodríguz-Carbonll May 17, 2018 Ovrviw of th sssion Dfinition of Propositional Logic Gnral Concpts in Logic Rduction to SAT CNFs
More informationObserver Bias and Reliability By Xunchi Pu
Obsrvr Bias and Rliability By Xunchi Pu Introduction Clarly all masurmnts or obsrvations nd to b mad as accuratly as possibl and invstigators nd to pay carful attntion to chcking th rliability of thir
More informationCramér-Rao Inequality: Let f(x; θ) be a probability density function with continuous parameter
WHEN THE CRAMÉR-RAO INEQUALITY PROVIDES NO INFORMATION STEVEN J. MILLER Abstract. W invstigat a on-paramtr family of probability dnsitis (rlatd to th Parto distribution, which dscribs many natural phnomna)
More information