Energy-Efficient Primary/Backup Scheduling Techniques for Heterogeneous Multicore Systems

Size: px
Start display at page:

Download "Energy-Efficient Primary/Backup Scheduling Techniques for Heterogeneous Multicore Systems"

Transcription

1 Energy-Effcent Prmary/Backup Schedulng Technques for Heterogeneous Multcore Systems Abhshek Roy, Hakan Aydn epartment of Computer Scence George Mason Unversty Farfax, Vrgna aka Zhu epartment of Computer Scence Unversty of Texas at San Antono San Antono, TX Abstract In ths paper, we consder energy-effcent and faulttolerant schedulng of real-tme tasks on heterogeneous multcore systems. Each task conssts of a man copy and a backup copy whch are scheduled on dfferent cores, for fault tolerance purposes. Our framework delberately delays the backup tasks n order to cancel them dynamcally when the man task copes complete successfully (wthout faults). We dentfy and address two dmensons of the problem,.e., parttonng tasks and determnng processor voltage/frequency levels to mnmze energy consumpton. Our expermental results show that our proposed algorthms performance levels are close to that of an deal soluton wth optmal (but computatonally prohbtve) parttonng and frequency assgnment components. I. INTROUCTION Energy management remans a crucal component for the desgn and mplementaton of embedded systems, ncludng those deployed n safety-crtcal and tme-crtcal applcatons such as those n ndustral control, avoncs, and hghconfdence medcal systems. Recently, heterogeneous (asymmetrc) multcore systems have been embraced by the ndustry due to ther power-effcent desgn and the flexblty they offer n dealng wth dfferent types of workloads. These systems typcally combne the hgh-performance bg cores wth lttle cores that consume less power, at the cost of provdng more modest performance. ARM s bg.little systems that nclude out-of-order and fast cores (such as ARM Cortex-A15) and n-order, energy-effcent cores (such as ARM Cortex A-7) are among the most well-known examples [1]. The man dea n deployng heterogeneous multcore systems s to execute the workload at hand by the core most sutable for the current performance objectve (hgh performance or energy savngs). The research communty has recently addressed several aspects of heterogeneous multcore systems wth a mult-dmensonal effort [2], [3]. Another ncreasngly mportant dmenson, n partcular for safety-crtcal embedded systems, s relablty. Those systems should be able to detect, and recover from, varous types of faults n a tmely manner [4]. The majorty of run-tme faults are categorzed as transent, n that they are shortlved they are typcally nduced by the phenomena such as electromagnetc nterference and cosmc rays. However, they result n erroneous task computaton, and typcally a recovery task, n the form of an alternatve task or a re-executon, s nvoked [4], [5], [6]. It has been reported that the transent faults occurrence rate s ncreasng, n partcular due to the use of aggressve power management technques such as nearthreshold voltage operaton [7]. On the other hand, n case of permanent faults, a processng core becomes unavalable ths s typcally due to the agng effects, harsh envronmental condtons, and manufacturng defects. Toleratng permanent faults requres the deployment of addtonal hardware (such as another core) that can take over the executon of the tasks orgnally allocated to the affected unt [4]. In ths paper, we propose mplementng a fault-tolerant framework on heterogeneous dual-core systems whle keepng the energy consumpton at a mnmum level. Specfcally, we consder a set of real-tme tasks where each task conssts of a prmary (man) copy and a backup copy, that are allocated to dfferent cores. Ths allows the system to tolerate the permanent fault of any sngle core, snce each processor has exactly one copy of each task (prmary or backup) [4]. Moreover, the transent faults detected n all prmary tasks can be recovered from by the executon of the respectve backup task. Our work dffers from the exstng so-called standbysparng frameworks [8], [9], n that:.) we allow schedulng a mx of prmary and back-up tasks on each processor, and,.) we consder heterogeneous multcore systems. Although n [10] we nvestgated a smlar problem n the context of heterogeneous dual-core systems, the focus was agan the standby-sparng confguraton, and the mxng of prmary and backup copes on a gven core was not consdered. To keep the energy consumpton under control, the backup tasks are delayed as much as possble on ther correspondng processors, because a backup can be canceled as soon as the correspondng prmary completes successfully (.e., wthout a fault). Ths also gves a chance to apply ynamc Voltage and Frequency Scalng (VFS) wth maxmum effcency durng the executon of the prmary tasks on each core. We develop and propose schemes,.) to partton all prmary and backup tasks, and,.) assgn frequency (speed) to all the prmary tasks to mnmze the energy consumpton, whle meetng tmng and fault tolerance constrants. Our expermental results suggest that the lst-schedulng based parttonng technques, coupled wth a speed assgnment approach that dynamcally avods the overlaps wth the /17/$31.00 c 2017 IEEE

2 backups, exhbt superor performance whch s close to the theoretcal lower bound n terms of energy consumpton. Our framework drectly ncorporates a salent feature of heterogeneous cores, namely the fact that the energy consumpton and executon tme fgures of dfferent tasks scale by dfferent ratos when executed on dfferent cores [11]. II. SYSTEM MOEL AN ASSUMPTIONS A. Platform and Applcaton model We consder a heterogenous dual-core system wth a hghperformance (bg) core and a low-power (lttle) core. The hghperformance and low-power cores are denoted by HP and, respectvely, throughout the paper. The cores are assumed to have the same nstructon set archtecture, mplyng that the executable of a task can run on ether core. The workload conssts of n ndependent real-tme tasks {,..., τ n } that wll be executed on ths dual-core platform. We assume the frame-based executon model [6], [12] n whch all tasks have the same perod, whch s equal to the common deadlne. Each processng core s equpped wth the ynamc Voltage and Frequency Scalng (VFS) feature that allows changng the frequency (processng speed) at runtme. Moreover, the ynamc Power Management (PM) feature allows a gven core to swtch to a low-power (dle) mode when t s not actvely executng tasks. A task τ that requres C number of cycles on a gven core may take up to W = C /f unts of executon tme on that core, f executed at the frequency level f. ue to the archtectural dfferences, a task s requred number of cycles, and hence executon tme, can be dfferent on the HP and cores. Therefore, we use superscrpts HP and to denote the varables on the HP or the core (C, W, C HP, W HP ). We defne the nomnal utlzaton of a task τ as (C HP /). The maxmum frequency levels supported by the HP and cores are denoted by fmax HP and fmax, respectvely. We assume fmax HP = 1.0, and normalze all other frequency values wth respect to that value. Assocated wth each (prmary) task τ, there s a backup task B wth exact same tmng parameters as those of τ. τ and B are allocated to dfferent cores: should a permanent fault affect any of the processng cores, the alternatve core can take over and fnsh the workload before deadlne. When a prmary copy completes, the acceptance (or, santy) tests [4] are performed to check the exstence of errors nduced by transent faults. If a fault s not detected, the correspondng backup copy (or, ts remanng part) on the other core s canceled. Otherwse, the backup copy runs to completon. B. Power Model The power consumpton characterstcs of the HP and cores dffer by desgn. For any processng core, the dynamc power consumpton of an executng task τ s modeled as, P (f) = a f 3 + α, where a denotes the swtchng capactance, α denotes the frequency-ndependent power consumpton, and f s the processng frequency of the task. ue to the asymmetry of the cores, these parameters are dfferent for each core and agan we use superscrpts HP and to denote core-specfc power parameters (e.g., P HP, α HP ). Each core executes tasks n the actve state, dsspatng power as determned by the characterstcs of the current task and processng frequency. When a core does not execute tasks, t remans n the low-power (dle) state. The low-power (dle) power consumpton of the hgh-performance and low-power cores are denoted by Pdle HP and Pdle, respectvely. We assume those fgures nclude the statc power consumpton of the correspondng core as well. The energy consumpton durng a tme nterval s gven by the aggregate power consumpton durng the same nterval. Exstng research ndcates that scalng down the frequency below a certan threshold s no longer effectve for savng energy, due to the mpact of the frequency-ndependent power component [12]. Ths threshold frequency, known as the energy-effcent frequency (f ee ) can be derved through analytcal technques [12]. Problem Statement: Gven a set of real-tme tasks and a heterogeneous dual-core system, mnmze the energy consumpton by determnng 1) The allocaton of tasks such that the prmary and backup copy of each task are assgned to dfferent cores, and, 2) The processng frequency (speed) assgnment to ndvdual tasks. In the followng secton, we nvestgate these two nterconnected dmensons and propose several effcent schemes. III. PROPOSE SCHEMES Before descrbng the specfc algorthms that we propose, we present a number of general prncples that gude our soluton framework. To start wth, n general, the concurrent executon of a prmary task and ts backup, though possble, s not desrable because t ncurs the full energy cost of the backup executon (Fgure 1a). However, n case when the backup s executon can be delayed, by the tme the prmary completes successfully, ts remanng part can be cancelled (Fgure 1b) 1. (a) Executon wth full prmarybackup overlap (b) Executon wth partal overlap Fg. 1: Concurrent Executon of Prmary and Backup Tasks Ths further suggests that on a gven core, all the prmary tasks must execute before the backup tasks allocated to that core. Moreover, provsons are made to execute all backup tasks at the maxmum frequency on ther respectve cores, 1 Throughout the paper, we show the cancelled part of the backup tasks by dashed patterns n all the fgures.

3 TABLE I: Example Task Set 1 τ 3 B 4 B 5 Fg. 2: Canoncal Executon Order W HP W E HP E τ τ should there be a need obvously ths choce mnmzes ther overlap wth ther respectve prmary tasks on the other core, and n addton, snce faults are rare events, the full speed executon of the backups has only a mnmal mpact on the average-case energy consumpton. Clearly, ths choce also leaves maxmum slow-down opportuntes for the prmary tasks scheduled on that core through VFS. Thus, we defne the canoncal executon order, n whch on a gven core all prmary tasks are started as soon as possble, whereas backup tasks are delayed as much as possble subject to the deadlne constrants, and executed at the maxmum frequency f needed. Fgure 2 shows a canoncal executon on a sngle processng core to whch three prmary tasks (,, τ 3 ), and two backup tasks (B 4 and B 5 ) are assgned. In the rest of the paper, we commt to ths canoncal executon order to execute prmary and backup copes of tasks on all cores, once the parttonng s done. A related framework s the so-called energy-aware standbysparng technque, n whch, one of the cores s desgnated for the prmary tasks and the other one for the backup tasks exclusvely [8], [9], [10]. In our framework, however, for maxmum flexblty, we allow schedulng the prmary and backup copes on both cores, when possble for that reason, we call our framework mxed prmary backup (MPB) assgnment. The schemes we propose consst of task parttonng and speed (frequency) assgnment phases whch are descrbed next. A. Task parttonng Task parttonng, n general, s an ntractable problem; however, a well-known approach s based on the lst-schedulng technque. We frst descrbe two varants based on lst schedulng for our task parttonng phase. Lst-schedulng wth Prmares (). In ths algorthm, we consder the prmary copes of the tasks and employ lstschedulng algorthm to allocate them. Frst, the tasks are ordered accordng to ther decreasng nomnal utlzatons. Then, each prmary task s placed on a processng core that has the maxmum free capacty after the placement. Free capacty C ), where Γ p s the on a core s defned by (f max τ ɛγ p set of all prmary tasks assgned to that core, augmented by the task under consderaton. f max and {C } values are defned n the context of the core under consderaton. Observe that the frst few prmary tasks wll always go to the HP core, untl ts free capacty matches that of the core. Once the dstrbuton of the prmary tasks s complete, a backup copy for each prmary task s allocated to the alternate core. Also, at each stage of the prmary task allocaton, the feasblty of both cores, n terms of tme constrants, are checked. We llustrate the behavor of the algorthm on an example task set gven n Table I. The table gves task executon tmes (n ms), and energy consumpton (E HP, E ) on both cores (n mj), under respectve maxmum frequences. The 4-task set s scheduled on a dual core system wth fmax HP = 1.0 and fmax = 0.8. We also assume Pdle HP = 0.05 and Pdle = 0.02, and for all tasks, a HP = 1.0, a = 0.3, α HP = 0.1 and α = For demonstraton, we use a smple runtme polcy (called statc polcy) n whch, each prmary task s slowed down as much as possble wthout volatng the frame deadlne. The canoncal executon order s adopted on each core. Fgure 3b shows the task allocaton under ths scheme for our example task set n Table I. The frst task, s allocated to the HP core, because t has the most free capacty among the two cores. s allocated to the core whose free capacty s hgher at that tme. Smlarly, tasks τ 3 and τ 4 are allocated to the HP core. It should be noted that, n contrast to the standby-sparng confguraton shown n Fgure 3a (whch uses the parttonng method SlowerP, one of the best-performng scheme n [10]), the extent of prmary-backup overlapped executons s much less n the soluton. Lst-schedulng wth Backups (). Ths algorthm works n the same way as, but ths tme, the backup copes of the tasks are consdered whle parttonng. Once the backup copes are dstrbuted, ther correspondng prmary copes are allocated to the respectve alternate processng cores. By ts very nature, ths algorthm tends to allocate a few ntal prmary tasks to the core, before ther backups are allocated to the HP core thanks to the rule. Fgure 3c shows the task allocaton under ths scheme for our example task set n Table I. Ths parttonng s a mrror mage of the parttonng. It can be noted that, all prmarybackup overlapped executons are avoded. Fxed-Threshold Algorthm (). In ths algorthm, the prmary tasks are at frst ordered accordng to ther decreasng nomnal utlzatons and processed one by one. Tasks are assgned to the core, as long as ts load does not exceed a pre-defned threshold value. Otherwse the prmary task s assgned to the HP core. After each prmary task assgnment, ts backup copy s allocated to the counterpart core. The threshold value can assume any value between 0.0 and 1.0. For our example Task Set 1, ths heurstc produces the taskallocaton shown n Fgure 3d when the threshold value s 0.6. Tasks and are allocated to the core. When task τ 3 s processed, the total used capacty on the core exceeds % f t s assgned to the core. Therefore, t s assgned to the HP core. Smlarly, τ 4 s allocated to the HP core.

4 B 3 B 4 τ 4 τ 3 B 4 B 3 τ 4 τ 3 B 4 B 3 τ 4 τ 3 B 4 B (a) Standby-sparng [10] (b) Lst-schedulng wth prmares (c) Lst-schedulng wth backups Fg. 3: Task parttonng algorthms (d) Fxed-threshold algorthm HP: τ 3 τ 4 HP: HP: HP: : B 3 B 4 : B 3 B 4 : B 4 : B 4 r HP r r HP r r HP r r HP r (a) Intal parttonng (b) After speed assgnment Fg. 4: Statc Speed Assgnment (a) ynamc Backup Cancellaton Fg. 5: ynamc Polces (b) ynamc Backup Cancellaton wth Mnmum Overlap B. Speed assgnment Once the task parttonng phase s complete, the next step s to determne the speed (frequency) of the prmary tasks on each core, whle commttng to the canoncal executon order. Speed assgnment to the prmary tasks s crtcal not only because t determnes drectly the prmary s energy consumpton, but also ndrectly, that of the correspondng backup whose overlap extent may change as a result of that assgnment. Below we propose three speed assgnment polces. Statc Speed Assgnment (SSA). Fgure 4 llustrates the basc prncples of the SSA polcy. The scheme reserves capacty for each allocated backup task (whch runs at the maxmum frequency of the core), and assgns a latest-starttme to each of them such that no deadlnes are mssed. In Fgure 4a, r HP and r denote the latest start tme for the frst backup task on the HP and cores, respectvely. Prmary tasks are slowed down as much as possble, subject to the energy-effcent frequency bound (f ee ). Lettng r denote the start tme of the frst backup task on a specfc core, and Γ P denote the set of all prmary tasks on that core, then, the common frequency that fnshes all these prmary tasks before tme r s gven by f U = ( τ ɛγ P C )/r. Then, each prmary task τ s assgned the frequency f = Max(f ee, f U ). Fgure 4b shows the extended executon tmes for prmary tasks, derved through ths prncple. ynamc Backup Cancellaton (BC). In ths scheme, as n SSA, the processng capacty s reserved for backup tasks and prmary copes are slowed down as much as possble, subject to the energy-effcent frequency. However, the speed assgnment routne s re-nvoked at runtme: each tme a prmary task completes wthout fault, the reserved capacty for ts backup copy s deallocated and used to further slow down the next prmary tasks on that core. For example when τ 3 fnshes wthout error, the reserved capacty for B 3 on the core s reclamed to further slow-down (Fgure 5a). Note that, ths ntroduces some overlapped executon for. In general, when task τ s about to run at tme t, ts speed s chosen as f U = ( τ C ɛγ )/(r t), where Γ s the set of unfnshed prmary tasks on the same core, and r represents the earlest start tme among the unfnshed backup tasks, agan on the same core. When a prmary task completes wthout error, the earlest backup actvaton tme on the alternate processng core s updated at runtme. The chosen speed value s subject to the energy-effcent frequency, therefore, for each task τ, the speed s set to f = Max(f ee, f U ). ynamc Backup Cancellaton wth Mnmum Overlap (MO). Ths scheme works as the BC scheme; but when settng the speed of the prmary tasks at run-tme, t attempts to mnmze the overlapped-executon wth back-ups. As shown n Fgure 5b, when VFS s appled to at the begnnng of ts executon, t s not maxmally slowed down; nstead, the overlapped executon wth s avoded by runnng somewhat faster than the BC polcy. Under ths polcy, the speed of τ s chosen to be f = Mn(f max, f ) where f = C r, where r t s the latest tme the backup copy of τ can be actvated (on the alternatve core) wthout volatng any deadlnes, and t s the current tme. Ths speed s subject to the deadlne constrant and the energy-effcent speed, therefore, f s updated as f = Max(f, f ee, f U ). In ths scheme, f U s re-computed wth a dynamcally updated r value as n the BC scheme. TABLE II: Example Task Set 2 W HP W E HP E τ τ To contrast the mpact of these schemes, we use the 4-

5 f = 0.29 f = 0.29 f = 0.29 f = 0.7 f = 0.7 f = 0.7 f = 0.54 f = 0.7 f = 0.8 B 3 B (a) Statc speed assgnment (SSA) (b) ynamc backup cancellaton (BC) Fg. 6: Executon under dfferent schemes (c) ynamc cancellaton wth mnmum overlap (MO) task set n Table II wth a HP 0.1 and α = 1.0, a = 0.3, α HP = 0.03 for each task. The task set s executed = on a dual core system wth fmax HP = 1.0 and fmax = 0.8. We also assume Pdle HP = 0.05 and Pdle = Fgure 6a shows the executon of the task set under parttonng and statc speed assgnment. The HP core (at the top) uses the energy-effcent frequency for tasks τ 3 and τ 4, and the core (at the bottom) s slowed down maxmally (f = 0.7) so that all backup copes (B 3 and B 4 ) can make ther deadlne. The overall energy consumpton s 24.7 mj. Fgure 6b shows the executon of the same task set under parttonng and BC polcy. The scheme reclams the reserved capacty for the backup copes B 3 and B 4 whose prmares complete wthout fault, and uses ths capacty to further slow down the prmary task to speed f = However, ths ntroduces overlapped executon for, and n ths case, hurts the energy savngs. The overall energy consumpton of ths system s 36.7 mj. Fnally, Fgure 6c shows the executon under parttonng and MO runtme polcy. Although ths scheme could use all the reclamed capacty from B 3 and B 4, t runs at the maxmum speed of the core (f = 0.8) to mnmze the overlap wth. Ths executon yelds an overall energy consumpton of 20.2 mj, whch s 18% lower than that of the statc polcy. IV. EXPERIMENTAL EVALUATION We evaluated the energy consumpton performance of the proposed algorthms n a dscrete event smulator. We smulated dual core systems wth fmax HP = 1.0 and fmax vared from 0.6 to 1.0. ue to space lmtatons, we wll show the results for fmax = 0.8, and analyze the mpact of varyng fmax separately n Secton IV-C. It s known that the power parameters and requred number of cycles for dfferent tasks scale dfferently on heterogeneous systems [11]. Therefore, as n [10], we defne tscale = C, C HP whch models how executon tme changes on the core for a gven task, τ. Moreover, followng [10], we defne pscale to be the rato of power consumpton of τ on the core to that on the HP core. Therefore, pscale = P, whch s also P HP assumed to be the same as a = α a HP. α HP For each experment, the smulator generates a task set contanng n tasks, and a gven total utlzaton, U. The utlzaton value s calculated wth respect to the core (whch s more constraned n terms of performance) and normalzed consderng ts maxmum speed. Hence, U = ( C )/fmax. Based on the target U, we use the RandFxedSum algorthm [13] to assgn a random utlzaton (accordng to unform dstrbuton) to each task such that the total utlzaton equals U. We set the frame deadlne = ms. Next, for each task a tscale and a pscale value are chosen randomly wthn ranges suggested n [11]. Specfcally, 1.4 tscale 2.3 and 1.4 1/(tscale pscale ) 2.1 hold. We assume for all tasks, a HP = 1.0 and α HP = 0.1. In addton, Pdle HP = 0.05 and Pdle = 0.02 for all experments. Each generated task set s parttoned upon the HP and cores accordng to one of the proposed parttonng algorthm. For every partton obtaned n ths way, we smulate the executon accordng to the speed assgnment polces that we suggested, and record the energy consumpton. Every combnaton of a parttonng scheme and a speed assgnment algorthm gves us a vald overall algorthm, whose name s ndcated by the concatenaton of the member schemes (e.g., -SSA, -MO). We use task sets wth n = 10 n all the results shown, but we dscuss the mpact of varyng the number of tasks n Secton IV-C. Every reported data pont s the average of 3000 runs. We report the average energy consumpton n fault-free executons, snce faults are very rare events. The obtaned energy consumpton numbers are normalzed wth respect to the maxmum energy consumpton (observed n the consdered parameter spectrum) of a standby-sparng system wth statc speed assgnment and n whch all the prmary copes are allocated to the core [10]. ue to the multple dmensons of the problem and large number of scheme combnatons, n our evaluaton, we wll adopt a herarchcal approach. We wll frst dscuss the performance of the parttonng algorthms by fxng the speed assgnment polcy. Next, we wll compare the performance of the proposed speed assgnment polces, and also nvestgate the mpact of the chosen threshold value on the algorthm. Fnally, we show the effect of the maxmum speed of the core and the effect of the number of tasks. A. Evaluaton of Parttonng Algorthms We mplemented the followng parttonng schemes n our smulator:

6 Utlzaton (%) TSCALE PSCALE (a) Impact of utlzaton (b) Impact of tscale (c) Impact of pscale Fg. 7: Performance of parttonng algorthms Lst-schedulng wth Prmares () Lst-schedulng wth Backups () Fxed-threshold Algorthm () Standby-sparng () Optmal Parttonng () The optmal parttonng we show n the plots s obtaned by exhaustvely enumeratng all possble task allocatons, and measurng ther runtme energy consumpton, then choosng the best. Ths s mplemented by the exhaustve search whch becomes mpractcal when the number of tasks grows beyond 15. The algorthm s adopted from the SlowerP scheme n [10], because t s shown to be the best-performng one n ts respectve context. The threshold value for the algorthm s fxed as 0.6. The energy consumpton of the parttonng algorthms s shown usng the statc speed assgnment algorthm (SSA); we obtaned smlar trends wth the other (BC and MO) algorthms. Impact of Utlzaton. In Fgure 7a, we show the mpact of utlzaton on normalzed energy consumpton. When the utlzaton s low, the algorthm s performance approaches the optmal one, suggestng that allocatng all prmary tasks to the core, and all the (delayed) backups to the HP core s the best strategy. Ths s because under low load, can fnsh the prmary workload quckly and n a power-effcent way, allowng the backup tasks to get cancelled on the HP core early. Ths s evdent for the scheme too, because t allocates all the prmary workload to one core as well. As the load ncreases, drfts from the optmal scheme and becomes a comparable scheme. Ths s due to the fact that, as the load grows, a more balanced parttonng s preferable whch can allow a sutable dstrbuton of the reserved space for backup copes such that ther actvaton s seldom needed. Both and gve relatvely balanced parttonngs, but generally allocates more prmary copes to the core, wth an energy advantage. scheme, performng very poorly on the low-load case, starts to outperform both and when the utlzaton exceeds %, and comes wthn 5% of the optmal scheme. For heavy load, executng prmary copes on the HP core s preferable because n ths case, the backup copes cannot, n general, get cancelled and executng them at the maxmum speed of the core s preferable to executng them at the maxmum speed of the HP core. For the same reasons, performs the worst for heavy load cases. Impact of tscale. Fgure 7b shows the mpact of tscale on the performance of the parttonng algorthms. tscale s vared wthn the range of 1.4 to 2.3, whch s obtaned from [11]. In general, larger tscale values ndcate that tasks take much longer to complete on the core, despte ts power-effcency. In these experments the utlzaton s fxed at %, and therefore, ncreasng tscale mples addtonal unused capacty on the HP core. We see that performs consstently wthn 3% of the optmal scheme throughout the entre range of tscale. Ths s because executng the prmary copes of the workload on the power-effcent core results n less energy consumpton, and tends to allocate prmary workload to the core., on the other hand, has a tendency to assgn prmary workloads to the HP core, and n general, t lags behnd. comes very close to the performance of as tscale ncreases. Impact of pscale. Fgure 7c shows the mpact of pscale on the performance of the parttonng algorthms. When the core s very power-effcent,.e., pscale s low, and come very close to optmal scheme. Ths s because at the fxed % system load, assgns most of the prmary workload on the core, and that helps savng energy. As pscale grows, drfts away from the optmal scheme the most, because t s no longer effcent to use the core for most of the prmary workload. However, can stll perform wthn 5% of the optmal scheme, because t produces a more balanced parttonng wth a bas to allocate the prmary tasks to the core., whch produces a balanced parttonng wth a bas to assgn the prmary tasks to the HP core, performs poorly for low pscale, but starts to outperform for pscale greater than 0.4 and comes 2% of the optmal scheme.

7 SSA -SSA -BC -BC -MO -MO -Bound 40 Utlzaton (%) TSCALE -BC -SSA -BC -SSA -MO -MO -Bound PSCALE -BC -SSA -MO -SSA -BC -MO -Bound (a) Impact of utlzaton (b) Impact of tscale Fg. 8: Performance of the speed assgnment algorthms (c) Impact of pscale B. Evaluaton of Speed Assgnment Algorthms We mplemented the followng speed assgnment polces. Statc Speed Assgnment (SSA) ynamc Backup Cancellaton (BC) ynamc Backup Cancellaton wth Mnmum Overlap (MO) Bound The Bound algorthm s mplemented as a yardstck speed assgnment algorthm. After parttonng the tasks the executons slots are stll reserved for backup tasks those slots are dynamcally released (as n BC), but no extra energy consumpton s recorded for the overlapped executon of the backup tasks at run-tme. Snce the backup executons essentally ncur zero energy cost, no speed assgnment algorthm can outperform Bound. We matched Bound wth the exhaustve search based Optmal parttonng algorthm, obtanng a combned scheme denoted by -Bound, whch gves the lower bound on the performance of any realstc MPB algorthm. Gven the large number of parttonng/speed assgnment scheme combnatons, for other schemes, we are showng only the results we obtaned wth the best performng parttonng algorthms, namely and. We are usng the Overlap-Aware speed assgnment scheme for, as t s shown to be the best performng scheme for standby-sparng n [10]. Impact of Utlzaton. In Fgure 8a, we see that both - MO and -MO perform wthn 2% of Opt-Bound. Ths s because dynamcally reclamng the capacty for backup tasks and mnmzng overlap whle applyng VFS s a very effectve strategy, as done wthn MO. Ths s also true for at low-load, because t allows some carefully calculated overlapped executon. As the load ncreases, drfts away from Opt-Bound the most, because t has the restrcton that t cannot allocate prmary and backup copes on the same processor. -BC and -BC perform poorly for moderately loaded systems due to the large overlapped executons that t creates. However, for heavy load, backup copes need to run untl deadlne anyway, therefore the performance of the BC scheme mproves. Both -SSA and -SSA offer decent performance levels unless the load s very hgh. Impact of tscale. As we change tscale value (when the load s fxed at %), -MO performs the best and stays wthn 3% of Opt-Bound (Fgure 8b). The next best performng scheme s -MO. Ths agan suggests the superorty of MO thanks to ts dynamc but moderately aggressve approach n applyng VFS whle avodng overlaps. The plot also shows that -BC performs the worst, and -BC performs the worst among all the algorthms. Ths s because BC aggressvely slows down a task wthout regard to the overlapped executon. Impact of pscale. Varyng pscale yelds smlar trends (Fgure 8c). -MO and -MO perform the best, wthn 3% of Opt-Bound, by explotng the overlap avodance strategy of MO. -MO s performance, however, decreases as the core becomes less power-effcent (pscale ncreases). Ths s because, wth less power-effcent core, t s no longer favorable to assgn prmary workload to the core up to a threshold. ue to the aggressve frequency scalng of the BC scheme, -BC performs the worst throughout the entre spectrum. -BC, performng poorly for low pscale, starts to mprove when pscale s greater than 0.4, and comes wthn 1% of Bound. Ths s because when the core s less powereffcent, slowng t down as much as possble proves helpful from the energy consumpton perspectve. C. Addtonal Results Impact of the threshold value n the algorthm. The Fxed-Threshold () algorthm works by allocatng all prmary tasks to the core untl a threshold utlzaton s reached. Fgure 9a shows the mpact of the threshold value on a system that s % loaded and wth MO polcy. The results ndcate that the energy consumpton of decreases as we ncrease the threshold value, and at about 0.45, t outperforms the otherwse best performng algorthm,

8 Threshold (a) Utlzaton = % Threshold (b) Utlzaton = % Fg. 9: Impact of threshold value n algorthm 0.6 -BC -SSA -MO -SSA -BC -MO -Bound Max. speed of the core (a) Impact of the maxmum speed of the core Number of tasks (b) Impact of the number of tasks Fg. 10: Addtonal Evaluatons. Its energy consumpton s mnmzed at some threshold value around 0.. The energy consumpton goes up as we ncrease the threshold and becomes constant at some pont because when the threshold value exceeds the utlzaton, all of the workload s assgned to the core. The thresholdndependent algorthms, naturally yeld a constant energy consumpton. Fgure 9b shows a smlar pattern for a system wth % load. The results suggest that choosng a threshold value n the range [0.5, 0.6] s generally a very good choce when usng the algorthm. Impact of the maxmum speed of the core. In ths set of experments, we vared the maxmum speed of the core whle fxng the load at % for each confguraton (Fgure 10a). The performance of -MO remans wthn 5% of Opt-Bound for the entre regon, suggestng that t s applcable n a wde range of heterogeneous systems. algorthms, on the other hand, tend to drft away from Opt- Bound as the maxmum speed of the two processng cores become close to each other. We also see that the energy consumpton of all schemes ncreases wth ncreasng fmax. Ths s because, when the utlzaton s kept fxed at %, when we ncrease fmax, the effectve amount of workload on the system s ncreased, whch s reflected n the results. Impact of the number of tasks. Fgure 10b shows the mpact of number of tasks for a system wth utlzaton %. We see that for small number of tasks, the performance of all the schemes s affected. As the number of tasks grows, the average task sze decreases and the performances of varous schemes stablze. We can see that performs wthn 3% of the optmal scheme, and s about 3% worse than, for the entre regon. performs worse than the other two, but t also shows stable performance when the number of tasks grows. For the optmal scheme, we could only calculate energy consumptons for up to 17 tasks due to ts prohbtve computatonal complexty. V. CONCLUSION In ths paper, we proposed a fault-tolerant framework mplemented on heterogeneous dual-core systems, and proposed technques that can keep energy consumpton at a mnmum level. We devsed task parttonng algorthms along wth runtme frequency assgnment polces whle takng nto account the dfferent executon-tme and power-parameter scalng factors for applcaton tasks on heterogeneous cores. Our smulaton experments show that our proposed schemes perform very close to the theoretcal lower bound. ACKNOWLEGMENTS Ths work was supported by the US Natonal Scence Foundaton Awards CNS and CNS REFERENCES [1] ARM bg.little Technology. processors/technologes/bglttleprocessng.php. [2] T. S. Muthukaruppan, M. Prcop, V. Venkataraman, T. Mtra, and S. Vshn, Herarchcal power management for asymmetrc mult-core n dark slcon era, n Proc. of ACM/IEEE AC, [3] G. Sngla, G. Kaur, A. K. Unver, and U. Y. Ogras, Predctve dynamc thermal and power management for heterogeneous moble platforms, n Proc. of IEEE ATE, [4] I. Koren and C. M. Krshna, Fault-tolerant systems. Morgan Kaufmann, [5] H. Aydn, R. Melhem, and. Mossé, Toleratng faults whle maxmzng reward, n Proc. of IEEE ECRTS, [6] B. Zhao, H. Aydn, and. Zhu, Shared recovery for energy effcency and relablty enhancements n real-tme applcatons wth precedence constrants, ACM Transactons on esgn Automaton of Electronc Systems (TOAES), vol. 18, no. 2, p. 23, [7]. Ernst, S. as, S. Lee,. Blaauw, T. Austn, T. Mudge, N. S. Km, and K. Flautner, Razor: crcut-level correcton of tmng errors for low-power operaton, IEEE Mcro, vol. 24, no. 6, pp , [8] A. Ejlal, B. M. Al-Hashm, and P. Eles, Low-energy standby-sparng for hard real-tme systems, IEEE Trans. on Computer-Aded esgn of Integrated Crcuts and Systems, vol. 31, no. 3, pp , [9] M. A. Haque, H. Aydn, and. Zhu, Energy-aware standby-sparng technque for perodc real-tme applcatons, n Proc. of IEEE ICC, [10] A. Roy, H. Aydn, and. Zhu, Energy-aware standby-sparng on heterogeneous multcore systems, n Proc. of IEEE/ACM AC, [11] M. Prcop, T. S. Muthukaruppan, V. Venkataraman, T. Mtra, and S. Vshn, Power-performance modelng on asymmetrc mult-cores, n Proc. of IEEE CASES, [12]. Zhu, R. Melhem, and. Mossé, The effects of energy management on relablty n real-tme embedded systems, n Proc. of IEEE ICCA, [13] P. Emberson, R. Stafford, and R. I. avs, Technques for the synthess of multprocessor tasksets, n Proc. of the Int. WS on Analyss Tools and Methodologes for Embedded and Real-tme Systems (WATERS), 2010.

Energy-Aware Standby-Sparing on Heterogeneous Multicore Systems

Energy-Aware Standby-Sparing on Heterogeneous Multicore Systems Energy-Aware Standby-Sparng on Heterogeneous Multcore Systems ABSTRACT Abhshek Roy, Hakan Aydn epartment of Computer Scence George Mason Unversty Farfax, Vrgna 220 aroy6@gmu.edu, aydn@cs.gmu.edu Standby-sparng

More information

Real-Time Systems. Multiprocessor scheduling. Multiprocessor scheduling. Multiprocessor scheduling

Real-Time Systems. Multiprocessor scheduling. Multiprocessor scheduling. Multiprocessor scheduling Real-Tme Systems Multprocessor schedulng Specfcaton Implementaton Verfcaton Multprocessor schedulng -- -- Global schedulng How are tasks assgned to processors? Statc assgnment The processor(s) used for

More information

A FAST HEURISTIC FOR TASKS ASSIGNMENT IN MANYCORE SYSTEMS WITH VOLTAGE-FREQUENCY ISLANDS

A FAST HEURISTIC FOR TASKS ASSIGNMENT IN MANYCORE SYSTEMS WITH VOLTAGE-FREQUENCY ISLANDS Shervn Haamn A FAST HEURISTIC FOR TASKS ASSIGNMENT IN MANYCORE SYSTEMS WITH VOLTAGE-FREQUENCY ISLANDS INTRODUCTION Increasng computatons n applcatons has led to faster processng. o Use more cores n a chp

More information

Resource Allocation with a Budget Constraint for Computing Independent Tasks in the Cloud

Resource Allocation with a Budget Constraint for Computing Independent Tasks in the Cloud Resource Allocaton wth a Budget Constrant for Computng Independent Tasks n the Cloud Wemng Sh and Bo Hong School of Electrcal and Computer Engneerng Georga Insttute of Technology, USA 2nd IEEE Internatonal

More information

Two-Phase Low-Energy N-Modular Redundancy for Hard Real-Time Multi-Core Systems

Two-Phase Low-Energy N-Modular Redundancy for Hard Real-Time Multi-Core Systems 1 Two-Phase Low-Energy N-Modular Redundancy for Hard Real-Tme Mult-Core Systems Mohammad Saleh, Alreza Ejlal, and Bashr M. Al-Hashm, Fellow, IEEE Abstract Ths paper proposes an N-modular redundancy (NMR)

More information

Embedded Systems. 4. Aperiodic and Periodic Tasks

Embedded Systems. 4. Aperiodic and Periodic Tasks Embedded Systems 4. Aperodc and Perodc Tasks Lothar Thele 4-1 Contents of Course 1. Embedded Systems Introducton 2. Software Introducton 7. System Components 10. Models 3. Real-Tme Models 4. Perodc/Aperodc

More information

Partitioned Mixed-Criticality Scheduling on Multiprocessor Platforms

Partitioned Mixed-Criticality Scheduling on Multiprocessor Platforms Parttoned Mxed-Crtcalty Schedulng on Multprocessor Platforms Chuanca Gu 1, Nan Guan 1,2, Qngxu Deng 1 and Wang Y 1,2 1 Northeastern Unversty, Chna 2 Uppsala Unversty, Sweden Abstract Schedulng mxed-crtcalty

More information

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module 3 LOSSY IMAGE COMPRESSION SYSTEMS Verson ECE IIT, Kharagpur Lesson 6 Theory of Quantzaton Verson ECE IIT, Kharagpur Instructonal Objectves At the end of ths lesson, the students should be able to:

More information

Problem Set 9 Solutions

Problem Set 9 Solutions Desgn and Analyss of Algorthms May 4, 2015 Massachusetts Insttute of Technology 6.046J/18.410J Profs. Erk Demane, Srn Devadas, and Nancy Lynch Problem Set 9 Solutons Problem Set 9 Solutons Ths problem

More information

Winter 2008 CS567 Stochastic Linear/Integer Programming Guest Lecturer: Xu, Huan

Winter 2008 CS567 Stochastic Linear/Integer Programming Guest Lecturer: Xu, Huan Wnter 2008 CS567 Stochastc Lnear/Integer Programmng Guest Lecturer: Xu, Huan Class 2: More Modelng Examples 1 Capacty Expanson Capacty expanson models optmal choces of the tmng and levels of nvestments

More information

Single-Facility Scheduling over Long Time Horizons by Logic-based Benders Decomposition

Single-Facility Scheduling over Long Time Horizons by Logic-based Benders Decomposition Sngle-Faclty Schedulng over Long Tme Horzons by Logc-based Benders Decomposton Elvn Coban and J. N. Hooker Tepper School of Busness, Carnege Mellon Unversty ecoban@andrew.cmu.edu, john@hooker.tepper.cmu.edu

More information

Module 9. Lecture 6. Duality in Assignment Problems

Module 9. Lecture 6. Duality in Assignment Problems Module 9 1 Lecture 6 Dualty n Assgnment Problems In ths lecture we attempt to answer few other mportant questons posed n earler lecture for (AP) and see how some of them can be explaned through the concept

More information

COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS

COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS Avalable onlne at http://sck.org J. Math. Comput. Sc. 3 (3), No., 6-3 ISSN: 97-537 COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS

More information

Appendix B: Resampling Algorithms

Appendix B: Resampling Algorithms 407 Appendx B: Resamplng Algorthms A common problem of all partcle flters s the degeneracy of weghts, whch conssts of the unbounded ncrease of the varance of the mportance weghts ω [ ] of the partcles

More information

Structure and Drive Paul A. Jensen Copyright July 20, 2003

Structure and Drive Paul A. Jensen Copyright July 20, 2003 Structure and Drve Paul A. Jensen Copyrght July 20, 2003 A system s made up of several operatons wth flow passng between them. The structure of the system descrbes the flow paths from nputs to outputs.

More information

CHAPTER 5 NUMERICAL EVALUATION OF DYNAMIC RESPONSE

CHAPTER 5 NUMERICAL EVALUATION OF DYNAMIC RESPONSE CHAPTER 5 NUMERICAL EVALUATION OF DYNAMIC RESPONSE Analytcal soluton s usually not possble when exctaton vares arbtrarly wth tme or f the system s nonlnear. Such problems can be solved by numercal tmesteppng

More information

Two Methods to Release a New Real-time Task

Two Methods to Release a New Real-time Task Two Methods to Release a New Real-tme Task Abstract Guangmng Qan 1, Xanghua Chen 2 College of Mathematcs and Computer Scence Hunan Normal Unversty Changsha, 410081, Chna qqyy@hunnu.edu.cn Gang Yao 3 Sebel

More information

Second Order Analysis

Second Order Analysis Second Order Analyss In the prevous classes we looked at a method that determnes the load correspondng to a state of bfurcaton equlbrum of a perfect frame by egenvalye analyss The system was assumed to

More information

Minimizing Energy Consumption of MPI Programs in Realistic Environment

Minimizing Energy Consumption of MPI Programs in Realistic Environment Mnmzng Energy Consumpton of MPI Programs n Realstc Envronment Amna Guermouche, Ncolas Trquenaux, Benoît Pradelle and Wllam Jalby Unversté de Versalles Sant-Quentn-en-Yvelnes arxv:1502.06733v2 [cs.dc] 25

More information

Improved Worst-Case Response-Time Calculations by Upper-Bound Conditions

Improved Worst-Case Response-Time Calculations by Upper-Bound Conditions Improved Worst-Case Response-Tme Calculatons by Upper-Bound Condtons Vctor Pollex, Steffen Kollmann, Karsten Albers and Frank Slomka Ulm Unversty Insttute of Embedded Systems/Real-Tme Systems {frstname.lastname}@un-ulm.de

More information

Operating conditions of a mine fan under conditions of variable resistance

Operating conditions of a mine fan under conditions of variable resistance Paper No. 11 ISMS 216 Operatng condtons of a mne fan under condtons of varable resstance Zhang Ynghua a, Chen L a, b, Huang Zhan a, *, Gao Yukun a a State Key Laboratory of Hgh-Effcent Mnng and Safety

More information

ECE559VV Project Report

ECE559VV Project Report ECE559VV Project Report (Supplementary Notes Loc Xuan Bu I. MAX SUM-RATE SCHEDULING: THE UPLINK CASE We have seen (n the presentaton that, for downlnk (broadcast channels, the strategy maxmzng the sum-rate

More information

NP-Completeness : Proofs

NP-Completeness : Proofs NP-Completeness : Proofs Proof Methods A method to show a decson problem Π NP-complete s as follows. (1) Show Π NP. (2) Choose an NP-complete problem Π. (3) Show Π Π. A method to show an optmzaton problem

More information

Energy and Feasibility Optimal Global Scheduling Framework on big.little platforms

Energy and Feasibility Optimal Global Scheduling Framework on big.little platforms Energy and Feasblty Optmal Global Schedulng Framework on bg.little platforms Hoon Sung Chwa, Jaebaek Seo, Hyuck Yoo Jnkyu Lee, Insk Shn Department of Computer Scence, KAIST, Republc of Korea Department

More information

EEL 6266 Power System Operation and Control. Chapter 3 Economic Dispatch Using Dynamic Programming

EEL 6266 Power System Operation and Control. Chapter 3 Economic Dispatch Using Dynamic Programming EEL 6266 Power System Operaton and Control Chapter 3 Economc Dspatch Usng Dynamc Programmng Pecewse Lnear Cost Functons Common practce many utltes prefer to represent ther generator cost functons as sngle-

More information

The optimal delay of the second test is therefore approximately 210 hours earlier than =2.

The optimal delay of the second test is therefore approximately 210 hours earlier than =2. THE IEC 61508 FORMULAS 223 The optmal delay of the second test s therefore approxmately 210 hours earler than =2. 8.4 The IEC 61508 Formulas IEC 61508-6 provdes approxmaton formulas for the PF for smple

More information

Minimisation of the Average Response Time in a Cluster of Servers

Minimisation of the Average Response Time in a Cluster of Servers Mnmsaton of the Average Response Tme n a Cluster of Servers Valery Naumov Abstract: In ths paper, we consder task assgnment problem n a cluster of servers. We show that optmal statc task assgnment s tantamount

More information

College of Computer & Information Science Fall 2009 Northeastern University 20 October 2009

College of Computer & Information Science Fall 2009 Northeastern University 20 October 2009 College of Computer & Informaton Scence Fall 2009 Northeastern Unversty 20 October 2009 CS7880: Algorthmc Power Tools Scrbe: Jan Wen and Laura Poplawsk Lecture Outlne: Prmal-dual schema Network Desgn:

More information

The Minimum Universal Cost Flow in an Infeasible Flow Network

The Minimum Universal Cost Flow in an Infeasible Flow Network Journal of Scences, Islamc Republc of Iran 17(2): 175-180 (2006) Unversty of Tehran, ISSN 1016-1104 http://jscencesutacr The Mnmum Unversal Cost Flow n an Infeasble Flow Network H Saleh Fathabad * M Bagheran

More information

Lecture Notes on Linear Regression

Lecture Notes on Linear Regression Lecture Notes on Lnear Regresson Feng L fl@sdueducn Shandong Unversty, Chna Lnear Regresson Problem In regresson problem, we am at predct a contnuous target value gven an nput feature vector We assume

More information

Online Appendix to: Axiomatization and measurement of Quasi-hyperbolic Discounting

Online Appendix to: Axiomatization and measurement of Quasi-hyperbolic Discounting Onlne Appendx to: Axomatzaton and measurement of Quas-hyperbolc Dscountng José Lus Montel Olea Tomasz Strzaleck 1 Sample Selecton As dscussed before our ntal sample conssts of two groups of subjects. Group

More information

Chapter - 2. Distribution System Power Flow Analysis

Chapter - 2. Distribution System Power Flow Analysis Chapter - 2 Dstrbuton System Power Flow Analyss CHAPTER - 2 Radal Dstrbuton System Load Flow 2.1 Introducton Load flow s an mportant tool [66] for analyzng electrcal power system network performance. Load

More information

Overhead-Aware Compositional Analysis of Real-Time Systems

Overhead-Aware Compositional Analysis of Real-Time Systems Overhead-Aware ompostonal Analyss of Real-Tme Systems Lnh T.X. Phan, Meng Xu, Jaewoo Lee, nsup Lee, Oleg Sokolsky PRESE enter Department of omputer and nformaton Scence Unversty of Pennsylvana ompostonal

More information

An Admission Control Algorithm in Cloud Computing Systems

An Admission Control Algorithm in Cloud Computing Systems An Admsson Control Algorthm n Cloud Computng Systems Authors: Frank Yeong-Sung Ln Department of Informaton Management Natonal Tawan Unversty Tape, Tawan, R.O.C. ysln@m.ntu.edu.tw Yngje Lan Management Scence

More information

Simultaneous Optimization of Berth Allocation, Quay Crane Assignment and Quay Crane Scheduling Problems in Container Terminals

Simultaneous Optimization of Berth Allocation, Quay Crane Assignment and Quay Crane Scheduling Problems in Container Terminals Smultaneous Optmzaton of Berth Allocaton, Quay Crane Assgnment and Quay Crane Schedulng Problems n Contaner Termnals Necat Aras, Yavuz Türkoğulları, Z. Caner Taşkın, Kuban Altınel Abstract In ths work,

More information

CONTRAST ENHANCEMENT FOR MIMIMUM MEAN BRIGHTNESS ERROR FROM HISTOGRAM PARTITIONING INTRODUCTION

CONTRAST ENHANCEMENT FOR MIMIMUM MEAN BRIGHTNESS ERROR FROM HISTOGRAM PARTITIONING INTRODUCTION CONTRAST ENHANCEMENT FOR MIMIMUM MEAN BRIGHTNESS ERROR FROM HISTOGRAM PARTITIONING N. Phanthuna 1,2, F. Cheevasuvt 2 and S. Chtwong 2 1 Department of Electrcal Engneerng, Faculty of Engneerng Rajamangala

More information

Effective Power Optimization combining Placement, Sizing, and Multi-Vt techniques

Effective Power Optimization combining Placement, Sizing, and Multi-Vt techniques Effectve Power Optmzaton combnng Placement, Szng, and Mult-Vt technques Tao Luo, Davd Newmark*, and Davd Z Pan Department of Electrcal and Computer Engneerng, Unversty of Texas at Austn *Advanced Mcro

More information

Optimal Static Partition Configuration in ARINC653 System

Optimal Static Partition Configuration in ARINC653 System JOURNAL OF ELECTRONIC SCIENCE AND TECHNOLOGY, VOL. 9, NO. 4, DECEMBER 7 Optmal Statc rtton Confguraton n ARINC6 System Sheng-Ln Gu, Le Luo, Sen-Sen Tang, and Yang Meng Abstract ARINC6 systems, whch have

More information

Annexes. EC.1. Cycle-base move illustration. EC.2. Problem Instances

Annexes. EC.1. Cycle-base move illustration. EC.2. Problem Instances ec Annexes Ths Annex frst llustrates a cycle-based move n the dynamc-block generaton tabu search. It then dsplays the characterstcs of the nstance sets, followed by detaled results of the parametercalbraton

More information

FREQUENCY DISTRIBUTIONS Page 1 of The idea of a frequency distribution for sets of observations will be introduced,

FREQUENCY DISTRIBUTIONS Page 1 of The idea of a frequency distribution for sets of observations will be introduced, FREQUENCY DISTRIBUTIONS Page 1 of 6 I. Introducton 1. The dea of a frequency dstrbuton for sets of observatons wll be ntroduced, together wth some of the mechancs for constructng dstrbutons of data. Then

More information

Global EDF Scheduling for Parallel Real-Time Tasks

Global EDF Scheduling for Parallel Real-Time Tasks Washngton Unversty n St. Lous Washngton Unversty Open Scholarshp Engneerng and Appled Scence Theses & Dssertatons Engneerng and Appled Scence Sprng 5-15-2014 Global EDF Schedulng for Parallel Real-Tme

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

TOPICS MULTIPLIERLESS FILTER DESIGN ELEMENTARY SCHOOL ALGORITHM MULTIPLICATION

TOPICS MULTIPLIERLESS FILTER DESIGN ELEMENTARY SCHOOL ALGORITHM MULTIPLICATION 1 2 MULTIPLIERLESS FILTER DESIGN Realzaton of flters wthout full-fledged multplers Some sldes based on support materal by W. Wolf for hs book Modern VLSI Desgn, 3 rd edton. Partly based on followng papers:

More information

CHAPTER 17 Amortized Analysis

CHAPTER 17 Amortized Analysis CHAPTER 7 Amortzed Analyss In an amortzed analyss, the tme requred to perform a sequence of data structure operatons s averaged over all the operatons performed. It can be used to show that the average

More information

A new construction of 3-separable matrices via an improved decoding of Macula s construction

A new construction of 3-separable matrices via an improved decoding of Macula s construction Dscrete Optmzaton 5 008 700 704 Contents lsts avalable at ScenceDrect Dscrete Optmzaton journal homepage: wwwelsevercom/locate/dsopt A new constructon of 3-separable matrces va an mproved decodng of Macula

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have

More information

Uncertainty in measurements of power and energy on power networks

Uncertainty in measurements of power and energy on power networks Uncertanty n measurements of power and energy on power networks E. Manov, N. Kolev Department of Measurement and Instrumentaton, Techncal Unversty Sofa, bul. Klment Ohrdsk No8, bl., 000 Sofa, Bulgara Tel./fax:

More information

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U) Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of

More information

find (x): given element x, return the canonical element of the set containing x;

find (x): given element x, return the canonical element of the set containing x; COS 43 Sprng, 009 Dsjont Set Unon Problem: Mantan a collecton of dsjont sets. Two operatons: fnd the set contanng a gven element; unte two sets nto one (destructvely). Approach: Canoncal element method:

More information

Kernel Methods and SVMs Extension

Kernel Methods and SVMs Extension Kernel Methods and SVMs Extenson The purpose of ths document s to revew materal covered n Machne Learnng 1 Supervsed Learnng regardng support vector machnes (SVMs). Ths document also provdes a general

More information

Economics 101. Lecture 4 - Equilibrium and Efficiency

Economics 101. Lecture 4 - Equilibrium and Efficiency Economcs 0 Lecture 4 - Equlbrum and Effcency Intro As dscussed n the prevous lecture, we wll now move from an envronment where we looed at consumers mang decsons n solaton to analyzng economes full of

More information

Energy-Efficient Scheduling Fixed-Priority tasks with Preemption Thresholds on Variable Voltage Processors

Energy-Efficient Scheduling Fixed-Priority tasks with Preemption Thresholds on Variable Voltage Processors Energy-Effcent Schedulng Fxed-Prorty tasks wth Preempton Thresholds on Varable Voltage Processors XaoChuan He, Yan Ja Insttute of Network Technology and Informaton Securty School of Computer Scence Natonal

More information

EEE 241: Linear Systems

EEE 241: Linear Systems EEE : Lnear Systems Summary #: Backpropagaton BACKPROPAGATION The perceptron rule as well as the Wdrow Hoff learnng were desgned to tran sngle layer networks. They suffer from the same dsadvantage: they

More information

Parametric Utilization Bounds for Fixed-Priority Multiprocessor Scheduling

Parametric Utilization Bounds for Fixed-Priority Multiprocessor Scheduling 2012 IEEE 26th Internatonal Parallel and Dstrbuted Processng Symposum Parametrc Utlzaton Bounds for Fxed-Prorty Multprocessor Schedulng Nan Guan 1,2, Martn Stgge 1, Wang Y 1,2 and Ge Yu 2 1 Uppsala Unversty,

More information

Grover s Algorithm + Quantum Zeno Effect + Vaidman

Grover s Algorithm + Quantum Zeno Effect + Vaidman Grover s Algorthm + Quantum Zeno Effect + Vadman CS 294-2 Bomb 10/12/04 Fall 2004 Lecture 11 Grover s algorthm Recall that Grover s algorthm for searchng over a space of sze wors as follows: consder the

More information

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.

More information

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA 4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected

More information

DUE: WEDS FEB 21ST 2018

DUE: WEDS FEB 21ST 2018 HOMEWORK # 1: FINITE DIFFERENCES IN ONE DIMENSION DUE: WEDS FEB 21ST 2018 1. Theory Beam bendng s a classcal engneerng analyss. The tradtonal soluton technque makes smplfyng assumptons such as a constant

More information

Clock-Gating and Its Application to Low Power Design of Sequential Circuits

Clock-Gating and Its Application to Low Power Design of Sequential Circuits Clock-Gatng and Its Applcaton to Low Power Desgn of Sequental Crcuts ng WU Department of Electrcal Engneerng-Systems, Unversty of Southern Calforna Los Angeles, CA 989, USA, Phone: (23)74-448 Massoud PEDRAM

More information

Variability-Driven Module Selection with Joint Design Time Optimization and Post-Silicon Tuning

Variability-Driven Module Selection with Joint Design Time Optimization and Post-Silicon Tuning Asa and South Pacfc Desgn Automaton Conference 2008 Varablty-Drven Module Selecton wth Jont Desgn Tme Optmzaton and Post-Slcon Tunng Feng Wang, Xaoxa Wu, Yuan Xe The Pennsylvana State Unversty Department

More information

Queueing Networks II Network Performance

Queueing Networks II Network Performance Queueng Networks II Network Performance Davd Tpper Assocate Professor Graduate Telecommuncatons and Networkng Program Unversty of Pttsburgh Sldes 6 Networks of Queues Many communcaton systems must be modeled

More information

A LINEAR PROGRAM TO COMPARE MULTIPLE GROSS CREDIT LOSS FORECASTS. Dr. Derald E. Wentzien, Wesley College, (302) ,

A LINEAR PROGRAM TO COMPARE MULTIPLE GROSS CREDIT LOSS FORECASTS. Dr. Derald E. Wentzien, Wesley College, (302) , A LINEAR PROGRAM TO COMPARE MULTIPLE GROSS CREDIT LOSS FORECASTS Dr. Derald E. Wentzen, Wesley College, (302) 736-2574, wentzde@wesley.edu ABSTRACT A lnear programmng model s developed and used to compare

More information

CSci 6974 and ECSE 6966 Math. Tech. for Vision, Graphics and Robotics Lecture 21, April 17, 2006 Estimating A Plane Homography

CSci 6974 and ECSE 6966 Math. Tech. for Vision, Graphics and Robotics Lecture 21, April 17, 2006 Estimating A Plane Homography CSc 6974 and ECSE 6966 Math. Tech. for Vson, Graphcs and Robotcs Lecture 21, Aprl 17, 2006 Estmatng A Plane Homography Overvew We contnue wth a dscusson of the major ssues, usng estmaton of plane projectve

More information

Chapter Newton s Method

Chapter Newton s Method Chapter 9. Newton s Method After readng ths chapter, you should be able to:. Understand how Newton s method s dfferent from the Golden Secton Search method. Understand how Newton s method works 3. Solve

More information

Assortment Optimization under MNL

Assortment Optimization under MNL Assortment Optmzaton under MNL Haotan Song Aprl 30, 2017 1 Introducton The assortment optmzaton problem ams to fnd the revenue-maxmzng assortment of products to offer when the prces of products are fxed.

More information

The Study of Teaching-learning-based Optimization Algorithm

The Study of Teaching-learning-based Optimization Algorithm Advanced Scence and Technology Letters Vol. (AST 06), pp.05- http://dx.do.org/0.57/astl.06. The Study of Teachng-learnng-based Optmzaton Algorthm u Sun, Yan fu, Lele Kong, Haolang Q,, Helongang Insttute

More information

9 Derivation of Rate Equations from Single-Cell Conductance (Hodgkin-Huxley-like) Equations

9 Derivation of Rate Equations from Single-Cell Conductance (Hodgkin-Huxley-like) Equations Physcs 171/271 - Chapter 9R -Davd Klenfeld - Fall 2005 9 Dervaton of Rate Equatons from Sngle-Cell Conductance (Hodgkn-Huxley-lke) Equatons We consder a network of many neurons, each of whch obeys a set

More information

Pop-Click Noise Detection Using Inter-Frame Correlation for Improved Portable Auditory Sensing

Pop-Click Noise Detection Using Inter-Frame Correlation for Improved Portable Auditory Sensing Advanced Scence and Technology Letters, pp.164-168 http://dx.do.org/10.14257/astl.2013 Pop-Clc Nose Detecton Usng Inter-Frame Correlaton for Improved Portable Audtory Sensng Dong Yun Lee, Kwang Myung Jeon,

More information

8 Derivation of Network Rate Equations from Single- Cell Conductance Equations

8 Derivation of Network Rate Equations from Single- Cell Conductance Equations Physcs 178/278 - Davd Klenfeld - Wnter 2019 8 Dervaton of Network Rate Equatons from Sngle- Cell Conductance Equatons Our goal to derve the form of the abstract quanttes n rate equatons, such as synaptc

More information

Lossy Compression. Compromise accuracy of reconstruction for increased compression.

Lossy Compression. Compromise accuracy of reconstruction for increased compression. Lossy Compresson Compromse accuracy of reconstructon for ncreased compresson. The reconstructon s usually vsbly ndstngushable from the orgnal mage. Typcally, one can get up to 0:1 compresson wth almost

More information

8 Derivation of Network Rate Equations from Single- Cell Conductance Equations

8 Derivation of Network Rate Equations from Single- Cell Conductance Equations Physcs 178/278 - Davd Klenfeld - Wnter 2015 8 Dervaton of Network Rate Equatons from Sngle- Cell Conductance Equatons We consder a network of many neurons, each of whch obeys a set of conductancebased,

More information

Lecture 12: Discrete Laplacian

Lecture 12: Discrete Laplacian Lecture 12: Dscrete Laplacan Scrbe: Tanye Lu Our goal s to come up wth a dscrete verson of Laplacan operator for trangulated surfaces, so that we can use t n practce to solve related problems We are mostly

More information

Topic 23 - Randomized Complete Block Designs (RCBD)

Topic 23 - Randomized Complete Block Designs (RCBD) Topc 3 ANOVA (III) 3-1 Topc 3 - Randomzed Complete Block Desgns (RCBD) Defn: A Randomzed Complete Block Desgn s a varant of the completely randomzed desgn (CRD) that we recently learned. In ths desgn,

More information

Outline. Communication. Bellman Ford Algorithm. Bellman Ford Example. Bellman Ford Shortest Path [1]

Outline. Communication. Bellman Ford Algorithm. Bellman Ford Example. Bellman Ford Shortest Path [1] DYNAMIC SHORTEST PATH SEARCH AND SYNCHRONIZED TASK SWITCHING Jay Wagenpfel, Adran Trachte 2 Outlne Shortest Communcaton Path Searchng Bellmann Ford algorthm Algorthm for dynamc case Modfcatons to our algorthm

More information

Assignment 5. Simulation for Logistics. Monti, N.E. Yunita, T.

Assignment 5. Simulation for Logistics. Monti, N.E. Yunita, T. Assgnment 5 Smulaton for Logstcs Mont, N.E. Yunta, T. November 26, 2007 1. Smulaton Desgn The frst objectve of ths assgnment s to derve a 90% two-sded Confdence Interval (CI) for the average watng tme

More information

Calculation of time complexity (3%)

Calculation of time complexity (3%) Problem 1. (30%) Calculaton of tme complexty (3%) Gven n ctes, usng exhaust search to see every result takes O(n!). Calculaton of tme needed to solve the problem (2%) 40 ctes:40! dfferent tours 40 add

More information

Markov Chain Monte Carlo Lecture 6

Markov Chain Monte Carlo Lecture 6 where (x 1,..., x N ) X N, N s called the populaton sze, f(x) f (x) for at least one {1, 2,..., N}, and those dfferent from f(x) are called the tral dstrbutons n terms of mportance samplng. Dfferent ways

More information

Chapter 6. Supplemental Text Material

Chapter 6. Supplemental Text Material Chapter 6. Supplemental Text Materal S6-. actor Effect Estmates are Least Squares Estmates We have gven heurstc or ntutve explanatons of how the estmates of the factor effects are obtaned n the textboo.

More information

Online Appendix: Reciprocity with Many Goods

Online Appendix: Reciprocity with Many Goods T D T A : O A Kyle Bagwell Stanford Unversty and NBER Robert W. Stager Dartmouth College and NBER March 2016 Abstract Ths onlne Appendx extends to a many-good settng the man features of recprocty emphaszed

More information

On balancing multiple video streams with distributed QoS control in mobile communications

On balancing multiple video streams with distributed QoS control in mobile communications On balancng multple vdeo streams wth dstrbuted QoS control n moble communcatons Arjen van der Schaaf, José Angel Lso Arellano, and R. (Inald) L. Lagendjk TU Delft, Mekelweg 4, 68 CD Delft, The Netherlands

More information

On the Multicriteria Integer Network Flow Problem

On the Multicriteria Integer Network Flow Problem BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 5, No 2 Sofa 2005 On the Multcrtera Integer Network Flow Problem Vassl Vasslev, Marana Nkolova, Maryana Vassleva Insttute of

More information

Chapter 5 Multilevel Models

Chapter 5 Multilevel Models Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level

More information

Additional Codes using Finite Difference Method. 1 HJB Equation for Consumption-Saving Problem Without Uncertainty

Additional Codes using Finite Difference Method. 1 HJB Equation for Consumption-Saving Problem Without Uncertainty Addtonal Codes usng Fnte Dfference Method Benamn Moll 1 HJB Equaton for Consumpton-Savng Problem Wthout Uncertanty Before consderng the case wth stochastc ncome n http://www.prnceton.edu/~moll/ HACTproect/HACT_Numercal_Appendx.pdf,

More information

Fixed-Priority Multiprocessor Scheduling with Liu & Layland s Utilization Bound

Fixed-Priority Multiprocessor Scheduling with Liu & Layland s Utilization Bound Fxed-Prorty Multprocessor Schedulng wth Lu & Layland s Utlzaton Bound Nan Guan, Martn Stgge, Wang Y and Ge Yu Department of Informaton Technology, Uppsala Unversty, Sweden Department of Computer Scence

More information

A Robust Method for Calculating the Correlation Coefficient

A Robust Method for Calculating the Correlation Coefficient A Robust Method for Calculatng the Correlaton Coeffcent E.B. Nven and C. V. Deutsch Relatonshps between prmary and secondary data are frequently quantfed usng the correlaton coeffcent; however, the tradtonal

More information

= z 20 z n. (k 20) + 4 z k = 4

= z 20 z n. (k 20) + 4 z k = 4 Problem Set #7 solutons 7.2.. (a Fnd the coeffcent of z k n (z + z 5 + z 6 + z 7 + 5, k 20. We use the known seres expanson ( n+l ( z l l z n below: (z + z 5 + z 6 + z 7 + 5 (z 5 ( + z + z 2 + z + 5 5

More information

Difference Equations

Difference Equations Dfference Equatons c Jan Vrbk 1 Bascs Suppose a sequence of numbers, say a 0,a 1,a,a 3,... s defned by a certan general relatonshp between, say, three consecutve values of the sequence, e.g. a + +3a +1

More information

A Simple Inventory System

A Simple Inventory System A Smple Inventory System Lawrence M. Leems and Stephen K. Park, Dscrete-Event Smulaton: A Frst Course, Prentce Hall, 2006 Hu Chen Computer Scence Vrgna State Unversty Petersburg, Vrgna February 8, 2017

More information

Min Cut, Fast Cut, Polynomial Identities

Min Cut, Fast Cut, Polynomial Identities Randomzed Algorthms, Summer 016 Mn Cut, Fast Cut, Polynomal Identtes Instructor: Thomas Kesselhem and Kurt Mehlhorn 1 Mn Cuts n Graphs Lecture (5 pages) Throughout ths secton, G = (V, E) s a mult-graph.

More information

J. Parallel Distrib. Comput.

J. Parallel Distrib. Comput. J. Parallel Dstrb. Comput. 7 (20) 537 555 Contents lsts avalable at ScenceDrect J. Parallel Dstrb. Comput. ournal homepage: www.elsever.com/locate/pdc Game-theoretc statc load balancng for dstrbuted systems

More information

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests Smulated of the Cramér-von Mses Goodness-of-Ft Tests Steele, M., Chaselng, J. and 3 Hurst, C. School of Mathematcal and Physcal Scences, James Cook Unversty, Australan School of Envronmental Studes, Grffth

More information

Lecture 14: Bandits with Budget Constraints

Lecture 14: Bandits with Budget Constraints IEOR 8100-001: Learnng and Optmzaton for Sequental Decson Makng 03/07/16 Lecture 14: andts wth udget Constrants Instructor: Shpra Agrawal Scrbed by: Zhpeng Lu 1 Problem defnton In the regular Mult-armed

More information

Global Sensitivity. Tuesday 20 th February, 2018

Global Sensitivity. Tuesday 20 th February, 2018 Global Senstvty Tuesday 2 th February, 28 ) Local Senstvty Most senstvty analyses [] are based on local estmates of senstvty, typcally by expandng the response n a Taylor seres about some specfc values

More information

A PROBABILITY-DRIVEN SEARCH ALGORITHM FOR SOLVING MULTI-OBJECTIVE OPTIMIZATION PROBLEMS

A PROBABILITY-DRIVEN SEARCH ALGORITHM FOR SOLVING MULTI-OBJECTIVE OPTIMIZATION PROBLEMS HCMC Unversty of Pedagogy Thong Nguyen Huu et al. A PROBABILITY-DRIVEN SEARCH ALGORITHM FOR SOLVING MULTI-OBJECTIVE OPTIMIZATION PROBLEMS Thong Nguyen Huu and Hao Tran Van Department of mathematcs-nformaton,

More information

Temperature. Chapter Heat Engine

Temperature. Chapter Heat Engine Chapter 3 Temperature In prevous chapters of these notes we ntroduced the Prncple of Maxmum ntropy as a technque for estmatng probablty dstrbutons consstent wth constrants. In Chapter 9 we dscussed the

More information

LOW BIAS INTEGRATED PATH ESTIMATORS. James M. Calvin

LOW BIAS INTEGRATED PATH ESTIMATORS. James M. Calvin Proceedngs of the 007 Wnter Smulaton Conference S G Henderson, B Bller, M-H Hseh, J Shortle, J D Tew, and R R Barton, eds LOW BIAS INTEGRATED PATH ESTIMATORS James M Calvn Department of Computer Scence

More information

Lecture 4: November 17, Part 1 Single Buffer Management

Lecture 4: November 17, Part 1 Single Buffer Management Lecturer: Ad Rosén Algorthms for the anagement of Networs Fall 2003-2004 Lecture 4: November 7, 2003 Scrbe: Guy Grebla Part Sngle Buffer anagement In the prevous lecture we taled about the Combned Input

More information

Joint Scheduling and Power-Allocation for Interference Management in Wireless Networks

Joint Scheduling and Power-Allocation for Interference Management in Wireless Networks Jont Schedulng and Power-Allocaton for Interference Management n Wreless Networks Xn Lu *, Edwn K. P. Chong, and Ness B. Shroff * * School of Electrcal and Computer Engneerng Purdue Unversty West Lafayette,

More information

Copyright 2017 by Taylor Enterprises, Inc., All Rights Reserved. Adjusted Control Limits for U Charts. Dr. Wayne A. Taylor

Copyright 2017 by Taylor Enterprises, Inc., All Rights Reserved. Adjusted Control Limits for U Charts. Dr. Wayne A. Taylor Taylor Enterprses, Inc. Adjusted Control Lmts for U Charts Copyrght 207 by Taylor Enterprses, Inc., All Rghts Reserved. Adjusted Control Lmts for U Charts Dr. Wayne A. Taylor Abstract: U charts are used

More information

Week3, Chapter 4. Position and Displacement. Motion in Two Dimensions. Instantaneous Velocity. Average Velocity

Week3, Chapter 4. Position and Displacement. Motion in Two Dimensions. Instantaneous Velocity. Average Velocity Week3, Chapter 4 Moton n Two Dmensons Lecture Quz A partcle confned to moton along the x axs moves wth constant acceleraton from x =.0 m to x = 8.0 m durng a 1-s tme nterval. The velocty of the partcle

More information