Energy-Efficient Task Scheduling on Multiple Heterogeneous Computers: Algorithms, Analysis, and Performance Evaluation

Size: px
Start display at page:

Download "Energy-Efficient Task Scheduling on Multiple Heterogeneous Computers: Algorithms, Analysis, and Performance Evaluation"

Transcription

1 IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, VOL., NO., JANUARY-JUNE Energy-Efficient Tas Scheduing on Mutipe Heterogeneous Computers: Agorithms, Anaysis, and Performance Evauation Keqin Li, Feow, IEEE Abstract The probems of energy-constrained and time-constrained tas scheduing on mutipe heterogeneous computers are investigated as combinatoria optimization probems. For a given set of independent tass, our strategy is to find a schedue of the tass first, and then find a power aocation to the tass, where the power aocation is performed in such a way that the tota tas execution time or the tota energy consumption is minimized. We are abe to find an optima partition of a given woroad and use a modified ist scheduing (MLS) agorithm to generate a partition of the set of tass that is an approximation of the optima woroad partition. Our simuation resuts demonstrate that when compared with optima soutions, the MLS agorithm has exceent expected performance in soving the probems of energy-constrained and time-constrained scheduing of independent tass on mutipe heterogeneous computers. For precedence constrained tass represented by a directed acycic graph (dag), our eve-by-eve modified ist scheduing (LL-MLS) agorithm schedues tass eve by eve (LL) and schedues tass in the same eve by using the MLS agorithm. We are abe to sove the probems of optima energy/time aocation to the eves and optima woroad partition for a eves in a given dag. Our simuation resuts demonstrate that when compared with optima soutions, the LL-MLS agorithm has exceent expected performance in soving the probems of energy-constrained and time-constrained scheduing of precedence constrained tass on mutipe heterogeneous computers. Index Terms Energy-constrained scheduing, expected performance bound, modified ist scheduing, mutipe heterogeneous computers, time-constrained scheduing Ç INTRODUCTION. Motivation HETEROGENEOUS system architectures (HSA) [] have attracted growing interest in modern high-performance computing to achieve higher performance/power ratio. In a recent (November 205) Top500 isting, there are 04 supercomputing systems which adopt HSA. The Sugon Custer W780I system equipped with Xeon E5-2640v3 8C 2.6 GHz and NVIDIA Tesa K80 (instaed in the Institute of Modern Physics of Chinese Academy of Sciences) can achieve the power efficiency of Gfops/Watt. In the area of coud computing, there is a consensus that heterogeneous server architectures wi dominate the future data centers [33]. A typica future heterogeneous data center contains speciaized servers and acceerators incuding graphica processing units (GPUs), fied-programmabe gate arrays (FPGAs), and digita signa processors (DSPs) various storage systems such as networ fie system (NSF) and Hadoop distributed fie system (HDFS) and fexibe interconnects such as Gigabit Ethernet and InfiniBand [9]. A heterogeneous architecture is aso abe to provide ow-power and high-throughput cores, aong with appication-specific acceerators for arge-scae appications [3]. Notice that The author is with the Department of Computer Science, State University of New Yor, New Patz, NY 256. E-mai: i@newpatz.edu. Manuscript received 3 Feb. 206 revised 3 Oct. 206 accepted 30 Oct Date of pubication Nov. 206 date of current version 5 Jan Recommended for acceptance by R.G. Mehem. For information on obtaining reprints of this artice, pease send e-mai to: reprints@ieee.org, and reference the Digita Obect Identifier beow. Digita Obect Identifier no. 0.09/TSUSC heterogeneity exists not ony within a server [8], but aso between the servers [28]. It is a chaenge on how to effectivey and efficienty manage heterogeneous processors in current and future supercomputing systems and coud computing patforms. The probem becomes more difficut when processors are equipped with the capabiity of dynamic votage and frequency scaing. Virtuay a existing studies on energy-efficient tas scheduing on mutiprocessors or mutipe computers assume that a the processors or computers have the same power consumption mode, i.e., the power consumption is proportiona to the execution speed raised to the power a, where a is the same for a processors or computers. However, it is we nown that processors from different manufacturers and vendors have very different characteristics of performance (i.e., CPU/core speed) and power consumption [2]. Unfortunatey, there has been itte study on energy-efficient tas scheduing on mutipe heterogeneous computers which are characterized by different vaues of a. The motivation of this paper is to mae some initia effort in this direction. Such investigation is certainy of theoretica interest and practica importance, since a arge-scae data center may empoy different servers manufactured by different vendors with different technoogies, which resut in different power consumption modes..2 Our Contributions In this paper, we investigate the probems of energy-constrained and time-constrained tas scheduing on mutipe heterogeneous computers. We define these probems as combinatoria optimization probems. Given a set of tass ß 206 IEEE. Persona use is permitted, but repubication/redistribution requires IEEE permission. See for more information.

2 8 IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, VOL., NO., JANUARY-JUNE 206 with an energy budget, the energy-constrained scheduing probem is to find a nonpreemptive schedue of the tass, such that the tota energy consumption does not exceed the given energy budget and that the tota tas execution time is minimized. Given a set of tass with a time deadine, the time-constrained scheduing probem is to find a nonpreemptive schedue of the tass, such that the tota tas execution time does not exceed the given time deadine and that the tota energy consumption is minimized. These probems have been investigated for independent/precedence-constrained and sequentia/parae tass on homogeneous computers [4], [5], [6], [8], [9], [20]. We find that for independent tass, both probems contain two subprobems, i.e., power aocation and tas scheduing. For a given set of tass, our strategy is to find a schedue of the tass first, and then find a power aocation to the tass, where the power aocation is performed in such a way that the tota tas execution time or the tota energy consumption is minimized (Theorems and 2). Our investigation reveas the fact that both scheduing probems can be reduced to the probem of finding an optima partition of the given set of tass into disoint subsets. Fortunatey, we are abe to find an optima partition of a given woroad (Theorems 3 and 4), and use a modified ist scheduing (MLS) agorithm to generate a partition of the set of tass that is an approximation of the optima woroad partition. Our simuation resuts demonstrate that when compared with optima soutions, the MLS agorithm has exceent expected performance in soving the probems of energy-constrained and time-constrained scheduing of independent tass on mutipe heterogeneous computers. For precedence constrained tass represented by a directed acycic graph (dag), there is an additiona subprobem of deaing with tas inter-dependency. Our strategy to hande precedence constraints is to schedue tass eve by eve (LL) and to schedue tass in the same eve by using the MLS agorithm. Hence, our agorithm is caed eve-byeve modified ist scheduing (LL-MLS) agorithm. The ey issue in using the LL scheduing method is to find an optima energy or time aocation to the eves. Fortunatey, we are abe to sove the probems of optima energy/time aocation to the eves and optima woroad partition for a eves in a given dag (Theorems 5 and 6). Our simuation resuts demonstrate that when compared with optima soutions, the LL-MLS agorithm has exceent expected performance in soving the probems of energy-constrained and time-constrained scheduing of precedence constrained tass on mutipe heterogeneous computers. To the best of our nowedge, this is the first paper that investigates energy-efficient tas scheduing on mutipe heterogeneous computers in a systematic and anaytic way..3 Paper Outine Due to space imitation, this section is moved to the suppementary materia, which can be found on the Computer Society Digita Library at org/0.09/tsusc RELATED RESEARCH Reducing processor energy consumption has been an important and pressing research issue in recent years. There has been increasing interest and importance in deveoping highperformance and energy-efficient computing systems. There exists an exposive body of iterature on energy-efficient computing (see [3], [4], [32], [39] for comprehensive surveys). 2. Heterogeneous High-Performance Computing Energy-efficient high-performance computing on heterogeneous systems has been studied extensivey by numerous researchers. In [22], the authors were concerned with the probem of scheduing a bag-of-tass appication, made of a coection of independent stochastic tass with norma distributions of tas execution times, on a heterogeneous patform with deadine and energy consumption budget constraints. In [25], the authors proposed a new energy-aware scheduing agorithm with reduced tas dupication, which taes the energy consumption as we as the maespan of an appication into consideration. In [29], the authors deveoped a reatime and energy-efficient resource scheduing and optimization framewor to achieve high energy efficiency and ow response time in big data stream computing environments. In [30], the authors proposed an energy-efficient worfow tas scheduing agorithm in order to obtain more energy reduction as we as maintain the quaity of service by meeting the deadines. In [35], the authors addressed the probem of energy-aware data aocation and tas scheduing on a heterogeneous distributed shared-memory mutiprocessor system for rea-time appications. In [37], the authors devised a nove reiabiity maximization with energy constraint agorithm to effectivey baance the tradeoff between high reiabiity and ow energy consumption. In [38], the authors deveoped new green tas scheduing agorithms for heterogeneous computers with changeabe continuous and discrete speeds to reduce energy consumption as much as possibe and finish a tass before a deadine. 2.2 Heterogeneous Coud Computing Many researchers have investigated various issues in heterogeneous coud systems, incuding resource aocation, tas scheduing, performance guarantee, and energy saving. In [], the authors outined a principed approach to designing energy-efficient and heterogeneous data centers that are robust against data center woroad variations. In [23], the authors emphasized that the operationa cost of data centers is dominated by the cost on energy consumption, and modeed a data center as a cyber physica system to capture its therma properties. In [24], the authors presented a green strategy mode for heterogeneous coud systems, and provided a soution for heterogeneous ob-communicating tass and heterogeneous virtua machines that mae up the nodes of the coud to guarantee the service-eve agreement and to optimize energy savings. In [26], the authors had the obective to satisfy performance expectations of customers by heterogeneous dynamic dedicated server scheduing whie considering heterogeneous servers and different priority casses of customers. In [27], the authors proposed an optimization strategy based on a mixed integer programming mode for achieving improvement on power-efficiency whie providing performance guarantee in the virtuaized custer. In [34], the authors studied the muti-resource aocation probem in coud computing systems where the resource poo is

3 LI: ENERGY-EFFICIENT TASK SCHEDULING ON MULTIPLE HETEROGENEOUS COMPUTERS: ALGORITHMS, ANALYSIS, AND... 9 constructed from a arge number of heterogeneous servers, representing different points in the configuration space of resources such as processing, memory, and storage. However, the above research did not incude dynamic votage and frequency scaing into consideration. Due to space imitation, some part of this section is moved to the suppementary materia, avaiabe onine. 3 POWER ALLOCATION The power consumption mode adopted in this paper is the foowing, which is a standard mode used by virtuay a researchers in the fied [3], [4], [5], [6], [8], [9], [20], [22], [25], [29], [30], [35], [37], [38], [39]. Power dissipation and circuit deay in digita CMOS circuits can be accuratey modeed by simpe equations, even for compex microprocessor circuits. CMOS circuits have dynamic, static, and short-circuit power dissipation however, the dominant component in a we designed circuit is dynamic power consumption p (i.e., the switching component of power), which is approximatey p ¼ acv 2 f, where a is an activity factor, C is the oading capacitance, V is the suppy votage, and f is the coc frequency [7]. Since s / f, where s is the processor speed, and f / V f with 0 < f, which impies that V / f =f, we now that power consumption is p / f a and p / s a, where a ¼ þ 2=f 3. Furthermore, the vaue of a can be as high as for high suppy votage [36]. (Note: Athough it has been nown that power consumption can be highy dependent on types of appications [0], we assume that p is primariy determined by s, as in the existing iterature.) The main parameter which characterizes the power efficiency of a processor is a. In this paper, we consider tas scheduing on m heterogeneous computers specified by a a 2... a m. Assume that we are given n independent or precedence constrained sequentia tass to be executed on m heterogeneous processors. Let r i represent the execution requirement (i.e., the number of CPU cyces or the number of instructions) of tas i, where i n. We use p i to represent the power aocated to execute tas i which is executed on processor. For ease of discussion, we wi assume that p i is simpy s i, where s i ¼ p = i is the execution speed of tas i. The execution time of tas i is t i ¼ r i =s i ¼ r i =p = i. The energy consumed to execute tas i is e i ¼ p i t i ¼ r i p = i ¼ r i s i. 3. Energy-Constrained Scheduing In this section, we define and examine the energy-constrained scheduing probem. Given a set of n tass with execution requirements r r 2...r n, a group of m heterogeneous computers characterized by a a 2... a m, and an energy constraint E, the energy-constrained scheduing probem is to find a power aocation p p 2...p n to the n tass and a nonpreemptive schedue of the n tass on the m computers, such that the tota energy consumption of the n tass does not exceed E and that the tota execution time of the n tass (i.e., the schedue ength) is minimized. A schedue of a set of n tass on m computers is actuay a partition of the set of n tass into m disoint subsets or groups, such that tass in the th group are executed on the th computer. Let R denote the th group as we as the tota execution requirement of the tass in the th group, where m. We use R ¼ r þ r 2 þþr n ¼ R þ R 2 þþr m to represent the tota execution requirement of a the n tass. Assume that the given energy budget E is divided into m parts, i.e., E E 2...E m, such that E is aocated to computer, where m. The foowing theorem gives the optima power aocation and the minimized schedue ength for a given schedue. Theorem. For a given energy constraint E and a given schedue of a set of n tass on m computers R R 2...R m, the minimized schedue ength T satisfies E ¼ Ra T þ Ra2 2 a T a 2 Ram þþ m T : am The above minimized schedue ength is achieved when E ¼ R =T a and a tass in R are aocated with the same power p ¼ðR =TÞ and executed with the same speed s ¼ R =T, for a m. Proof. It is aready nown in [4] that given energy constraint E, the tota execution time of the tass in R on computer is minimized when a tass in R are suppied with the same power p ¼ðE =R Þ =ð Þ and executed with the same speed s ¼ðE =R Þ =ða Þ, and the minimized execution time is T ¼ R =ð Þ =E =ð Þ : It is cear that in order to minimize the schedue ength T of the n tass on the m computers, we need to have T ¼ T 2 ¼¼T m ¼ T, i.e., a the m computers compete their assigned tass at the same time. Hence, we get T ¼ R =ð Þ =E =ð Þ which gives E ¼ R =T a for a m. Since E ¼ E þ E 2 þþe m, we have E given in the theorem. It is easy to verify that p ¼ ðr =T Þ and s ¼ R =T, for a m. This proves the theorem. tu The significance of Theorem is that we have reduced the energy-constrained scheduing probem on mutipe heterogeneous computers to the probem of finding a schedue R R 2...R m, such that T is minimized. This suggests an effective approach to soving the energy-constrained scheduing probem. The method consists of three steps. In the first step, we find R R 2...R m, such that T is minimized (see Section 4. and Theorem 3). In the second step, we find a partition R R 2...R m of the n tass into m disoint groups, such that R R 2...R m are cose to R R 2...R m (see Section 5.). In the third step, we use Theorem to find the optima power aocation p p 2...p m and speed setting s s 2...s m for the given schedue R R 2...R m. 3.2 Time-Constrained Scheduing In this section, we define and examine the time-constrained scheduing probem. Given a set of n tass with execution requirements r r 2...r n, a group of m heterogeneous computers characterized by a a 2... a m, and a time constraint T, the timeconstrained scheduing probem is to find a power aocation p p 2...p n to the n tass and a nonpreemptive schedue of the n tass on the m computers, such that the tota execution

4 0 IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, VOL., NO., JANUARY-JUNE 206 time of the n tass does not exceed T and that the tota energy consumption of the n tass is minimized. The foowing theorem gives the optima power aocation and the minimized energy consumption for a given schedue. Theorem 2. For a given a time constraint T and a given schedue of a set of n tass on m computers R R 2...R m, the minimized energy consumption E is E ¼ Ra T a þ Ra2 2 T a 2 Ram þþ m T am : The above minimized energy consumption is achieved when a tass in R are aocated with the same power p ¼ðR =T Þ and executed with the same speed s ¼ R =T, for a m. Proof. It is aready nown in [4] that given time constraint T, the tota energy consumption of the tass in R on computer is minimized when a tass in R are suppied with the same power p ¼ðR =TÞ and executed with the same speed s ¼ R =T, and the minimized energy consumption is E ¼ R =T a where m. This impies that given a time constraint T and a partition R R 2...R m, the minimized energy consumption of the n tass is E ¼ E þ E 2 þþe m, i.e., the E given in the theorem. This proves the theorem. tu The significance of Theorem 2 is that we have reduced the time-constrained scheduing probem on mutipe heterogeneous computers to the probem of finding a schedue R R 2...R m, such that E is minimized. This suggests an effective approach to soving the time-constrained scheduing probem. The method consists of three steps. In the first step, we find R R 2...R m, such that E is minimized (see Section 4.2 and Theorem 4). In the second step, we find a partition R R 2...R m of the n tass into m disoint groups, such that R R 2...R m are cose to R R 2...R m (see Section 5.). In the third step, we use Theorem 2 to find the optima power aocation p p 2...p m and speed setting s s 2...s m for the given schedue R R 2...R m. 4 LOWER BOUNDS 4. A Lower Bound for Optima Schedue Length The main purpose of this section is to deveop a numerica method to find R R 2...R m, i.e., an optima partition of a given woroad R, assuming that R R 2...R m are continuous variabes, such that T in Theorem is minimized. From Theorem, we now that the minimized schedue ength T can be viewed as a function of R R 2...R m.we are interested in the foowing optimization probem (i.e., optima woroad partition), namey, given m, a a 2... a m, R, and E, finding a partition R R 2...R m of the woroad R, such that T is minimized, subect to the constraint that F ¼ R þ R 2 þþr m ¼ R: Unfortunatey, there is no cosed-form soution. Our obective is to deveop a numerica method to find R R 2...R m and the minimized T. The significance of the study is two fod. First, the vaues of R R 2...R m can be used to guide us in finding an optima schedue to sove the energy-constrained scheduing probem. Second, the minimized T can be used as a ower bound for the optima schedue ength T OPT, such that our soutions can be compared with the optima soutions. Our main resut of this section is the foowing theorem. Theorem 3. The optima schedue ength has a ower bound T OPT T, where the minimized T and the partition R R 2...R m that resuts in T can be obtained by soving the m þ equations, i.e., the constraint and m equations T ¼ R þ R 2 þþr m ¼ R R R E =ða Þ for a m. Note: The ower bound is impicity given by equations in the theorem. We wi deveop a numerica agorithm to sove these equations after the proof. The main idea of the proof is essentiay to treat T as a function of R R 2...R m. The optima woroad partition probem can be soved by minimizing T using a standard Lagrange mutipier system. The proof can be sipped if the reader is ess interested in the detais. Proof. It is cear that the optima schedue ength may not reach the minimized T, since the partition R R 2...R m of R may not be reaized by a partition of the n tass, due to the imited vaues of r r 2...r n. Hence, we have T OPT T. To sove the optimization probem, we use a Lagrange mutipier system, i.e., rt ¼ frf where f is a Lagrange mutipier. The above equation impies ¼ f@f=@r for a m. To et us reca the equation of T in Theorem, namey, E ¼ Ra T a þ Ra2 2 T a 2 Ram þþ m T am : We tae a partia derivative of R on both sides of the above equation and get R T Xm which gives rise ¼ R T ð ÞR T a ¼ ð ÞR a T a for a m. ¼ we obtain R T for a m. ð ÞR a T a ¼ f

5 LI: ENERGY-EFFICIENT TASK SCHEDULING ON MULTIPLE HETEROGENEOUS COMPUTERS: ALGORITHMS, ANALYSIS, AND... The ast equation can be rewritten as R =T a ¼ fs where S ¼ Xm ð ÞR T a : Hence, by Theorem, we have E ¼ R T ¼ fs R for a m. The above equation impies that and and E ¼ E þ E 2 þþe m ¼ fs Xm fs ¼ E Xm R R E ¼ R X m R E a R = ¼ E R =a þ R 2 =a 2 þþr m =a m for a m. By Theorem, we have T ¼ R =E =ða Þ : Substituting E, we get T ¼ R R E =ða Þ for a m. From the ast m equations, pus the condition R þ R 2 þþr m ¼ R we can find R R 2...R m and the resuting minimized T. tu It is cear that the sophistication of the system of noninear equations in Theorem 3 does not accommodate a cosed-form soution. In the foowing, we deveop a numerica method to find a soution to the m þ equations in Theorem 3. First, we need a method to find R R 2...R m for a fixed T. To this end, we et W ¼ P m ¼ R = : Therefore, we obtain E =ða Þ R ¼ T W for a m. It is cear that W can be found by the cassic bisection method [5]. The reason is that R R 2...R m a decrease with W. Hence, our numerica method can be described as foows. First, et W ¼ W 0 be some arbitrary vaue. We eep reducing W (e.g., having W) and increasing R R 2...R m, unti W ¼ W and W < P m ¼ R = : Next, we et W ¼ W 0 and eep increasing W (e.g., doubing W) and decreasing R R 2...R m, unti W ¼ W 2 and W 2 > P m ¼ R = : Finay, we can search W in ½W W 2 Š, such that W ¼ P m ¼ R = : Once W is determined, R R 2...R m can be cacuated in a straightforward manner. The detaied procedure of the above method is given in Agorithm of the suppementary materia, avaiabe onine. Second, we can aso find the minimized T by using the bisection method. It is observed that the vaues in R R 2...R m a increase with T. Hence, our numerica method can be described as foows. First, et T ¼ T 0 be some arbitrary vaue. We eep reducing T (e.g., having T) and decreasing R R 2...R m, unti R þ R 2 þþ R m <R: Next, we et T ¼ T 0 and eep increasing T (e.g., doubing T) and increasing R R 2...R m, unti R þ R 2 þþr m > R: Finay, we can search T in ½T T 2 Š, such that R þ R 2 þþr m ¼ R: The fina vaues of R R 2...R m are considered as the numerica soution of R R 2...R m. The detaied procedure of the above method is given in Agorithm 2 of the suppementary materia, avaiabe onine. 4.2 A Lower Bound for Optima Energy Consumption The main purpose of this section is to deveop a numerica method to find R R 2...R m, i.e., an optima partition of a given woroad R, assuming that R R 2...R m are continuous variabes, such that E in Theorem 2 is minimized. From Theorem 2, we now that the minimized energy consumption E can be viewed as a function of R R 2...R m. We are interested in the foowing optimization probem (i.e., optima woroad partition), namey, given m, a a 2... a m, R, and T, finding a partition R R 2...R m of the woroad R, such that E is minimized, subect to the constraint that F ¼ R þ R 2 þþr m ¼ R: Again, there is no cosed-form soution. Our obective is to deveop a numerica method to find R R 2...R m and the minimized E. The significance of the study is two fod. First, the vaues of R R 2...R m can be used to guide us in finding an optima schedue to sove the time-constrained scheduing probem. Second, the minimized E can be used as a ower bound for the optima energy consumption E OPT, such that our soutions can be compared with the optima soutions. Our main resut of this section is the foowing theorem. Theorem 4. The optima energy consumption has a ower bound E OPT E, where the minimized E and the partition R R 2...R m that resuts in E are E ¼ T Xm f a =ð Þ and R ¼ ¼ f =ða Þ T for a m, and f satisfies ¼ f =ða Þ ¼ R T : Note: The ower bound is impicity given by equations in the theorem. We wi deveop a numerica agorithm to sove these equations after the proof. The main idea of the proof is simiar to that of Theorem 3 and can be sipped.

6 2 IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, VOL., NO., JANUARY-JUNE 206 Proof. It is cear that the optima energy consumption may not reach the minimized E, since the partition R R 2...R m of R may not be reaized by a partition of the n tass. Hence, we have E OPT E. To sove the optimization probem, we use a Lagrange mutipier system, i.e., re ¼ frf where f is a Lagrange mutipier. The above equation impies ¼ f@f=@r for a m. From Theorem 2, we get R =T a ¼ f and consequenty, R ¼ f =ða Þ T for a m. Since R þ R 2 þþr m ¼ R, we get R ¼ T Xm f =ða Þ : By Theorem 2, we have E ¼ Xm ¼ ¼ R a T ¼ T Xm f ¼ a =ð Þ The theorem is proven. tu The equation for f in Theorem 4 can be soved by using the bisection method, where we observe that the eft-hand side of the equation is an increasing function of f. Hence, our numerica method can be described as foows. Assume that S ¼ Xm f =ða Þ : ¼ First, et f ¼ f 0 be some arbitrary vaue. We eep reducing f (e.g., having f), unti S < R=T. Next, we et f ¼ f 0 and eep increasing f (e.g., doubing f), unti S > R=T. Finay, we can search f in ½f f 2 Š, such that S ¼ R=T. Once f is avaiabe, we can cacuate R R 2...R m and the minimized energy consumption E. The detaied procedure of the above method is given in Agorithm 3 of the suppementary materia, avaiabe onine. : 5 SCHEDULING INDEPENDENT TASKS 5. The MLS Agorithm We now describe our heuristic agorithms. Let R R 2...R m denote the partition obtained in Theorem 3 which resuts in the minimized T. Our strategy to sove the energy-constrained scheduing probem is to find a partition R R 2...R m of the n tass into m disoint groups, such that R R 2...R m are cose to the optima woroad partition R R 2...R m. Let R R 2...R m denote the partition obtained in Theorem 4 which resuts in the minimized E. Our strategy to sove the time-constrained scheduing probem is to find a partition R R 2...R m of the n tass into m disoint groups, such that R R 2...R m are cose to the optima woroad partition R R 2...R m. For both energy-constrained scheduing and time-constrained scheduing, we are facing the same probem, namey, given R R 2...R m, finding a partition of the n tass into m disoint groups, such that R R 2...R m are cose to R R 2...R m. Once a schedue R R 2...R m is determined, we can use Theorem to find an optima power aocation for energy-constrained scheduing, or Theorem 2 to find an optima power aocation for time-constrained scheduing. To generate a partition of the n tass into m disoint groups, we treat the tas execution requirements r r 2... r n as tas execution times. (Note: They are not the actua tas execution times, but ony the input to the MLS agorithm for the purpose of generating a partition.) The n tass are assigned to the m computers by using the cassic ist scheduing agorithm [2] with some modifications. Our modified ist scheduing agorithm wors as foows. Initiay, the n tass are arranged into a ist. The first m tass are assigned to the m computers, one tas per computer. Whenever a computer competes a tas, the next tas in the ist is assigned to the computer. As soon as the tota tas execution requirement of a computer exceeds R, the computer is boced and does not accept more tass. The resut of the above procedure is a tas assignment to the m computers, i.e., a schedue of the n tass on the m computers. The resuted partition R R 2...R m is considered as an approximation of R R 2...R m. 5.2 Numerica Exampes In this section, we give exampes of our numerica cacuations. In Tabe, we demonstrate numerica data of energyconstrained scheduing of independent tass. We consider m ¼ 7 heterogeneous computers with ða a 2 a 3 a 4 a 5 a 6 a 7 Þ¼ð3:0 3:5 4:0 4:5 5:0 5:5 6:0Þ: (Note: As commony accepted, the vaue of a is at east 3 [7]. As mentioned earier, the vaue of a can be as high as for high suppy votage [36].) Let us assume that the tota amount of tas execution requirement is set as R ¼ 00. For energy constraint E ¼ , we show R, E, p, and s for a m. (Note: The vaues of R and E are set in such a way that the execution speeds are in a reasonabe range.) These data are cacuated by using Theorem 3. The minimized T is aso given. We observe that due to the heterogeneity of the s, the woroad R and the energy E are uneveny distributed among the computers, such that a computer with a smaer is assigned more woroad and aocated more energy. Furthermore, tass assigned to a computer with a smaer are suppied with more power and executed with faster speed. In fact, as E increases, the ratios R =R m, E =E m, p =p m, and s =s m a increase. In Tabe 2, we demonstrate numerica data of time-constrained scheduing of independent tass. The vaues of m, R, and the s are the same as Tabe. For time constraint T ¼ 2:0 0:5 9:0 7:5 6:0, weshowr, E, p,ands for a m. These data are cacuated by using Theorem 4. The minimized E is aso given. It is observed that as T decreases, the ratios R =R m, E =E m, p =p m,ands =s m a increase. 5.3 Performance Data We now present simuation data to demonstrate the performance of our heuristic agorithms on m ¼ 7 heterogeneous computers with the same s in Section 5.2.

7 LI: ENERGY-EFFICIENT TASK SCHEDULING ON MULTIPLE HETEROGENEOUS COMPUTERS: ALGORITHMS, ANALYSIS, AND... 3 TABLE Numerica Data of Energy-Constrained Scheduing of Independent Tass E R E p s T , TABLE 2 Numerica Data of Time-Constrained Scheduing of Independent Tass T R E p s E , The soutions produced by our agorithms wi be compared with optima soutions (actuay, the ower bounds in Theorems 3 and 4). Our performance measures are defined as foows. Let A be a heuristic agorithm for the energy-constrained scheduing probem. We use T A to denote the ength of the schedue produced by agorithm A. Then, the ratio T A =T OPT is caed the performance ratio of agorithm A. If T A =T OPT B, then B is caed a performance bound. Let T denote the minimized T in Theorem 3. It is cear that B ¼ T A =T is a performance bound. Furthermore, if the tas execution requirements are random variabes, T A, T, and B are a random variabes. The expectation B is caed the expected performance bound. Let A be a heuristic agorithm for the time-constrained scheduing probem. We use E A to denote the amount of energy consumed by agorithm A. Then, the ratio E A =E OPT is caed the performance ratio of agorithm A. If E A =E OPT C, then C is caed a performance bound. Let E denote the minimized E in Theorem 4. It is cear that C ¼ E A =E is a performance bound. Furthermore, if the tas execution requirements are random variabes, E A, E, and C are a random variabes. The expectation C is caed the expected performance bound. In Tabe 3, we show simuation data for the expected performance bound B of the modified ist scheduing agorithm for the energy-constrained scheduing probem. For each combination of n ¼ and E ¼ , we generate 2,000 sets of n tass, whose tas execution requirements are random numbers uniformy TABLE 3 Simuation Data of B for Independent Tass (CI¼ 0:34%) n E ¼ 00 E ¼ 200 E ¼ 300 E ¼ 400 E ¼

8 4 IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, VOL., NO., JANUARY-JUNE 206 TABLE 4 Simuation Data of C for Independent Tass (CI¼ 0:44%) n T ¼ 4:0t T ¼ 3:5t T ¼ 3:0t T ¼ 2:5t T ¼ 2:0t distributed in ½0 Š. For each set of tass, we cacuate the ower bound T of T OPT by using Theorem 3. We aso simuate the MLS agorithm and find its schedue ength T MLS. The performance bound is obtained as B ¼ T MLS =T. The average vaue of the 2,000 vaues of B is then reported in Tabe 3. The 99 percent confidence interva of a the data in Tabe 3 is no more than 0:34 percent. In Tabe 4, we show simuation data for the expected performance bound C of the modified ist scheduing agorithm for the time-constrained scheduing probem. For each combination of n ¼ and T ¼ 4:0t 3:5t 3:0t 2:5t 2:0t, where t ¼ n=50, we generate 2,000 sets of n tass, whose tas execution requirements are random numbers uniformy distributed in ½0 Š. For each set of tass, we cacuate the ower bound E of E OPT by using Theorem 4. We aso simuate the MLS agorithm and find its energy consumption E MLS. The performance bound is obtained as C ¼ E MLS =E. The average vaue of the 2,000 vaues of C is then reported in Tabe 4. The 99 percent confidence interva of a the data in Tabe 4 is no more than 0:44 percent. In the foowing, we point out some observations. From Tabe 3, we observe that B is very cose to. For a given n, the expected performance bound B sighty increases as E increases, and then decreases as E further increases, primariy due to the ower bounds. For a given E, the expected performance bound B decreases as n increases. From Tabe 4, we observe that C is very cose to. For a given n, the expected performance bound C sighty increases as T decreases, and then decreases as T further decreases, primariy due to the ower bounds. For a given T, the expected performance bound C decreases as n increases. Notice that a the data in Tabes 3 and 4 are for expected performance bounds. The actua vaues of expected performance ratios are smaer and coser to the optima. Our simuation resuts demonstrate that the MLS agorithm has exceent expected performance in soving the probems of energy-constrained and time-constrained scheduing of independent tass on mutipe heterogeneous computers. 6 SCHEDULING PRECEDENCE CONSTRAINED TASKS 6. The LL-MLS Agorithm A set of precedence constrained tass can be represented by a directed acycic graph. The nodes in a dag are tass and arcs in the dag are precedence constraints. A dag can be divided into v eves. Tass with no any predecessor are in eve. In genera, a tas is in eve if the ongest path from a node in eve to the tas contains nodes. Let R denote the tota execution requirement of tass in eve,where v. Our strategy to schedue precedence constrained tass is to schedue tass in a dag eve by eve,thatis,tassineve cannot be executed unti a tass in eve are finished. Since tass in each eve are independent of each other, they can be schedued by using the MLS agorithm. Therefore, our agorithm is caed eve-by-eve modified ist scheduing agorithm. Let E denote the amount of energy aocated to tass in eve (or, the amount of energy consumed by tass in eve ), where v, such that E þ E 2 þþe v ¼ E. Let T denote the execution time aocated to tass in eve (or, the tota execution time of tass in eve ), where v, such that T þ T 2 þþt v ¼ T. Our main concern for energy-constrained scheduing is the determination of E E 2...E v, such that the tota execution time of a tass in a dag, i.e., T ¼ T þ T 2 þþt v,isminimized (see Section 6.2 and Theorem 5). Once an optima energy aocation E E 2...E v is avaiabe, we can use the MLS agorithm in Section 5. to schedue the v eves separatey and sequentiay. Our main concern for timeconstrained scheduing is the determination of T T 2...T v, such that the tota energy consumption of a tass in a dag, i.e., E ¼ E þ E 2 þþe v, is minimized (see Section 6.3 and Theorem 6). Once an optima time aocation T T 2...T v is avaiabe, we can use the MLS agorithm in Section 5. to schedue the v eves separatey and sequentiay. We wi address these two issues in the next two sections. 6.2 Optima Energy Aocation According to the eve-by-eve scheduing method, the tota execution time T of a tass in a dag is simpy the summation of the T s, i.e., T ¼ T þ T 2 þþt v where T is viewed as a function of E, and T is viewed as a function of E E 2...E v. Hence, we can define the foowing optimization probem (i.e., optima energy aocation and woroad partition), namey, given m, a a 2... a m, R R 2...R v, and E, finding E E 2...E v, such that T is minimized, subect to the constraint that F ¼ E þ E 2 þþe v ¼ E: Notice that each R shoud be further divided into R R 2...R m by using Theorem 3, such that T is minimized for a given E, where v. Note: The main purpose of the foowing mathematica derivation is to deveop a numerica procedure to sove the above optimization probem. The reader can sip this part and go to Tabe 5. To sove the above optimization probem, we use a Lagrange mutipier system, i.e., rt ¼ frf where f is a Lagrange mutipier. The above equation impies @E or, =@E ¼ f for a v. In the foowing, we deveop a numerica procedure to sove the optimization probem.

9 LI: ENERGY-EFFICIENT TASK SCHEDULING ON MULTIPLE HETEROGENEOUS COMPUTERS: ALGORITHMS, ANALYSIS, AND... 5 TABLE 5 Numerica Data of Energy-Constrained Scheduing of Precedence Constrained Tass E R E p s T To =@E, we reca from Theorem 3 that for a given E, there is an optima partition R R 2...R m which resuts in the minimized T. The vaues of R R 2...R m and T can be obtained by soving the m þ equations, i.e., the constraint and m equations T ¼ R þ R 2 þþr m ¼ R R R E =ða Þ for a m. From Tabe, we now that T as we as R R 2...R m are a functions of E. We tae a partia derivative of E on both sides of the above equation and T 2 þr ð ÞR 2 R 0 R 0 R E R E R where R 0 =@E, for a m. The ast equation can be rewritten as E 2 ð ÞR 0 ¼ ft 2 R E þ R R 2 þ R R 0 R E for a m. For a given f, we need to find E, where v. For a given E, the vaues of R R 2...R m and T can be obtained from Theorem 3. The vaues of R 0 R0 2...R0 m can be obtained by soving the above m equations, which form a system of inear equations, X 6¼ R ¼ ft 2 R 0 þ ð Þ E R 2 þ R R þ R R E R 0 for a m. The detaied procedure of the above method is given in Agorithm 4 of the suppementary materia, avaiabe onine. Hence, we ony need to determine E, such that R 0 þ R0 2 þþr0 m ¼ 0. It is noticed that R 0 þ R0 2 þþr0 m is a decreasing function of E, and the vaue of E can be found by using the bisection method. The detaied procedure of the above method is given in Agorithm 5 of the suppementary materia, avaiabe onine. Finay, to find E E 2...E v, we notice that a the E s as we as E þ E 2 þþe v are increasing functions of f. Therefore, we ony need to determine f, such that E þ E 2 þþe v ¼ E, by using the bisection method. Once E E 2...E v are avaiabe, we can cacuate R R 2...R m and T, for a v, by using Theorem 3, and T as a direct consequence. The detaied procedure of the above method is given in Agorithm 6 of the suppementary materia, avaiabe onine. In Tabe 5, we demonstrate numerica data of energy-constrained scheduing of precedence constrained tass. We consider m ¼ 7 heterogeneous computers with the same s in Section 5.2. Assume that a dag has v ¼ 4 eves, with R ¼ 0, where v, and R ¼ 00. The energy constraint is E ¼ 200. For a v, weshowe and T,asweasR, E, p,ands, for a m. The foowing facts are observed, which are described by surprisingy simpe reations, and are not obvious at a from the above derivation. Theorem 5. Assume that () R R 2...R m is an optima partition of a woroad R of independent tass obtained from Theorem 3 (2) E E 2...E m is the corresponding optima energy aocation of a given energy constraint E (3) T is the resuting minimized schedue ength (4) p p 2...p m are the power settings of the m computers (5) s s 2...s m are the speed settings of the m computers. For the optima energy aocation and woroad partition probem with R R 2...R v and E, where R þ R 2 þþr v ¼ R, we have the foowing facts. ) E, R, E, and T are ineary proportiona to R, i.e., they can be obtained from E, R, E, and T by scaing a factor of R =ðr þ R 2 þþr v Þ, for a v. 2) p ¼ p 2 ¼¼p v ¼ p. 3) s ¼ s 2 ¼¼s v ¼ s.

10 6 IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, VOL., NO., JANUARY-JUNE 206 Proof. Notice that the minimized T 0 of the optima energy aocation and woroad partition probem cannot be shorter than the minimized T of the woroad partition probem of the same set of tass with a precedence constraints removed (i.e., tass become independent of each other). The minimized T of the set of independent tassisgivenbytheorem3.ifwecanshowthatt 0 is the same as T, thent 0 wi be obviousy optima, i.e., we get an optima energy aocation and woroad partition. Fortunatey, this is indeed true, as stated in the theorem. Theeyobservationisthefactthattheequationsin Theorem 3 sti hod if E, ther s, and T are scaed by a common factor. This expains Fact ), i.e., if we set E, the R s, the E s, and T according to Fact ), then they wi satisfy the equations in Theorem 3, and become the input and output of the woroad partition probem for eve, fora v. Thisimpiesthat T 0 ¼ T þ T 2 þþt v ¼ T, andwedogetasoutionto the optima energy aocation and woroad partition probem. Furthermore, since p ¼ðR =T Þ and s ¼ R =T, for a v and m, Facts 2)-3) are direct consequences, since R and T are a scaed by a common factor. tu 6.3 Optima Time Aocation According to the eve-by-eve scheduing method, the tota energy consumption E of a tass in a dag is simpy the summation of the E s, i.e., E ¼ E þ E 2 þþe v where E is viewed as a function of T, and E is viewed as a function of T T 2...T v. Hence, we can define the foowing optimization probem (i.e., optima time aocation and woroad partition), namey, given m, a a 2... a m, R R 2... R v, and T, finding T T 2...T v, such that E is minimized, subect to the constraint that F ¼ T þ T 2 þþt v ¼ T: Notice that each R shoud be further divided into R R 2...R m by using Theorem 4, such that E is minimized for a given T, where v. Note: The main purpose of the foowing mathematica derivation is to deveop a numerica procedure to sove the above optimization probem. The reader can sip this part and go to Tabe 6. To sove the above optimization probem, we use a Lagrange mutipier system, i.e., re ¼ frf where f is a Lagrange mutipier. The above equation impies ¼ f@f=@t or, =@T ¼ f for a v. In the foowing, we deveop a numerica procedure to sove the optimization probem. To =@T, we reca from Theorem 4 that for a given T, there is an optima partition R R 2...R m which resuts in the minimized E given by E ¼ T ¼ f a =ð Þ TABLE 6 Numerica Data of Time-Constrained Scheduing of Precedence Constrained Tass T R E p s E where f is aso a function of T. We tae a partia derivative of T on both sides of the above equation ¼ Xm ¼ f a =ð Þ þ T f 0 ¼ =ða Þ f where f 0 =T. It remains to find f 0. Since f satisfies ¼ f =ða Þ ¼ R T we can tae a partia derivative of T on both sides of the above equation and get f 0 ¼ which impies that f 0 ¼ R T 2 ð Þ ¼ a ða 2Þ=ð Þ f ¼ R T 2 a ða 2Þ=ð Þ : ð Þ f Substituting f 0 into the equation =@T and noticing =@T ¼ f, we get

11 LI: ENERGY-EFFICIENT TASK SCHEDULING ON MULTIPLE HETEROGENEOUS COMPUTERS: ALGORITHMS, ANALYSIS, AND... 7 TABLE 7 Simuation Data of B for Precedence Constrained Tass (CI¼ 0:459%) v E ¼ 200 E ¼ 400 E ¼ 600 E ¼ 800 E ¼ TABLE 8 Simuation Data of C for Precedence Constrained Tass (CI¼ :809%) v T ¼ 0:45t T ¼ 0:40t T ¼ 0:35t T ¼ 0:30t T ¼ 0:25t f ¼ Xm ¼ R T f a =ð Þ ¼ ¼ a ða 2Þ=ð Þ ð Þ f f =ða Þ : It is noticed that the right-hand side of the above equation is negative and an increasing function of T. Thus, for any given f < 0, we can find T by using the bisection method. The detaied procedure of the above method is given in Agorithm 7 of the suppementary materia, avaiabe onine. To determine T T 2...T v, we notice that a the T s as we as T þ T 2 þþt v are increasing functions of f. Therefore, we ony need to determine f, such that T þ T 2 þþt v ¼ T. Once T T 2...T v are avaiabe, we can cacuate R R 2...R m and E, for a v, by using Theorem 4, and E as a direct consequence. The detaied procedure of the above method is given in Agorithm 8 of the suppementary materia, avaiabe onine. In Tabe 6, we demonstrate numerica data of time-constrained scheduing of precedence constrained tass. The vaues of m, the s, v, and the R s are the same as Tabe 5. The time constraint is T ¼ 2. For a v, we show T and E, as we as R, E, p, and s, for a m. We observe the same facts as those in Tabe 5. Theorem 6. Assume that () R R 2...R m is an optima partition of a woroad R of independent tass obtained from Theorem 4, where the time constraint is T (2) E E 2...E m is the corresponding energy consumption of the m computers (3) E is the resuting minimized energy consumption (4) p p 2...p m are the power settings of the m computers (5) s s 2...s m are the speed settings of the m computers. For the optima time aocation and woroad partition probem with R R 2...R v and T, where R þ R 2 þþr v ¼ R, we have the foowing facts. ) T, R, E, and E are ineary proportiona to R, i.e., they can be obtained from T, R, E, and E by scaing a factor of R =ðr þ R 2 þþr v Þ, for a v. 2) p ¼ p 2 ¼¼p v ¼ p. 3) s ¼ s 2 ¼¼s v ¼ s. Proof. The proof foows the same argument as that of Theorem 5. The ey observation is the fact that the equations in Theorem 4 sti hod if E, the R s, and T are scaed by a common factor. tu 6.4 Performance Data In this section, we present simuation resuts to demonstrate the expected performance of the LL-MLS agorithm on m ¼ 7 heterogeneous computers with the same s in Section 5.2. Again, the soutions produced by the LL-MLS agorithm are compared with optima soutions. Notice that the optima schedue ength of tass in a dag with a given energy budget cannot be shorter than the optima schedue ength of the same set of tass with the same energy budget and with a precedence constraints removed. Simiary, the minimum energy consumption of tass in a dag with a given time deadine cannot be ess than the minimum energy consumption of the same set of tass with the same time deadine and with a precedence constraints removed. Hence, the ower bounds in Section 4 are aso appicabe to precedence constrained tass. In Tabe 7, we show simuation data for the expected performance bound B of the LL-MLS agorithm for the energyconstrained scheduing probem. We consider a dag with v eves. The number of tass in eve is 0, for a v. For each combination of v ¼ and E ¼ , we generate 2,000 sets of n ¼ 5vðv þ Þ tass, whose tas execution requirements are random numbers uniformy distributed in ½0 Š. For each set of tass, we cacuate the ower bound T of T OPT by using Theorem 3. We aso simuate the LL-MLS agorithm and find its schedue ength T LL-MLS. The performance bound is obtained as B ¼ T LL-MLS=T. The average vaue of the 2,000 vaues of B is then reported in Tabe 7. The 99 percent confidence interva of a the data in Tabe 7 is no more than 0:459 percent. In Tabe 8, we show simuation data for the expected performance bound C of the LL-MLS agorithm for the timeconstrained scheduing probem. We consider the same dag as that in Tabe 7. For each combination of v ¼ and T ¼ 0:45t 0:40t 0:35t 0:30t 0:25t, where t ¼ v 2, we generate 2,000 sets of n tass, whose tas execution requirements are random numbers uniformy distributed in ½0 Š. For each set of tass, we cacuate the ower bound E of E OPT by using Theorem 4. We aso simuate the LL-MLS agorithm and find its energy consumption E LL-MLS. The performance bound is obtained as C ¼ E LL-MLS=E. The average vaue of the 2,000 vaues of C is then reported in

12 8 IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, VOL., NO., JANUARY-JUNE 206 Tabe 8. The 99 percent confidence interva of a the data in Tabe 8 is no more than :809 percent. From Tabe 7, we observe that B is very cose to. For a given v, the expected performance bound B sighty increases as E increases, primariy due to the ower bounds. For a given E, the expected performance bound B decreases as v increases. From Tabe 8, we observe that C is reasonaby cose to, except for sma vaues of v. For a given v, the expected performance bound C increases sighty (except for sma v) as T decreases, primariy due to the ower bounds. For a given T, the expected performance bound C decreases as v increases. Notice that a the data in Tabes 7 and 8 are for expected performance bounds. The actua vaues of expected performance ratios are smaer and coser to the optima. Our simuation resuts demonstrate that the LL-MLS agorithm has exceent expected performance in soving the probems of energy-constrained and time-constrained scheduing of precedence constrained tass on mutipe heterogeneous computers. 7 SUMMARY AND FURTHER RESEARCH We have addressed the probems of energy-constrained and time-constrained scheduing of independent or precedence constrained tass on mutipe heterogeneous computers as combinatoria optimization probems. Four ey techniques have been deveoped in this paper. First, we show how to find an optima energy or time aocation to eves of a dag, such that the performance of the eve-by-eve scheduing method can be optimized (Theorems 5 and 6). Second, for independent tass in the same eve, we show how to find an optima woroad partition, such that the modified ist scheduing agorithm can be empoyed to find an approximate tas schedue (Theorems 3 and 4). Third, for a given schedue, we show how to find an optima power aocation, such that the tota execution time or the tota energy consumption is minimized (Theorems and 2). Fourth, we derive ower bounds for the optima schedue ength and the optima energy consumption, so that soutions produced by our heuristic agorithms can be compared with optima soutions (Theorems 3 and 4). Our extensive simuation resuts demonstrate that by using the above techniques, the MLS agorithm has exceent and cose-to-optima expected performance in soving the probems of energy-constrained and time-constrained scheduing of independent tass on mutipe heterogeneous computers, and that the LL-MLS agorithm has exceent and cose-to-optima expected performance in soving the probems of energy-constrained and time-constrained scheduing of precedence constrained tass on mutipe heterogeneous computers. Our research in this paper reveas that energy-efficient tas scheduing on heterogeneous computers can be studied anayticay, ust as what we did for homogeneous computers. To the best of our nowedge, there has been no such method and resut in the existing iterature. Our investigation in this paper has made significant initia effort and our advanced methodsanddeepresutsinthispaperarethecurrentstate-ofthe-art. Furthermore, this paper shoud inspire further research interest. To concude the paper, we woud ie to mention severa further research directions. First, it has been observed that power consumption is dependent on specific types of appications [0]. It is a chaenge to incorporate such dependency into energy-efficient tas scheduing. Second, our study in this paper can be extended to more sophisticated power consumption modes, such as discrete and bounded coc frequency and suppy votage and execution speed and power suppy eves, as what we have done for homogeneous computers [20]. Third, in a way simiar to that of most existing studies, our research in this paper is primariy anaytica and experimenta. It needs further effort to appy such anaytica and simuation resuts to actua appications in reaistic systems. Due to space imitation, further investigation in these directions is beyond the scope of the paper and deserves separate papers. ACKNOWLEDGMENTS The author deepy appreciates four anonymous reviewers whose criticism and comments hep to improve the presentation of the manuscript. REFERENCES [] [Onine]. Avaiabe: heterogeneous-computing/what-is-heterogeneous-systemarchitecture-hsa, Accessed on 9 Nov [2] [Onine]. Avaiabe: power_ss2008.htm, Accessed on 9 Nov [3] S. Abers, Energy-efficient agorithms, Commun. ACM, vo. 53, no. 5, pp , 200. [4] A. Beogazov, R. Buyya, Y. C. Lee, and A. Zomaya, A taxonomy and survey of energy-efficient data centers and coud computing systems, Advances Comput., vo. 82, pp. 47, 20. [5] R. L. Burden, J. D. Faires, and A. C. Reynods, Numerica Anaysis, 2nd ed. Boston, MA, USA: Prinde, Weber & Schmidt, 98. [6] J. Cao, K. Li, and I. Stomenovic, Optima power aocation and oad distribution for mutipe heterogeneous muticore server processors across couds and data centers, IEEE Trans. Comput., vo. 63, no., pp , Jan [7] A. P. Chandraasan, S. Sheng, and R. W. Brodersen, Low-power CMOS digita design, IEEE J. Soid-State Circuits, vo. 27, no. 4, pp , Apr [8] S. Crago, et a., Heterogeneous coud computing, in Proc. IEEE Int. Conf. Custer Comput., 20, pp [9] S. P. Crago and J. P. Waters, Heterogeneous coud computing: The way forward, IEEE Comput.,vo.48,no.,pp.59 6,Jan.205. [0] H. Esmaeizadeh, T. Cao, X. Yang, S. M. Bacburn, and K. S. McKiney, Looing bac and ooing forward: Power, performance, and upheava, Commun. ACM, vo. 55, no. 7, pp. 05 4, 202. [] S. Garg, S. Sundaram, and H. D. Pate, Robust heterogeneous data center design: A principed approach, ACM SIGMETRICS Perform. Eva. Rev., vo. 39, no. 3, pp , Dec. 20. [2] R. L. Graham, Bounds on mutiprocessing timing anomaies, SIAM J. App. Math., vo. 2, pp , 969. [3] R. Iyer, et a., CogniServe: Heterogeneous server architecture for arge-scae recognition, IEEE Micro, vo. 3, no. 3, pp. 20 3, May/Jun. 20. [4] K. Li, Performance anaysis of power-aware tas scheduing agorithms on mutiprocessor computers with dynamic votage and speed, IEEE Trans. Parae Distrib. Syst., vo. 9, no., pp , Nov

13 LI: ENERGY-EFFICIENT TASK SCHEDULING ON MULTIPLE HETEROGENEOUS COMPUTERS: ALGORITHMS, ANALYSIS, AND... 9 [5] K. Li, Energy efficient scheduing of parae tass on mutiprocessor computers, J. Supercomputing, vo. 60, no. 2, pp , 202. [6] K. Li, Scheduing precedence constrained tass with reduced processor energy on mutiprocessor computers, IEEE Trans. Comput., vo. 6, no. 2, pp , Dec [7] K. Li, Optima power aocation among mutipe heterogeneous servers in a data center, Sustain. Comput.: Informat. Syst., vo. 2, no., pp. 3 22, 202. [8] K. Li, Energy-efficient and high-performance processing of argescae parae appications in data centers, in Data Centers, S. U. Khan and A. Y. Zomaya, Eds. Berin, Germany: Springer, 205, pp. 33. [9] K. Li, Power and performance management for parae computations in couds and data centers, J. Comput. Syst. Sci., vo. 82, pp , 206. [20] K. Li, Energy and time constrained tas scheduing on mutiprocessor computers with discrete speed eves, J. Parae Distrib. Comput., vo. 95, pp. 5 28, 206. [2] K. Li, Improving muticore server performance and reducing energy consumption by woroad dependent dynamic power management, IEEE Trans. Coud Comput., vo. 4, no. 2, pp , Apr./Jun [22] K. Li, X. Tang, and K. Li, Energy-efficient stochastic tas scheduing on heterogeneous computing systems, IEEE Trans. Parae Distrib. Syst., vo. 25, no., pp , Nov [23] S. U. R. Mai, K. Bia, S. U. Khan, B. Veeravai, K. Li, and A. Y. Zomaya, Modeing and anaysis of the therma properties exhibited by cyber physica data centers, IEEE Syst. J., 206, doi: 0.09/JSYST , in press, 207. [24] J. Mateo, J. Viapana, L. M. Pa, J. L. Lerida, and F. Sosona, A green strategy for federated and heterogeneous couds with communicating woroads, Sci. Word J., Nov., 204. [Onine]. Avaiabe: artices/pmc / [25] J. Mei, K. Li, and K. Li, Energy-aware tas scheduing in heterogeneous computing environments, Custer Comput., vo. 7, no. 2, pp , 204. [26] H. S. Narman, M. S. Hossain, and M. Atiquzzaman, h-ddss: Heterogeneous dynamic dedicated servers scheduing in coud computing, in Proc. IEEE Int. Conf. Commun., 204, pp [27] V. Petrucci, E. V. Carrera, O. Loques, J. C. B. Leite, and D. Mosse, Optimized management of power and performance for virtuaized heterogeneous server custers, in Proc. th IEEE/ACM Int. Symp. Custer Coud Grid Comput., 20, pp [28] C. Reiss, A. Tumanov, G. R. Ganger, R. H. Katz, and M. A. Kozuch, Heterogeneity and dynamicity of couds at scae: Googe trace anaysis, in Proc. 3rd ACM Symp. Coud Comput., 202, Art. no. 7. [29] D. Sun, G. Zhang, S. Yang, W. Zheng, S. U. Khan, and K. Li, Re- Stream: Rea-time and energy-efficient resource scheduing in big data stream computing environments, Inf. Sci., vo. 39, pp. 92 2, 205. [30] Z. Tang, L. Qi, Z. Cheng, K. Li, S. U. Khan, and K. Li, An energyefficient tas scheduing agorithm in DVFS-enabed coud environment, J. Grid Comput., vo. 4, no., pp , 206. [3] Y. Tian, C. Lin, and K. Li, Managing performance and power consumption tradeoff for mutipe heterogeneous servers in coud computing, Custer Comput., vo. 7, no. 3, pp , 204. [32] G. L. Vaentini, et a., An overview of energy efficiency techniques in custer computing systems, Custer Comput., vo. 6, no., pp. 3 5, 203. [33] L. Van Doorn, Heterogeneous server architectures wi dominate the future data center, Open Server Summit, Santa Cara, CA, USA, Nov. 3, 204. [Onine]. Avaiabe: [34] W. Wang, B. Liang, and B. Li, Muti-Resource fair aocation in heterogeneous coud computing systems, IEEE Trans. Parae Distrib. Syst., vo. 26, no. 0, pp , Oct [35] Y. Wang, K. Li, H. Chen, L. He, and K. Li, Energy-aware data aocation and tas scheduing on heterogeneous mutiprocessor systems with time constraints, IEEE Trans. Emerg. Topics Comput., vo. 2, no. 2, pp , Jun [36] B. Zhai, D. Baauw, D. Syvester, and K. Fautner, Theoretica and practica imits of dynamic votage scaing, in Proc. 4st Des. Autom. Conf., 2004, pp [37] L. Zhang, K. Li, Y. Xu, J. Mei, F. Zhang, and K. Li, Maximizing reiabiity with energy conservation for parae tas scheduing in a heterogeneous custer, Inf. Sci., vo. 39, pp. 3 3, 205. [38] L. M. Zhang, K. Li, D. C.-T. Lo, and Y. Zhang, Energy-efficient tas scheduing agorithms on heterogeneous computers with continuous and discrete speeds, Sustain. Comput.: Informat. Syst., vo. 3, no. 2, pp. 09 8, 203. [39] S. Zhuravev, J. C. Saez, S. Bagodurov, A. Fedorova, and M. Prieto, Survey of energy-cognizant scheduing techniques, IEEE Trans. Parae Distrib. Syst., vo. 24, no. 7, pp , Ju Keqin Li is a SUNY distinguished professor of computer science. His current research interests incude parae computing and high-performance computing, distributed computing, energy-efficient computing and communication, heterogeneous computing systems, coud computing, big data computing, CPU-GPU hybrid and cooperative computing, muticore computing, storage and fie systems, wireess communication networs, sensor networs, and peer-to-peer fie sharing systems. He has more than 460 pubications. He is currenty or has served on the editoria boards of the IEEE Transactions on Parae and Distributed Systems, the IEEE Transactions on Computers, the IEEE Transactions on Coud Computing, and the IEEE Transactions on Services Computing. He is a feow of the IEEE. " For more information on this or any other computing topic, pease visit our Digita Library at

Asynchronous Control for Coupled Markov Decision Systems

Asynchronous Control for Coupled Markov Decision Systems INFORMATION THEORY WORKSHOP (ITW) 22 Asynchronous Contro for Couped Marov Decision Systems Michae J. Neey University of Southern Caifornia Abstract This paper considers optima contro for a coection of

More information

A Brief Introduction to Markov Chains and Hidden Markov Models

A Brief Introduction to Markov Chains and Hidden Markov Models A Brief Introduction to Markov Chains and Hidden Markov Modes Aen B MacKenzie Notes for December 1, 3, &8, 2015 Discrete-Time Markov Chains You may reca that when we first introduced random processes,

More information

NEW DEVELOPMENT OF OPTIMAL COMPUTING BUDGET ALLOCATION FOR DISCRETE EVENT SIMULATION

NEW DEVELOPMENT OF OPTIMAL COMPUTING BUDGET ALLOCATION FOR DISCRETE EVENT SIMULATION NEW DEVELOPMENT OF OPTIMAL COMPUTING BUDGET ALLOCATION FOR DISCRETE EVENT SIMULATION Hsiao-Chang Chen Dept. of Systems Engineering University of Pennsyvania Phiadephia, PA 904-635, U.S.A. Chun-Hung Chen

More information

Schedulability Analysis of Deferrable Scheduling Algorithms for Maintaining Real-Time Data Freshness

Schedulability Analysis of Deferrable Scheduling Algorithms for Maintaining Real-Time Data Freshness 1 Scheduabiity Anaysis of Deferrabe Scheduing Agorithms for Maintaining Rea-Time Data Freshness Song Han, Deji Chen, Ming Xiong, Kam-yiu Lam, Aoysius K. Mok, Krithi Ramamritham UT Austin, Emerson Process

More information

Related Topics Maxwell s equations, electrical eddy field, magnetic field of coils, coil, magnetic flux, induced voltage

Related Topics Maxwell s equations, electrical eddy field, magnetic field of coils, coil, magnetic flux, induced voltage Magnetic induction TEP Reated Topics Maxwe s equations, eectrica eddy fied, magnetic fied of cois, coi, magnetic fux, induced votage Principe A magnetic fied of variabe frequency and varying strength is

More information

Combining reaction kinetics to the multi-phase Gibbs energy calculation

Combining reaction kinetics to the multi-phase Gibbs energy calculation 7 th European Symposium on Computer Aided Process Engineering ESCAPE7 V. Pesu and P.S. Agachi (Editors) 2007 Esevier B.V. A rights reserved. Combining reaction inetics to the muti-phase Gibbs energy cacuation

More information

14 Separation of Variables Method

14 Separation of Variables Method 14 Separation of Variabes Method Consider, for exampe, the Dirichet probem u t = Du xx < x u(x, ) = f(x) < x < u(, t) = = u(, t) t > Let u(x, t) = T (t)φ(x); now substitute into the equation: dt

More information

XSAT of linear CNF formulas

XSAT of linear CNF formulas XSAT of inear CN formuas Bernd R. Schuh Dr. Bernd Schuh, D-50968 Kön, Germany; bernd.schuh@netcoogne.de eywords: compexity, XSAT, exact inear formua, -reguarity, -uniformity, NPcompeteness Abstract. Open

More information

c 2007 Society for Industrial and Applied Mathematics

c 2007 Society for Industrial and Applied Mathematics SIAM REVIEW Vo. 49,No. 1,pp. 111 1 c 7 Society for Industria and Appied Mathematics Domino Waves C. J. Efthimiou M. D. Johnson Abstract. Motivated by a proposa of Daykin [Probem 71-19*, SIAM Rev., 13 (1971),

More information

Unconditional security of differential phase shift quantum key distribution

Unconditional security of differential phase shift quantum key distribution Unconditiona security of differentia phase shift quantum key distribution Kai Wen, Yoshihisa Yamamoto Ginzton Lab and Dept of Eectrica Engineering Stanford University Basic idea of DPS-QKD Protoco. Aice

More information

Width of Percolation Transition in Complex Networks

Width of Percolation Transition in Complex Networks APS/23-QED Width of Percoation Transition in Compex Networs Tomer Kaisy, and Reuven Cohen 2 Minerva Center and Department of Physics, Bar-Ian University, 52900 Ramat-Gan, Israe 2 Department of Computer

More information

A. Distribution of the test statistic

A. Distribution of the test statistic A. Distribution of the test statistic In the sequentia test, we first compute the test statistic from a mini-batch of size m. If a decision cannot be made with this statistic, we keep increasing the mini-batch

More information

Minimizing Total Weighted Completion Time on Uniform Machines with Unbounded Batch

Minimizing Total Weighted Completion Time on Uniform Machines with Unbounded Batch The Eighth Internationa Symposium on Operations Research and Its Appications (ISORA 09) Zhangiaie, China, September 20 22, 2009 Copyright 2009 ORSC & APORC, pp. 402 408 Minimizing Tota Weighted Competion

More information

MARKOV CHAINS AND MARKOV DECISION THEORY. Contents

MARKOV CHAINS AND MARKOV DECISION THEORY. Contents MARKOV CHAINS AND MARKOV DECISION THEORY ARINDRIMA DATTA Abstract. In this paper, we begin with a forma introduction to probabiity and expain the concept of random variabes and stochastic processes. After

More information

Sequential Decoding of Polar Codes with Arbitrary Binary Kernel

Sequential Decoding of Polar Codes with Arbitrary Binary Kernel Sequentia Decoding of Poar Codes with Arbitrary Binary Kerne Vera Miosavskaya, Peter Trifonov Saint-Petersburg State Poytechnic University Emai: veram,petert}@dcn.icc.spbstu.ru Abstract The probem of efficient

More information

<C 2 2. λ 2 l. λ 1 l 1 < C 1

<C 2 2. λ 2 l. λ 1 l 1 < C 1 Teecommunication Network Contro and Management (EE E694) Prof. A. A. Lazar Notes for the ecture of 7/Feb/95 by Huayan Wang (this document was ast LaT E X-ed on May 9,995) Queueing Primer for Muticass Optima

More information

Lecture Note 3: Stationary Iterative Methods

Lecture Note 3: Stationary Iterative Methods MATH 5330: Computationa Methods of Linear Agebra Lecture Note 3: Stationary Iterative Methods Xianyi Zeng Department of Mathematica Sciences, UTEP Stationary Iterative Methods The Gaussian eimination (or

More information

Approximate Bandwidth Allocation for Fixed-Priority-Scheduled Periodic Resources (WSU-CS Technical Report Version)

Approximate Bandwidth Allocation for Fixed-Priority-Scheduled Periodic Resources (WSU-CS Technical Report Version) Approximate Bandwidth Aocation for Fixed-Priority-Schedued Periodic Resources WSU-CS Technica Report Version) Farhana Dewan Nathan Fisher Abstract Recent research in compositiona rea-time systems has focused

More information

CS229 Lecture notes. Andrew Ng

CS229 Lecture notes. Andrew Ng CS229 Lecture notes Andrew Ng Part IX The EM agorithm In the previous set of notes, we taked about the EM agorithm as appied to fitting a mixture of Gaussians. In this set of notes, we give a broader view

More information

IN MODERN embedded systems, the ever-increasing complexity

IN MODERN embedded systems, the ever-increasing complexity IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 37, NO., JANUARY 208 2 Muticore Mixed-Criticaity Systems: Partitioned Scheduing and Utiization Bound Jian-Jun Han, Member,

More information

Separation of Variables and a Spherical Shell with Surface Charge

Separation of Variables and a Spherical Shell with Surface Charge Separation of Variabes and a Spherica She with Surface Charge In cass we worked out the eectrostatic potentia due to a spherica she of radius R with a surface charge density σθ = σ cos θ. This cacuation

More information

Stochastic Complement Analysis of Multi-Server Threshold Queues. with Hysteresis. Abstract

Stochastic Complement Analysis of Multi-Server Threshold Queues. with Hysteresis. Abstract Stochastic Compement Anaysis of Muti-Server Threshod Queues with Hysteresis John C.S. Lui The Dept. of Computer Science & Engineering The Chinese University of Hong Kong Leana Goubchik Dept. of Computer

More information

Problem set 6 The Perron Frobenius theorem.

Problem set 6 The Perron Frobenius theorem. Probem set 6 The Perron Frobenius theorem. Math 22a4 Oct 2 204, Due Oct.28 In a future probem set I want to discuss some criteria which aow us to concude that that the ground state of a sef-adjoint operator

More information

Determining The Degree of Generalization Using An Incremental Learning Algorithm

Determining The Degree of Generalization Using An Incremental Learning Algorithm Determining The Degree of Generaization Using An Incrementa Learning Agorithm Pabo Zegers Facutad de Ingeniería, Universidad de os Andes San Caros de Apoquindo 22, Las Condes, Santiago, Chie pzegers@uandes.c

More information

Uniprocessor Feasibility of Sporadic Tasks with Constrained Deadlines is Strongly conp-complete

Uniprocessor Feasibility of Sporadic Tasks with Constrained Deadlines is Strongly conp-complete Uniprocessor Feasibiity of Sporadic Tasks with Constrained Deadines is Strongy conp-compete Pontus Ekberg and Wang Yi Uppsaa University, Sweden Emai: {pontus.ekberg yi}@it.uu.se Abstract Deciding the feasibiity

More information

Componentwise Determination of the Interval Hull Solution for Linear Interval Parameter Systems

Componentwise Determination of the Interval Hull Solution for Linear Interval Parameter Systems Componentwise Determination of the Interva Hu Soution for Linear Interva Parameter Systems L. V. Koev Dept. of Theoretica Eectrotechnics, Facuty of Automatics, Technica University of Sofia, 1000 Sofia,

More information

More Scattering: the Partial Wave Expansion

More Scattering: the Partial Wave Expansion More Scattering: the Partia Wave Expansion Michae Fower /7/8 Pane Waves and Partia Waves We are considering the soution to Schrödinger s equation for scattering of an incoming pane wave in the z-direction

More information

A Simple and Efficient Algorithm of 3-D Single-Source Localization with Uniform Cross Array Bing Xue 1 2 a) * Guangyou Fang 1 2 b and Yicai Ji 1 2 c)

A Simple and Efficient Algorithm of 3-D Single-Source Localization with Uniform Cross Array Bing Xue 1 2 a) * Guangyou Fang 1 2 b and Yicai Ji 1 2 c) A Simpe Efficient Agorithm of 3-D Singe-Source Locaization with Uniform Cross Array Bing Xue a * Guangyou Fang b Yicai Ji c Key Laboratory of Eectromagnetic Radiation Sensing Technoogy, Institute of Eectronics,

More information

SydU STAT3014 (2015) Second semester Dr. J. Chan 18

SydU STAT3014 (2015) Second semester Dr. J. Chan 18 STAT3014/3914 Appied Stat.-Samping C-Stratified rand. sampe Stratified Random Samping.1 Introduction Description The popuation of size N is divided into mutuay excusive and exhaustive subpopuations caed

More information

$, (2.1) n="# #. (2.2)

$, (2.1) n=# #. (2.2) Chapter. Eectrostatic II Notes: Most of the materia presented in this chapter is taken from Jackson, Chap.,, and 4, and Di Bartoo, Chap... Mathematica Considerations.. The Fourier series and the Fourier

More information

An Algorithm for Pruning Redundant Modules in Min-Max Modular Network

An Algorithm for Pruning Redundant Modules in Min-Max Modular Network An Agorithm for Pruning Redundant Modues in Min-Max Moduar Network Hui-Cheng Lian and Bao-Liang Lu Department of Computer Science and Engineering, Shanghai Jiao Tong University 1954 Hua Shan Rd., Shanghai

More information

Schedulability Analysis of Deferrable Scheduling Algorithms for Maintaining Real-Time Data Freshness

Schedulability Analysis of Deferrable Scheduling Algorithms for Maintaining Real-Time Data Freshness 1 Scheduabiity Anaysis of Deferrabe Scheduing Agorithms for Maintaining Rea- Data Freshness Song Han, Deji Chen, Ming Xiong, Kam-yiu Lam, Aoysius K. Mok, Krithi Ramamritham UT Austin, Emerson Process Management,

More information

A Novel Learning Method for Elman Neural Network Using Local Search

A Novel Learning Method for Elman Neural Network Using Local Search Neura Information Processing Letters and Reviews Vo. 11, No. 8, August 2007 LETTER A Nove Learning Method for Eman Neura Networ Using Loca Search Facuty of Engineering, Toyama University, Gofuu 3190 Toyama

More information

Convergence Property of the Iri-Imai Algorithm for Some Smooth Convex Programming Problems

Convergence Property of the Iri-Imai Algorithm for Some Smooth Convex Programming Problems Convergence Property of the Iri-Imai Agorithm for Some Smooth Convex Programming Probems S. Zhang Communicated by Z.Q. Luo Assistant Professor, Department of Econometrics, University of Groningen, Groningen,

More information

A simple reliability block diagram method for safety integrity verification

A simple reliability block diagram method for safety integrity verification Reiabiity Engineering and System Safety 92 (2007) 1267 1273 www.esevier.com/ocate/ress A simpe reiabiity bock diagram method for safety integrity verification Haitao Guo, Xianhui Yang epartment of Automation,

More information

Lecture 6: Moderately Large Deflection Theory of Beams

Lecture 6: Moderately Large Deflection Theory of Beams Structura Mechanics 2.8 Lecture 6 Semester Yr Lecture 6: Moderatey Large Defection Theory of Beams 6.1 Genera Formuation Compare to the cassica theory of beams with infinitesima deformation, the moderatey

More information

Discrete Applied Mathematics

Discrete Applied Mathematics Discrete Appied Mathematics 159 (2011) 812 825 Contents ists avaiabe at ScienceDirect Discrete Appied Mathematics journa homepage: www.esevier.com/ocate/dam A direct barter mode for course add/drop process

More information

8 Digifl'.11 Cth:uits and devices

8 Digifl'.11 Cth:uits and devices 8 Digif'. Cth:uits and devices 8. Introduction In anaog eectronics, votage is a continuous variabe. This is usefu because most physica quantities we encounter are continuous: sound eves, ight intensity,

More information

4 Separation of Variables

4 Separation of Variables 4 Separation of Variabes In this chapter we describe a cassica technique for constructing forma soutions to inear boundary vaue probems. The soution of three cassica (paraboic, hyperboic and eiptic) PDE

More information

Time-dependent analysis of an M / M / c preemptive priority system with two priority classes Selen, J.; Fralix, B.H.

Time-dependent analysis of an M / M / c preemptive priority system with two priority classes Selen, J.; Fralix, B.H. Time-dependent anaysis of an M / M / c preemptive priority system with two priority casses Seen, J.; Fraix, B.H. Pubished in: Queueing Systems DOI: 1.17/s11134-17-9541-2 Pubished: 1/12/217 Document Version

More information

MULTI-PERIOD MODEL FOR PART FAMILY/MACHINE CELL FORMATION. Objectives included in the multi-period formulation

MULTI-PERIOD MODEL FOR PART FAMILY/MACHINE CELL FORMATION. Objectives included in the multi-period formulation ationa Institute of Technoogy aicut Department of echanica Engineering ULTI-PERIOD ODEL FOR PART FAILY/AHIE ELL FORATIO Given a set of parts, processing requirements, and avaiabe resources The objective

More information

New Efficiency Results for Makespan Cost Sharing

New Efficiency Results for Makespan Cost Sharing New Efficiency Resuts for Makespan Cost Sharing Yvonne Beischwitz a, Forian Schoppmann a, a University of Paderborn, Department of Computer Science Fürstenaee, 3302 Paderborn, Germany Abstract In the context

More information

FOURIER SERIES ON ANY INTERVAL

FOURIER SERIES ON ANY INTERVAL FOURIER SERIES ON ANY INTERVAL Overview We have spent considerabe time earning how to compute Fourier series for functions that have a period of 2p on the interva (-p,p). We have aso seen how Fourier series

More information

Throughput Optimal Scheduling for Wireless Downlinks with Reconfiguration Delay

Throughput Optimal Scheduling for Wireless Downlinks with Reconfiguration Delay Throughput Optima Scheduing for Wireess Downinks with Reconfiguration Deay Vineeth Baa Sukumaran vineethbs@gmai.com Department of Avionics Indian Institute of Space Science and Technoogy. Abstract We consider

More information

THE OUT-OF-PLANE BEHAVIOUR OF SPREAD-TOW FABRICS

THE OUT-OF-PLANE BEHAVIOUR OF SPREAD-TOW FABRICS ECCM6-6 TH EUROPEAN CONFERENCE ON COMPOSITE MATERIALS, Sevie, Spain, -6 June 04 THE OUT-OF-PLANE BEHAVIOUR OF SPREAD-TOW FABRICS M. Wysocki a,b*, M. Szpieg a, P. Heström a and F. Ohsson c a Swerea SICOMP

More information

The EM Algorithm applied to determining new limit points of Mahler measures

The EM Algorithm applied to determining new limit points of Mahler measures Contro and Cybernetics vo. 39 (2010) No. 4 The EM Agorithm appied to determining new imit points of Maher measures by Souad E Otmani, Georges Rhin and Jean-Marc Sac-Épée Université Pau Veraine-Metz, LMAM,

More information

In-plane shear stiffness of bare steel deck through shell finite element models. G. Bian, B.W. Schafer. June 2017

In-plane shear stiffness of bare steel deck through shell finite element models. G. Bian, B.W. Schafer. June 2017 In-pane shear stiffness of bare stee deck through she finite eement modes G. Bian, B.W. Schafer June 7 COLD-FORMED STEEL RESEARCH CONSORTIUM REPORT SERIES CFSRC R-7- SDII Stee Diaphragm Innovation Initiative

More information

Discrete Techniques. Chapter Introduction

Discrete Techniques. Chapter Introduction Chapter 3 Discrete Techniques 3. Introduction In the previous two chapters we introduced Fourier transforms of continuous functions of the periodic and non-periodic (finite energy) type, as we as various

More information

The influence of temperature of photovoltaic modules on performance of solar power plant

The influence of temperature of photovoltaic modules on performance of solar power plant IOSR Journa of Engineering (IOSRJEN) ISSN (e): 2250-3021, ISSN (p): 2278-8719 Vo. 05, Issue 04 (Apri. 2015), V1 PP 09-15 www.iosrjen.org The infuence of temperature of photovotaic modues on performance

More information

Inductive Bias: How to generalize on novel data. CS Inductive Bias 1

Inductive Bias: How to generalize on novel data. CS Inductive Bias 1 Inductive Bias: How to generaize on nove data CS 478 - Inductive Bias 1 Overfitting Noise vs. Exceptions CS 478 - Inductive Bias 2 Non-Linear Tasks Linear Regression wi not generaize we to the task beow

More information

Bayesian Learning. You hear a which which could equally be Thanks or Tanks, which would you go with?

Bayesian Learning. You hear a which which could equally be Thanks or Tanks, which would you go with? Bayesian Learning A powerfu and growing approach in machine earning We use it in our own decision making a the time You hear a which which coud equay be Thanks or Tanks, which woud you go with? Combine

More information

Analysis method of feeder partition capacity considering power supply security and distributed generation

Analysis method of feeder partition capacity considering power supply security and distributed generation The 6th Internationa Conference on Renewabe Power Generation (RPG) 19 20 October 2017 Anaysis method of feeder capacity considering power suppy security and distributed generation Weifu Wang 1, Zhaojing

More information

(1 ) = 1 for some 2 (0; 1); (1 + ) = 0 for some > 0:

(1 ) = 1 for some 2 (0; 1); (1 + ) = 0 for some > 0: Answers, na. Economics 4 Fa, 2009. Christiano.. The typica househod can engage in two types of activities producing current output and studying at home. Athough time spent on studying at home sacrices

More information

Interconnect effects on performance of Field Programmable Analog Array

Interconnect effects on performance of Field Programmable Analog Array nterconnect effects on performance of Fied Programmabe Anaog Array D. Anderson,. Bir, O. A. Pausinsi 3, M. Spitz, K. Reiss Motoroa, SPS, Phoenix, Arizona, USA, University of Karsruhe, Karsruhe, Germany,

More information

Discrete Techniques. Chapter Introduction

Discrete Techniques. Chapter Introduction Chapter 3 Discrete Techniques 3. Introduction In the previous two chapters we introduced Fourier transforms of continuous functions of the periodic and non-periodic (finite energy) type, we as various

More information

Tracking Control of Multiple Mobile Robots

Tracking Control of Multiple Mobile Robots Proceedings of the 2001 IEEE Internationa Conference on Robotics & Automation Seou, Korea May 21-26, 2001 Tracking Contro of Mutipe Mobie Robots A Case Study of Inter-Robot Coision-Free Probem Jurachart

More information

Recursive Constructions of Parallel FIFO and LIFO Queues with Switched Delay Lines

Recursive Constructions of Parallel FIFO and LIFO Queues with Switched Delay Lines Recursive Constructions of Parae FIFO and LIFO Queues with Switched Deay Lines Po-Kai Huang, Cheng-Shang Chang, Feow, IEEE, Jay Cheng, Member, IEEE, and Duan-Shin Lee, Senior Member, IEEE Abstract One

More information

Safety Evaluation Model of Chemical Logistics Park Operation Based on Back Propagation Neural Network

Safety Evaluation Model of Chemical Logistics Park Operation Based on Back Propagation Neural Network 1513 A pubication of CHEMICAL ENGINEERING TRANSACTIONS VOL. 6, 017 Guest Editors: Fei Song, Haibo Wang, Fang He Copyright 017, AIDIC Servizi S.r.. ISBN 978-88-95608-60-0; ISSN 83-916 The Itaian Association

More information

FRST Multivariate Statistics. Multivariate Discriminant Analysis (MDA)

FRST Multivariate Statistics. Multivariate Discriminant Analysis (MDA) 1 FRST 531 -- Mutivariate Statistics Mutivariate Discriminant Anaysis (MDA) Purpose: 1. To predict which group (Y) an observation beongs to based on the characteristics of p predictor (X) variabes, using

More information

MATH 172: MOTIVATION FOR FOURIER SERIES: SEPARATION OF VARIABLES

MATH 172: MOTIVATION FOR FOURIER SERIES: SEPARATION OF VARIABLES MATH 172: MOTIVATION FOR FOURIER SERIES: SEPARATION OF VARIABLES Separation of variabes is a method to sove certain PDEs which have a warped product structure. First, on R n, a inear PDE of order m is

More information

Theory and implementation behind: Universal surface creation - smallest unitcell

Theory and implementation behind: Universal surface creation - smallest unitcell Teory and impementation beind: Universa surface creation - smaest unitce Bjare Brin Buus, Jaob Howat & Tomas Bigaard September 15, 218 1 Construction of surface sabs Te aim for tis part of te project is

More information

Cryptanalysis of PKP: A New Approach

Cryptanalysis of PKP: A New Approach Cryptanaysis of PKP: A New Approach Éiane Jaumes and Antoine Joux DCSSI 18, rue du Dr. Zamenhoff F-92131 Issy-es-Mx Cedex France eiane.jaumes@wanadoo.fr Antoine.Joux@ens.fr Abstract. Quite recenty, in

More information

Optimization Based Bidding Strategies in the Deregulated Market

Optimization Based Bidding Strategies in the Deregulated Market Optimization ased idding Strategies in the Dereguated arket Daoyuan Zhang Ascend Communications, nc 866 North ain Street, Waingford, C 0649 Abstract With the dereguation of eectric power systems, market

More information

Source and Relay Matrices Optimization for Multiuser Multi-Hop MIMO Relay Systems

Source and Relay Matrices Optimization for Multiuser Multi-Hop MIMO Relay Systems Source and Reay Matrices Optimization for Mutiuser Muti-Hop MIMO Reay Systems Yue Rong Department of Eectrica and Computer Engineering, Curtin University, Bentey, WA 6102, Austraia Abstract In this paper,

More information

STABILITY OF A PARAMETRICALLY EXCITED DAMPED INVERTED PENDULUM 1. INTRODUCTION

STABILITY OF A PARAMETRICALLY EXCITED DAMPED INVERTED PENDULUM 1. INTRODUCTION Journa of Sound and Vibration (996) 98(5), 643 65 STABILITY OF A PARAMETRICALLY EXCITED DAMPED INVERTED PENDULUM G. ERDOS AND T. SINGH Department of Mechanica and Aerospace Engineering, SUNY at Buffao,

More information

Expectation-Maximization for Estimating Parameters for a Mixture of Poissons

Expectation-Maximization for Estimating Parameters for a Mixture of Poissons Expectation-Maximization for Estimating Parameters for a Mixture of Poissons Brandon Maone Department of Computer Science University of Hesini February 18, 2014 Abstract This document derives, in excrutiating

More information

Scalable Spectrum Allocation for Large Networks Based on Sparse Optimization

Scalable Spectrum Allocation for Large Networks Based on Sparse Optimization Scaabe Spectrum ocation for Large Networks ased on Sparse Optimization innan Zhuang Modem R&D Lab Samsung Semiconductor, Inc. San Diego, C Dongning Guo, Ermin Wei, and Michae L. Honig Department of Eectrica

More information

BP neural network-based sports performance prediction model applied research

BP neural network-based sports performance prediction model applied research Avaiabe onine www.jocpr.com Journa of Chemica and Pharmaceutica Research, 204, 6(7:93-936 Research Artice ISSN : 0975-7384 CODEN(USA : JCPRC5 BP neura networ-based sports performance prediction mode appied

More information

u(x) s.t. px w x 0 Denote the solution to this problem by ˆx(p, x). In order to obtain ˆx we may simply solve the standard problem max x 0

u(x) s.t. px w x 0 Denote the solution to this problem by ˆx(p, x). In order to obtain ˆx we may simply solve the standard problem max x 0 Bocconi University PhD in Economics - Microeconomics I Prof M Messner Probem Set 4 - Soution Probem : If an individua has an endowment instead of a monetary income his weath depends on price eves In particuar,

More information

Numerical solution of one dimensional contaminant transport equation with variable coefficient (temporal) by using Haar wavelet

Numerical solution of one dimensional contaminant transport equation with variable coefficient (temporal) by using Haar wavelet Goba Journa of Pure and Appied Mathematics. ISSN 973-1768 Voume 1, Number (16), pp. 183-19 Research India Pubications http://www.ripubication.com Numerica soution of one dimensiona contaminant transport

More information

A Statistical Framework for Real-time Event Detection in Power Systems

A Statistical Framework for Real-time Event Detection in Power Systems 1 A Statistica Framework for Rea-time Event Detection in Power Systems Noan Uhrich, Tim Christman, Phiip Swisher, and Xichen Jiang Abstract A quickest change detection (QCD) agorithm is appied to the probem

More information

Automobile Prices in Market Equilibrium. Berry, Pakes and Levinsohn

Automobile Prices in Market Equilibrium. Berry, Pakes and Levinsohn Automobie Prices in Market Equiibrium Berry, Pakes and Levinsohn Empirica Anaysis of demand and suppy in a differentiated products market: equiibrium in the U.S. automobie market. Oigopoistic Differentiated

More information

Data Mining Technology for Failure Prognostic of Avionics

Data Mining Technology for Failure Prognostic of Avionics IEEE Transactions on Aerospace and Eectronic Systems. Voume 38, #, pp.388-403, 00. Data Mining Technoogy for Faiure Prognostic of Avionics V.A. Skormin, Binghamton University, Binghamton, NY, 1390, USA

More information

UI FORMULATION FOR CABLE STATE OF EXISTING CABLE-STAYED BRIDGE

UI FORMULATION FOR CABLE STATE OF EXISTING CABLE-STAYED BRIDGE UI FORMULATION FOR CABLE STATE OF EXISTING CABLE-STAYED BRIDGE Juan Huang, Ronghui Wang and Tao Tang Coege of Traffic and Communications, South China University of Technoogy, Guangzhou, Guangdong 51641,

More information

Research on liquid sloshing performance in vane type tank under microgravity

Research on liquid sloshing performance in vane type tank under microgravity IOP Conference Series: Materias Science and Engineering PAPER OPEN ACCESS Research on iquid soshing performance in vane type tan under microgravity Reated content - Numerica simuation of fuid fow in the

More information

First-Order Corrections to Gutzwiller s Trace Formula for Systems with Discrete Symmetries

First-Order Corrections to Gutzwiller s Trace Formula for Systems with Discrete Symmetries c 26 Noninear Phenomena in Compex Systems First-Order Corrections to Gutzwier s Trace Formua for Systems with Discrete Symmetries Hoger Cartarius, Jörg Main, and Günter Wunner Institut für Theoretische

More information

( ) is just a function of x, with

( ) is just a function of x, with II. MULTIVARIATE CALCULUS The first ecture covered functions where a singe input goes in, and a singe output comes out. Most economic appications aren t so simpe. In most cases, a number of variabes infuence

More information

On the evaluation of saving-consumption plans

On the evaluation of saving-consumption plans On the evauation of saving-consumption pans Steven Vanduffe Jan Dhaene Marc Goovaerts Juy 13, 2004 Abstract Knowedge of the distribution function of the stochasticay compounded vaue of a series of future

More information

Optimal Control of Assembly Systems with Multiple Stages and Multiple Demand Classes 1

Optimal Control of Assembly Systems with Multiple Stages and Multiple Demand Classes 1 Optima Contro of Assemby Systems with Mutipe Stages and Mutipe Demand Casses Saif Benjaafar Mohsen EHafsi 2 Chung-Yee Lee 3 Weihua Zhou 3 Industria & Systems Engineering, Department of Mechanica Engineering,

More information

Algorithms to solve massively under-defined systems of multivariate quadratic equations

Algorithms to solve massively under-defined systems of multivariate quadratic equations Agorithms to sove massivey under-defined systems of mutivariate quadratic equations Yasufumi Hashimoto Abstract It is we known that the probem to sove a set of randomy chosen mutivariate quadratic equations

More information

An approximate method for solving the inverse scattering problem with fixed-energy data

An approximate method for solving the inverse scattering problem with fixed-energy data J. Inv. I-Posed Probems, Vo. 7, No. 6, pp. 561 571 (1999) c VSP 1999 An approximate method for soving the inverse scattering probem with fixed-energy data A. G. Ramm and W. Scheid Received May 12, 1999

More information

4 1-D Boundary Value Problems Heat Equation

4 1-D Boundary Value Problems Heat Equation 4 -D Boundary Vaue Probems Heat Equation The main purpose of this chapter is to study boundary vaue probems for the heat equation on a finite rod a x b. u t (x, t = ku xx (x, t, a < x < b, t > u(x, = ϕ(x

More information

Approximated MLC shape matrix decomposition with interleaf collision constraint

Approximated MLC shape matrix decomposition with interleaf collision constraint Agorithmic Operations Research Vo.4 (29) 49 57 Approximated MLC shape matrix decomposition with intereaf coision constraint Antje Kiese and Thomas Kainowski Institut für Mathematik, Universität Rostock,

More information

Reliability Improvement with Optimal Placement of Distributed Generation in Distribution System

Reliability Improvement with Optimal Placement of Distributed Generation in Distribution System Reiabiity Improvement with Optima Pacement of Distributed Generation in Distribution System N. Rugthaicharoencheep, T. Langtharthong Abstract This paper presents the optima pacement and sizing of distributed

More information

Efficiently Generating Random Bits from Finite State Markov Chains

Efficiently Generating Random Bits from Finite State Markov Chains 1 Efficienty Generating Random Bits from Finite State Markov Chains Hongchao Zhou and Jehoshua Bruck, Feow, IEEE Abstract The probem of random number generation from an uncorreated random source (of unknown

More information

High Efficiency Development of a Reciprocating Compressor by Clarification of Loss Generation in Bearings

High Efficiency Development of a Reciprocating Compressor by Clarification of Loss Generation in Bearings Purdue University Purdue e-pubs Internationa Compressor Engineering Conference Schoo of Mechanica Engineering 2010 High Efficiency Deveopment of a Reciprocating Compressor by Carification of Loss Generation

More information

arxiv: v1 [math.ca] 6 Mar 2017

arxiv: v1 [math.ca] 6 Mar 2017 Indefinite Integras of Spherica Besse Functions MIT-CTP/487 arxiv:703.0648v [math.ca] 6 Mar 07 Joyon K. Boomfied,, Stephen H. P. Face,, and Zander Moss, Center for Theoretica Physics, Laboratory for Nucear

More information

Modal analysis of a multi-blade system undergoing rotational motion

Modal analysis of a multi-blade system undergoing rotational motion Journa of Mechanica Science and Technoogy 3 (9) 5~58 Journa of Mechanica Science and Technoogy www.springerin.com/content/738-494x DOI.7/s6-9-43-3 Moda anaysis of a muti-bade system undergoing rotationa

More information

Improving the Reliability of a Series-Parallel System Using Modified Weibull Distribution

Improving the Reliability of a Series-Parallel System Using Modified Weibull Distribution Internationa Mathematica Forum, Vo. 12, 217, no. 6, 257-269 HIKARI Ltd, www.m-hikari.com https://doi.org/1.12988/imf.217.611155 Improving the Reiabiity of a Series-Parae System Using Modified Weibu Distribution

More information

Consistent linguistic fuzzy preference relation with multi-granular uncertain linguistic information for solving decision making problems

Consistent linguistic fuzzy preference relation with multi-granular uncertain linguistic information for solving decision making problems Consistent inguistic fuzzy preference reation with muti-granuar uncertain inguistic information for soving decision making probems Siti mnah Binti Mohd Ridzuan, and Daud Mohamad Citation: IP Conference

More information

(This is a sample cover image for this issue. The actual cover is not yet available at this time.)

(This is a sample cover image for this issue. The actual cover is not yet available at this time.) (This is a sampe cover image for this issue The actua cover is not yet avaiabe at this time) This artice appeared in a journa pubished by Esevier The attached copy is furnished to the author for interna

More information

Centralized Coded Caching of Correlated Contents

Centralized Coded Caching of Correlated Contents Centraized Coded Caching of Correated Contents Qianqian Yang and Deniz Gündüz Information Processing and Communications Lab Department of Eectrica and Eectronic Engineering Imperia Coege London arxiv:1711.03798v1

More information

PHYS 110B - HW #1 Fall 2005, Solutions by David Pace Equations referenced as Eq. # are from Griffiths Problem statements are paraphrased

PHYS 110B - HW #1 Fall 2005, Solutions by David Pace Equations referenced as Eq. # are from Griffiths Problem statements are paraphrased PHYS 110B - HW #1 Fa 2005, Soutions by David Pace Equations referenced as Eq. # are from Griffiths Probem statements are paraphrased [1.] Probem 6.8 from Griffiths A ong cyinder has radius R and a magnetization

More information

Simulation of single bubble rising in liquid using front tracking method

Simulation of single bubble rising in liquid using front tracking method Advances in Fuid Mechanics VI 79 Simuation o singe bubbe rising in iquid using ront tracking method J. Hua & J. Lou Institute o High Perormance Computing, #01-01 The Capricorn, Singapore Abstract Front

More information

Approximated MLC shape matrix decomposition with interleaf collision constraint

Approximated MLC shape matrix decomposition with interleaf collision constraint Approximated MLC shape matrix decomposition with intereaf coision constraint Thomas Kainowski Antje Kiese Abstract Shape matrix decomposition is a subprobem in radiation therapy panning. A given fuence

More information

IE 361 Exam 1. b) Give *&% confidence limits for the bias of this viscometer. (No need to simplify.)

IE 361 Exam 1. b) Give *&% confidence limits for the bias of this viscometer. (No need to simplify.) October 9, 00 IE 6 Exam Prof. Vardeman. The viscosity of paint is measured with a "viscometer" in units of "Krebs." First, a standard iquid of "known" viscosity *# Krebs is tested with a company viscometer

More information

Stochastic Variational Inference with Gradient Linearization

Stochastic Variational Inference with Gradient Linearization Stochastic Variationa Inference with Gradient Linearization Suppementa Materia Tobias Pötz * Anne S Wannenwetsch Stefan Roth Department of Computer Science, TU Darmstadt Preface In this suppementa materia,

More information

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. 15, NO. 2, FEBRUARY

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. 15, NO. 2, FEBRUARY IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. 5, NO. 2, FEBRUARY 206 857 Optima Energy and Data Routing in Networks With Energy Cooperation Berk Gurakan, Student Member, IEEE, OmurOze,Member, IEEE,

More information

Multilayer Kerceptron

Multilayer Kerceptron Mutiayer Kerceptron Zotán Szabó, András Lőrincz Department of Information Systems, Facuty of Informatics Eötvös Loránd University Pázmány Péter sétány 1/C H-1117, Budapest, Hungary e-mai: szzoi@csetehu,

More information

A Fictitious Time Integration Method for a One-Dimensional Hyperbolic Boundary Value Problem

A Fictitious Time Integration Method for a One-Dimensional Hyperbolic Boundary Value Problem Journa o mathematics and computer science 14 (15) 87-96 A Fictitious ime Integration Method or a One-Dimensiona Hyperboic Boundary Vaue Probem Mir Saad Hashemi 1,*, Maryam Sariri 1 1 Department o Mathematics,

More information