HIGH-SPEED MULTI OPERAND ADDITION UTILIZING FLAG BITS VIBHUTI DAVE DEPARTMENT OF ELECTRICAL AND COMPUTER ENGINEERING

Size: px
Start display at page:

Download "HIGH-SPEED MULTI OPERAND ADDITION UTILIZING FLAG BITS VIBHUTI DAVE DEPARTMENT OF ELECTRICAL AND COMPUTER ENGINEERING"

Transcription

1 HIGH-SPEED MULTI OPERAND ADDITION UTILIZING FLAG BITS BY VIBHUTI DAVE DEPARTMENT OF ELECTRICAL AND COMPUTER ENGINEERING Submtted n partal fulfllment of the requrements for the degree of Doctor of Phlosophy n Computer Engneerng n the Graduate College of the Illnos Insttute of Technology Approved Advser Co Advser Chcago, Illnos May 2007

2

3 ACKNOWLEDGEMENT I would le to than my mentor Dr. Erdal Orulu for hs constant support and undue fath n me. I hghly apprecate the tme he has nvested durng my research and for the completon of ths dssertaton. Ths dssertaton would not have been possble wthout Dr. Jafar Sane, my advsor and hs attempts to challenge me throughout my academc program, encouragng me when I was successful and pushng me to do better when I fell short. I would also le to than Dr. Dmtros Velens and Dr. James Stne for ther constructve crtcsm about my wor and helpng me to perform better. A specal thans to the commttee members for ther support and tme.

4 TABLE OF CONTENTS Page ACKNOWLEDGEMENT... LIST OF TABLES... v LIST OF FIGURES... v ABSTRACT... x CHAPTER 1. INTRODUCTION Motvaton Goals Structure of Thess DESIGN CRITERIA AND IMPLICATIONS Arthmetc Operatons and Unts Crcut and Layout Desgn Technques Automated Crcut Synthess and Optmzaton Crcut Complexty and Performance Measures Summary ADDER DESIGNS Bt Adders Carry Propagate Adders Carry Select Adders Carry Sp Adders Carry Save Adders Parallel Prefx Adders Summary LOGICAL EFFORT Delay n a Logc Gate Multstage Logc Networs Choosng the Best Number of Stages v

5 4.4 Summary of the Method Summary FLAGGED PREFIX ADDITION Bacground Theory of Fagged Prefx Addton Implementaton of a Flagged Prefx Adder Modfcatons to a Prefx Adder Delay Performance of a Flagged Prefx Adder Fxed Pont Arthmetc Applcatons Summary THREE - INPUT ADDITION Carry - Save Adders Mult - Operand Adders Flag Logc Computaton Constant Addton Three - Input Addton Gate Count Summary ANALYSIS AND SIMULATION Logcal Effort Smulaton Results Summary CONCLUSIONS AND FUTURE WORK BIBLIOGRAPHY v

6 LIST OF TABLES Table Page 4.1 Logcal Effort for nputs of statc CMOS gates Estmates of parastc delay for logc gates Best Number of stages for path efforts Summary of terms and equatons for Logcal Effort Selecton Table for a Flagged Prefx Adder Flag and Carry Logc Based on Thrd Operand Flag Logc utlzng Carry from the Prefx Tree Mnmum Flag Logc Gates Logc Gate Combnatons Gate Count for All Adder Implementatons Logcal Effort and Path Delays for Adder Blocs Logcal Effort and Path Delays for Gates wthn Adder Blocs Logcal Effort Estmates for Conventonal Adder Desgns Logcal Effort Estmates for Flagged Adder Desgns Logcal Effort Estmates for Three Input Flagged Adder Desgns Post Layout Estmates for Conventonal Adder Archtectures Post Layout Estmates for Flagged Adder Archtectures Post Layout Estmates for Enhanced Adder Archtectures wth Constant Addton Post Layout Estmates for Three - Input Adder Archtectures v

7 LIST OF FIGURES Fgure Page 3.1 (m,) Counter Symbol and Logc for Half Adder Symbol and Logc for Full Adder Rpple Carry Adder m-bt Condtonal Sum Adder Carry-Select Adder Carry-Sp Bloc Carry-Sp Adder Optmal Sze for Carry-Sp Adder Carry-Save Adder Parallel Prefx Adder Logc and Symbol for Pre-Processng Gates Logc and Symbol for Prefx Tree Gates Logc and Symbol for Post-Processng Gates Brent-Kung Prefx Tree Ladner-Fscher Prefx Tree Kogge-Stone Prefx Tree Electrcal Effort vs. Delay Dual Adder Desgn Flagged Prefx Adder Flagged Inverson Cells v

8 5.4 Output Cell Logc for the Flagged Prefx Adder Bloc Dagram of a Sgn Magntude Adder Symbol and Schematc of Carry Save Adder Four Operand Carry - Propagate Adder Array Four Operand Carry Save Adder Array wth Fnal CPA Typcal Array Adder Structure for Mult - Operand Addton Flag Inverson Cells for Constant Addton Flag Inverson Cells for Three - Input Addton Full Adder used wthn Carry-Sp and Carry-Select, and Carry-Save Adders One Bt FIC for Three-Input Addton Dot Dagram of a Carry - Save Adder Area Results for Conventonal Adder Desgns Delay Results for Conventonal Adder Desgns Power Results for Conventonal Adder Desgns Area Results for Enhanced Adder Desgns Delay Results for Enhanced Adder Desgns Power Results for Enhanced Adder Desgns Area Results for Three Input Adder Desgns Delay Results for Three - Input Adder Desgns Power Results for Three - Input Adder Desgns Area Results for 16 - bt Desgns Delay Results for 16 - bt Desgns Power Results for 16 - bt Desgns v

9 7.16 Area Results for 32 - bt Desgns Delay Results for 32 - bt Desgns Power Results for 32 - bt Desgns Area Results for 64 - bt Desgns Delay Results for 64 - bt Desgns Power Results for 64 - bt Desgns x

10 ABSTRACT The goal of ths research s to desgn arthmetc crcuts that meet the challenges faced by computer archtects durng the desgn of hgh performance embedded systems. The focus s narrowed down to addton algorthms and the desgn of hgh speed adder archtectures. Addton s one of the most basc operatons performed n all computng unts, ncludng mcroprocessors and dgtal sgnal processors. It s also a basc unt utlzed n varous complcated algorthms of multplcaton and dvson. Varous adder archtectures for bnary addton have been nvestgated, n vew of a wde range of performance characterstcs, whch nclude delay, area, power, the sze of the nput operands, and the number of nput operands. Effcent mplementaton of an adder crcut usually revolves around reducng the cost to propagate the carry between successve bt postons. The problem of carry propagaton s elmnated by expressng addton as a prefx computaton. The resultng adder crcuts are called parallel prefx adders. Based on the advantages posed by the prefx scheme, a qualtatve evaluaton of three dfferent exstng prefx adders (Brent Kung, Ladner Fscher, and Kogge Stone) has been performed. A technque to enhance the functonalty of these basc desgns has also been nvestgated enablng ther utlzaton n nteger arthmetc. The technque has been named after the new set of ntermedate outputs that are generated as part of the algorthm, called the flag bts. Ths technque targets a new desgn crtera; the number of results obtaned at the output. An algorthm based on flag bts has been proposed to acheve three-operand addton whch has proven to have a favorable overall performance wth respect to conventonal mult-operand addton schemes. x

11 1 CHAPTER 1 INTRODUCTION 1.1 Motvaton Besdes technologcal scalng, advances n the feld of computer archtecture have also contrbuted to the exponental growth n performance of dgtal computer hardware. The flp-sde of the rsng processor performance s an unprecedented ncrease n hardware and software complexty. Increasng complexty leads to hgh development costs, dffculty wth testablty and verfablty, and less adaptablty. The challenge n front of computer desgners s therefore to opt for smpler, robust, and easly certfable crcuts. Computer arthmetc, here plays a ey role adng computer archtects wth ths challenge. It s one of the oldest sub-felds of computer archtecture. The bul of hardware n earler computers resded n the accumulator and other arthmetc/logc crcuts. Successful operaton of computer arthmetc crcuts was taen for granted and hgh performance of these crcuts has been routnely expected. Ths context has been changng due to varous reasons. Frst, at very hgh cloc rates, the nterfaces between arthmetc crcuts and the rest of the processor become crtcal. Arthmetc crcuts can no longer be desgned and verfed n solaton. Rather an ntegrated desgn optmzaton s requred. Second, optmzng arthmetc crcuts to meet the desgn goals by tang advantage of the strengths of new technologes, and mang them tolerant to the weaness, requres a re-examnaton of exstng desgn paradgms. Fnally, ncorporaton of hgher-level arthmetc prmtves nto hardware maes the desgn, optmzaton and verfcaton efforts hghly complex and nterrelated.

12 2 The core of every mcroprocessor, dgtal sgnal processor (DSP), and dataprocessng applcaton-specfc ntegrated crcut (ASIC) s ts datapath. Wth respect to the most mportant desgn crtera; crtcal delay, chp sze, and power dsspaton, the datapath s a crucal crcut component. The datapath comprses of varous arthmetc unts, such as comparators, adders, and multpler [43]. The bass of every complex arthmetc operaton s bnary addton. Hence, t can be concluded, that bnary addton s one of the most mportant arthmetc operaton. The hardware mplementaton of an adder becomes even more crtcal due to the expensve carry-propagaton step, the evaluaton tme of whch s dependent on the operand word length. The effcent mplementaton of the addton operaton n an ntegrated crcut s a ey problem n VLSI desgn [58]. Productvty n ASIC desgn s constantly mproved by the use of cell-based desgn technques such as standard cells, gate arrays, and feld programmable gate arrays (FPGA), and low-level and hgh-level hardware synthess [13]. Ths ass for adder archtectures whch result n effcent cell-based crcut realzatons whch can easly be syntheszed. Furthermore, they should provde enough flexblty n order to accommodate custom tmng and area constrants as well as to allow the mplementaton of customzed adders. 1.2 Goals The followng goals have been formulated for ths research: Establsh the performance crtera for an adder desgn whch nclude; speed, area, power, sze of nput operands, number of nput operands, and number of useful results obtaned at the output.

13 3 Performance evaluaton of conventonal adder archtectures and compare them, wth focus on crcut mplementaton. Study of mathematcs nvolved behnd Flagged Prefx Adders [8] and performance evaluaton of ths adder desgn wth dfferent prefx trees. Derve the logc to ncrease the number of nput operands a prefx adder can add. Desgn a three-nput bnary adder utlzng the concept of flag bts, where the thrd operand s applcaton-ndependent. The hardware that needs to be ncorporated wthn a bnary adder wll be adder-ndependent. Performance evaluaton of the proposed desgn wth respect to varous adder archtectures n terms of delay, area, and power. Obtan delay and area estmates from synthess and verfy results by applcaton of Logcal Effort [50] to the proposed desgn. 1.3 Structure of the Thess As a startng pont, the basc desgn crtera are dscussed and ther mplcatons establshed n Chapter 2. Optmzaton technques have been presented whch are utlzed for the fnal adder desgn. It s substantated why cell-based combnatonal adders and ther synthess are mportant n VLSI desgn. Chapter 3 ntroduces conventonal adder archtectures whch nclude the Carry - Propagate, Carry - Select, Carry - Sp, and Parallel - Prefx adders [31]. The Carry-Save adder whch s a mult-operand adder has also been ntroduced. It s ths adder archtecture that s used as a benchmar to evaluate the performance of the proposed technque n ths thess.

14 4 Chapter 4 gves a bref ntroducton to the method of logcal effort and ts applcaton to dgtal crcuts. Ths forms the bass of verfcaton of results obtaned va synthess. The theory of flagged prefx addton [6] s ntroduced n Chapter 5. It also dscusses the applcatons of flagged prefx adders and the advantages presented by these desgns. Chapter 6 extends the theory of flagged prefx addton to enhance the functonalty of a bnary adder to three-nput addton, not lmtng the number of nput operands to two. The necessary hardware mplementaton s derved, ntally consderng that the thrd nput s a constant. The resultng adder desgns are called Enhanced Flagged Bnary Adders (EFBA) [17]. Ths s followed by a hardware optmzaton to enable three-nput addton ndependent of whether the thrd operand s a constant or a varable. The fnal adder desgns are referred to as Three Input Prefx Adders (TIFPA). Chapter 7 nvestgates the performance of conventonal, flagged prefx, EFBA, and TIFPA desgns theoretcally as well as based on smulaton results. The method of logcal effort s appled to all the desgns to verfy synthess results wth the analytcal results. Conclusons are drawn and potental for future wor s presented n Chapter 8.

15 5 CHAPTER 2 DESIGN CRITERIA AND IMPLICATIONS Ths chapter formulates the motvaton for the research presented n ths thess by focusng on questons le: Why s the effcent mplementaton of bnary adders mportant? What wll be the ey layout desgn technologes n the future, and why do cell-based desgn technques, such as standard cells, gan more and more mportance? Why s hardware synthess becomng a ey ssue n VLSI desgn? How can area, delay, and power measurements of combnatonal crcuts be estmated and optmzed? How can the performance and complexty of adder crcuts be modeled by tang nto account archtectural, crcut, layout, and technology aspects? Ths chapter summarzes the technques, advantages, and dsadvantages of technques that are utlzed durng the desgn and mplementatons of dgtal logc crcuts. 2.1 Arthmetc Operatons and Unts The tass of a VLSI chp are the processng of data and the control of nternal or external system components. Ths s typcally done by algorthms whch are based on logc and arthmetc operatons on data tems [10]. Applcatons of arthmetc operatons n ntegrated crcuts are manfold. Mcroprocessors and DSPs typcally contan adders and multplers n ther datapath. Specal crcut unts for fast dvson and square-root operatons are sometmes ncluded as well. Adders, ncrementers/decrementers, and comparators are often used for address calculaton and flag generaton purposes n controllers. ASICs use arthmetc unts for the same purposes. Dependng on ther applcaton, they may even requre dedcated crcut components for specal arthmetc

16 6 operators, such as for fnte feld arthmetc used n cryptography, error correcton codng, and sgnal processng. Some of the basc arthmetc operatons are lsted below [58] Shft/extenson operatons Equalty and magntude comparson Incrementaton / Decrementaton Negaton Addton/ Subtracton Multplcaton Dvson Square root Exponentaton Logarthmc Functons Trgonometrc Functons For trgonometrc and logarthmc functons as well as for exponentaton, varous teratve algorthms exst whch mae use of smpler arthmetc operatons. Multplcaton, dvson and square root can be performed usng seral or parallel methods. In both methods, the computaton s reduced to a sequence of condtonal addtons/subtractons and shft operatons. Exstng speed-up technques try to reduce the number of requred addton/subtracton operatons to mprove ther speed. Subtracton corresponds to the addton of a negated operand. The addton of two n-bt bnary numbers can be regarded as an elementary operaton. The algorthm for negaton of a number depends on the chosen number representaton [42] and s usually accomplshed by bt nverson and ncrementaton.

17 7 Increment and decrement operatons are smplfed addtons wth one nput operand beng constantly 1 or -1. Equalty and magntude comparson operatons can also be regarded as smplfed addtons wth only some of the respectve addton flags, but no sum bts are used as outputs [58]. Ths short overvew shows that the addton s the ey arthmetc operaton, whch most other operatons are based on. Its mplementaton n hardware s therefore crucal for the effcent realzaton of almost every arthmetc unt n VLSI. Ths s n terms of crcut sze, computaton delay, and power consumpton. Addton s a prefx problem [34], whch means that each result s dependent on all nput bts of equal or lower magntude. Propagaton of a carry sgnal from each bt poston to all hgher bt postons s necessary. Carry - propagate adders [31] perform ths operaton mmedately. The requred carry - propagaton from the least to the most sgnfcant bt results n a consderable crcut delay, whch s a functon of the word length of the nput operands [58]. The most effcent way to speed up addton s to avod carry propagaton thus savng the carres for later processng. Ths allows the addton of two or more numbers n a very short tme, but yelds results n a redundant number representaton [20]. The redundant representaton forms the bass of carry-save adders [20]. They play an mportant role n the effcent mplementaton of mult-operand addton crcuts. They are very fast, ther structure smple, but the potental for further optmzaton s mnmal. The bnary carry-propagate adder therefore, s one of the most often used and most crucal buldng blocs n dgtal VLSI desgn. Varous well-nown methods exst for speedng up carry - propagaton n adders, offerng very dfferent performance

18 8 characterstcs, advantages and dsadvantages. Instances of adder archtectures targetng mportant desgn crtera are lsted below. Delay: Kogge-Stone parallel prefx adder [32] Area: Carry-rpple adder [44] Power: 2 - level carry-sp adder [7] Sze of nput operands: Brent-Kung parallel prefx adder [5] Number of nput operands: Carry-save mult-operand adder [42] Number of results at output: Flagged parallel-prefx adder [8] The performance measure of each algorthm s also dependent on desgn technques that are employed to create each crcut. The next secton presents the crcut and layout desgn technques. 2.2 Crcut and Layout Desgn Technques IC fabrcaton technologes can be classfed nto full-custom, sem-custom, and programmable ICs. Further dstnctons are made wth respect to crcut desgn technques and layout desgn technques, whch are strongly related [58] Layout-Based Desgn Technques. In layout-based desgn technques, dedcated full-custom layout s drawn manually for crcuts desgned at the transstor level. The ntal desgn effort s very hgh, but maxmum crcut performance and layout effcency s acheved. Full-custom cells are entrely desgned by hand for dedcated hgh performance unts, e.g., arthmetc unts. The tled-layout technque can be used to smplfy, automate, and parameterze the layout tas. For reuse purpose, the crcuts and layouts are often collected n lbrares together wth automatc generators. Mega-cells are full-custom cells for unversal functons whch need no parameterzaton. Macro-cells are

19 9 used for large crcut components wth regular structure and need word-length parameterzaton. Datapaths are usually realzed n a bt-slced layout style, whch allows parameterzaton of word length and concatenaton of arbtrary datapath elements for logc, arthmetc, and storage functons. Snce adders are too small to be mplemented as macro cells, they are usually realzed as data-path elements Cell-Based Desgn Technques. At a hgher level of abstracton, arbtrary crcuts can be composed from elementary logc gates and storage elements contaned n a lbrary of pre-desgned cells. The layout s automatcally composed from correspondng layout cells usng dedcated layout strateges, dependng on the used IC technology. Cellbased desgn technques are used n standard-cell, gate-array, sea-of-gates, and feldprogrammable gate-array (FPGA) technologes. The desgn of logc crcuts does not dffer consderably among the dfferent cell-based IC technologes. Crcuts are obtaned from schematc entry, behavoral synthess, or crcut generators (structural synthess). Due to the requred generc propertes of the cells, more conventonal logc styles have to be used for ther crcut mplementaton [58]. The advantages of cell-based desgn technques le n ther unversal usage, automated synthess and layout generaton for arbtrary crcuts, portablty between tools and lbrares, hgh desgn productvty, hgh-relablty, and hgh flexblty n floorplannng. Ths comes at the prce of lower crcut performance wth respect to speed and area. Cell-based desgn technques are manly used for the mplementaton of random logc and custom crcuts for whch no approprate lbrary component are avalable and custom mplementaton proves costly. Cell-based desgn technques are wdely used n the ASIC desgn communty.

20 10 Standard Cells Standard cells [22] [49] represent the hghest performance cell-based technology. The layout of the cells s full-custom, whch mandates for full-custom fabrcaton of the wafers. Ths n turn enables the combnaton of standard cells wth custom-layout components on the same de. For layout generaton, the standard cells are placed n rows and connected through ntermedate routng channels. Wth the ncreasng number of routng layers and the over-the-cell routng capabltes n modern process technologes, the layout densty of standard cells gets close to the densty obtaned from full-custom layout. The remanng drawbac s the restrcted use of hgh-performance crcut technques [58]. Gate-arrays and sea-of gates On gate-arrays and sea-of gates, pre-processed wafers wth unconnected crcut elements are used. Thus, only metallzaton used for the nterconnect s customzed, resultng n lower producton costs and faster turnaround tmes. Crcut performance and layout flexblty s lower than for standard cells, whch n partcular decreases mplementaton effcency of regular structures such as macro-cells. FPGA Cells Feld-programmable gate-arrays (FPGA) [29] are electroncally programmable generc ICs. They are organzed as an array of logc blocs and routng channels, and the confguraton s stored n a statc memory or programmed e.g., usng ant-fuses. Agan, a lbrary of logc cells and macros allow flexble and effcent desgn of arbtrary crcuts. Turnaround tmes are very fast mang FPGAs the deal soluton for rapd prototypng. On the other hand, low crcut performance, lmted crcut complexty, and hgh de costs

21 11 severely lmt ther area of applcaton [29]. The followng secton gves a synopss of the method employed for the completon of ths research 2.3 Automated Crcut Synthess and Optmzaton Crcut synthess denotes the automated generaton of logc networs from behavoral descrptons at an arbtrary level. Synthess s becomng a ey ssue n VLSI desgn for many reasons. Increasng crcut complextes, shorter development tmes, as well as effcent and flexble usage of cell and component lbrares can only be handled wth the ad of powerful desgn automaton tools. Arthmetc synthess addresses the effcent mappng of arthmetc functons onto exstng arthmetc components and logc gates Hgh-Level Synthess. Hgh-level synthess or behavoral synthess allows the translaton of algorthmc or behavoral descrptons of hgh abstracton level down to Regster Transfer Logc (RTL) representaton, whch can be processed further by lowlevel synthess tools. Hgh-level arthmetc synthess maes use of arthmetc transformatons n order to optmze hardware usage under gven performance crtera. Thereby, arthmetc lbrary components are regarded as resources for mplementng the basc arthmetc operatons [58] Low-Level Synthess. Low-level synthess or logc synthess translates an RTL specfcaton nto a generc logc networ. For random logc, synthess s acheved by establshng the logc equatons for all outputs and mplementng them n a logc networ Data-Path Synthess. Effcent arthmetc crcuts contan very specfc structures of large logc depth and hgh factorzaton degree. Ther drect synthess from logc equatons s not feasble. Therefore, parameterzed netlst generators usng dedcated

22 12 algorthms are used nstead. Most synthess tools nclude generators for the basc arthmetc functons, such as comparators, ncrementers, adders, and multplers. For other mportant operatons, such as squarng and dvson, usually no generators are provded and thus synthess of effcent crcutry s not avalable. Also the performance of the commonly used archtectures vares consderably, whch often leads to sub-optmal cell-based crcut mplementatons [58] Optmzaton of Combnatonal Crcuts. The optmzaton of combnatonal crcuts connotes the automated mnmzaton of a logc netlst wth respect to delay, area, and power dsspaton measures of the resultng crcut, and the technology mappng. The appled algorthms are very powerful for optmzaton of random logc by performng steps le flattenng, logc mnmzaton [29], tmng-drven factorzaton, and technology mappng. However the potental for optmzaton s lmted for networs wth large logc depth and hgh factorzaton degree, especally arthmetc crcuts. Therefore, only local logc mnmzaton s possble, leavng the global crcut archtecture bascally unchanged. Thus the realzaton of well-performng arthmetc crcuts reles more on effcent datapath synthess than on smple logc optmzaton Hardware Descrpton Languages. Hardware descrpton languages (HDL) allow the specfcaton of hardware at dfferent levels of abstracton, servng as entry ponts to hardware synthess. Verlog s one of the most wdely used and powerful languages. It enables the descrpton of crcuts at the behavoral and structural level. Synthess of arthmetc unts s ntated by usng standard arthmetc operator symbols n the Verlog code [45] for whch the correspondng bult-n netlst generators are called by the synthess tool. The advantages of utlzng Verlog over schematc entry le n the

23 13 possblty of behavoral hardware descrpton, the parameterzablty crcuts, and portablty of code. Now that t has been establshed how to desgn the schematcs, the next secton focuses on the desgn crtera for VLSI crcuts. 2.4 Crcut Complexty and Performance Measures One mportant aspect n desgn automaton s the complexty and performance estmaton of a crcut. At a hgher desgn level, ths s acheved by usng characterzaton nformaton of the hgh-level components to be used and by complexty estmaton of the nterconnect [58]. At gate-level, however, estmaton s more dffcult and less accurate because crcut sze and performance strongly depend on the gate-level synthess results and on the physcal cell arrangement and routng. Estmates of the expected area, speed, and power dsspaton for a compled cellbased crcut can be found as a functon of the operand word length Area. Slcon area on a VLSI chp s taen up by the actve crcut elements and ther nterconnectons. In cell-based desgn technques, the followng crtera for area modelng can be formulated [19]: Total crcut complexty (GE total ) can be measured by the number of gate equvalents. (1GE 1 2-nput NAND gate 4 MOSFETS) Crcut Area (A crcut ) s occuped by logc cells and nter-cell wrng. (A crcut = A cells + A wrng ) Total cell area (A cells ) s proportonal to the number of transstors or GE total contaned n a crcut. Ths number s nfluenced by technology mappng. (A cells α GE total ) Wrng area (A wrng ) s proportonal to the total wre length. (A wrng α L total )

24 14 Total wre length (L total ) can be estmated from the number of nodes and the average wre length of a node, or more accurately from the sum of cell fan-out and the average wre length of cell to cell connectons. The wre lengths also depend on crcut sze, crcut connectvty, and layout topology. (L total α FO total ) Cell fan out (FO) s the number of cell nputs a cell output s drvng. Fan-n s the number of nputs to a cell, whch for many combnatonal gates s proportonal to the sze of the cell. Snce the sum of the cell fan-out (FO total ) of a crcut s equvalent to the sum of the cell fan-n t s also proportonal to the crcut sze. (FO total α GE total ) Delay. Propagaton delay n a crcut s determned by the cell and nterconnecton delays on the crtcal path. Indvdual cell and node values are relevant for path delays. Crtcal path evaluaton s done by statc tmng analyss whch nvolves graph-based search algorthms. Tmngs are also dependent on temperature, voltage, and process parameters [54]. Maxmum delay (t crt-path ) of a crcut s equal to the sum of cell nertal delays, cell output ramp delays, and wre delays on the crtcal path. (t crt-path = Σє crtpath ((t cell + t ramp ) + Σє crt-path t wre ) Cell delay (t cell ) depends on the transstor-level crcut mplementaton and the complexty of a cell. All smple gates have comparable delays. Complex gates usually contan tree-le crcut and transstor arrangements, resultng n logarthmc delay to area dependences. (t cell α log(a cell )

25 15 Ramp Delay (t ramp ) s the tme t taes for a cell output to drve the attached capactve load, whch s made up of nterconnect and cell nput loads. The ramp delay depends lnearly on the capactve load attached, whch n turn depends lnearly on the fan-out of the cell. (t ramp α FO cell ) Wre delay (t wre ) s the RC-delay of a wre, whch depends on the wre length. RC delays are neglgble compared to cell and ramp delays for small crcuts such as adders. (t wre =0) A rough delay estmaton s possble by consderng szes and wth a small weghtng factor, fan-out of the cells on the crtcal path. (t crt-path α Σє crt-path (log (A cell ) + FO cell ) Power. An ncreasngly mportant parameter for VLSI crcuts s power dsspaton. Pea power s a problem wth respect to crcut relablty whch, however, can be dealt wth by careful desgn. On the other hand, average power dsspaton s becomng a crucal desgn constrant n many modern applcatons, such as hghperformance mcroprocessors and portable applcatons, due to the heat removal problems and power budget lmtatons. The followng prncples hold for average power dsspaton n synchronous CMOS crcuts [30]: Total power (P total ) n CMOS crcuts s domnated by the dynamc swtchng of crcut elements, whereas dynamc short-crcut currents and statc leaage are of less mportance. Thus, power dsspaton can be assumed proportonal to the total capactance to be swtched, the square of the supply voltage, the

26 16 cloc frequency, and the swtchng actvty α n a crcut. (P total = ½. C total. V 2 dd. f cl. α ) Total capactance (C total ) n a CMOS crcut s the sum of the capactances from transstor gates, sources, and drans and from wrng. Thus, the total capactance s proportonal to the number of transstors and the amount of wrng, both of whch are roughly proportonal to crcut sze. (C total α GE total ) Supply voltage (V dd ) and cloc frequency (f cl ) can be regarded as constant wthn a crcut and therefore are not relevant n out crcut comparsons. (V dd, f cl = constant) The swtchng actvty factor (α) gves a measure for the number of transent nodes per cloc cycle and depends on nput patterns and crcut characterstcs. In many cases, nput patterns to datapaths and arthmetc unts are assumed to be random, whch result n an average transton actvty of 50% on all nputs. Sgnal propagaton through several levels of combnatonal logc may decrease or ncrease transton actvtes, dependng on crcut structure. (α = constant) For arthmetc unts wth constant nput swtchng actvty, power dsspaton s approxmately proportonal to crcut sze. (P total α GE total ) 2.5 Summary Arthmetc unts belong to the basc and most crucal buldng blocs n many ntegrated crcuts, and ther performance depends on the effcent hardware mplementaton of the underlyng arthmetc operatons. Advances n computer-aded desgn as well as the ever growng desgn productvty demands tend to prefer cell-based

27 17 desgn technques and hardware synthess, also for arthmetc components. The followng chapter wll dscuss several adder desgns that formed the bass of such arthmetc components.

28 18 CHAPTER 3 ADDER DESIGNS Addton s the most common arthmetc operaton and also serves as the buldng bloc for syntheszng all other operatons. Wthn dgtal computers, addton s performed extensvely both, n explctly specfed computaton steps and as a part of mplct ones dctated by ndexng and other forms of address arthmetc [43]. In smple ALUs due to the lac of dedcated hardware for fast multplcaton and dvson, these latter operatons are performed as sequences of addtons. Subtracton s normally performed by negatng the subtrahend and addng the result to the mnuend. Ths s qute natural, gven that an adder must handle sgned numbers anyway. Ths chapter ntroduces the basc prncples and crcut structures used for the addton of sngle bts and of two or more multple bnary numbers. Bnary carry - propagate addton s formulated as a prefx problem, and the fundamental algorthms and speed-up technques for the effcent solutons of ths problem have been descrbed Bt Adders As the basc combnatonal addton structure, a 1-bt adder computes the sum of m nput bts of the same magntude. It s also called an (m,) counter (See Fg. 3.1) because t counts the number of 1s at the m nputs (a m-1, a m-2,, a 0 ) and outputs a bt sum (s -1, s -2,, s 0 ), where = log( m +1) same s as follows [58] 1 j= 0. The arthmetc equaton representng the m 1 j 2 s = a (3.1) j = 0

29 19 a a m-1 (m,).... s -1 s 0 Fgure 3.1. (m,) counter Half-Adder (2,2)-Counter. The half adder s a (2,2) counter. The most sgnfcant sum bt s called the carry out, c out because t carres an overflow to the next hgher bt poston. Fgure 3.2 depcts the logc symbol and the crcut mplementaton of the halfadder. The correspondng arthmetc and logc equatons are as follows n equatons 3.2 and 3.3 respectvely. 2 c out + s = a + b s = ( a + b) mod 2 (3.2) c out = ( a + b) dv2 = 1/ 2( a + b s) s = a b c out = a b (3.3)

30 20 a b a b c out HA s s c out Fgure 3.2. Symbol and Logc Crcut for Half Adder Full Adder (3,2)-Counter. The full adder s a (3,2) counter. The thrd nput s called c n snce t represents the carry bt from a hgher bt poston. Important nternal sgnals wthn a full adder are the bt generate and the bt propagate sgnals represented by g and p respectvely. The generate sgnal represents whether the carry sgnal, 0 or 1 wll be generated wthn the full adder. The propagate sgnal ndcates f the carry- n at the nput of the full adder wll be propagated to the carry-out of the full adder unchanged. The arthmetc and logc equatons for all the sgnals encountered wthn a full adder are gven n equatons 3.4 and 3.5 respectvely [58]. 2 c + s = a + b + out c n s = a + b + c ( n ) mod 2 (3.4) c out = ( a + b + c ) dv2 = 1/ 2( a + b + c s) n n g = a b p = a b s = a b c = p n c n (3.5) c n = a b + ac n + bc n = g + p c n

31 21 Fgure 3.3 depcts the logc symbol and the mplementaton crcut for a full adder desgn utlzed for the purpose of ths thess. a a b g HA c out FA c n c out p b c n s HA s Fgure 3.3. Symbol and Logc Crcut for Full Adder 3.2 Carry Propagate Adders A carry-propagate adder adds 2 n-bt operands, A=(a n-1, a n-2,, a 0 ) and B=(b n-1, b n-2,, b 0 ) and an optonal carry-n, c n by performng carry-propagaton. The result s an rredundant (n+1)-bt number consstng of the n-bt sum S=(s n-1, s n-2,, s 0 ) and a carryout, c out. Equaton 3.6 represents the arthmetc equatons for a conventonal carrypropagate adder. The logc equatons are represented n equaton 3.7 [58]. n 2 c + S = A + B + c out n 2 n 1 n 1 n n c + out s = a b + = 0 = 0 = 0 c n (3.6) g = a b p = a b s = p c (3.7) c +1 = g + p c ; =0, 1,, n-1

32 22 Also note that, c 0 =c n and c n =c out. The carry propagate adder can be mplemented as a combnatonal crcut usng n full adders connected n seres (See Fg. 3.4) and s called a Rpple-Carry Adder. A 3 B 3 A 2 B 2 A 1 A 0 B 0 B 1 C Cout C 3 C bt adder 1-bt adder 1-bt adder 1-bt adder C n S 3 S 2 S 1 S 0 Fgure 3.4. Rpple Carry Adder The followng expresson depcts the latency of an n-bt rpple carry adder[42]. T rpple add = TFA ( a, b c+ 1) + ( n 2) TFA ( c c+ 1) + TFA ( c s ) (3.8) where T FA (nput->output) represents the latency of a full adder on the path between ts specfed nput and output. As an approxmaton to the foregong, t can be sad that the latency of a rpple carry adder s nt FA. 3.3 Carry Select Adders One of the earlest logarthmc tme adder desgns s based on the condtonal - sum addton algorthm. In ths scheme, blocs of bts are added n two ways: assumng an ncomng carry of 0 or of 1, wth the correct outputs selected later as the bloc s true carry-n becomes nown. Ths s one of the speed-up technques that s used n order to reduce the latency of carry propagaton as seen wth the rpple-carry adder. Wth each level of selecton, the number of nown output bts doubles, leadng to a logarthmc number of levels and thus logarthmc tme addton as opposed to the lnear tme addton of a carry-propagate adder. An analyss of ths scheme s presented next.

33 23 The basc problem faced n speedng up carry propagaton s the fast processng of a late carry nput. Snce ths carry-n can have only two values, 0 and 1, the two possble addton results can be pre-computed and selected afterwards by the late carry-n usng small and constant tme. The n- bt adder s broen down nto groups of m bts. Each group of m-bts are added utlzng what are called m-bt condtonal adders. An m-bt condtonal adder s shown n Fgure 3.5 [20]. A B Condtonal Adder m m m-bt ADDER 1 m-bt ADDER 0 m+1 m ( C, S ) m ( 0, S 0 ) C m Fgure 3.5. m-bt Condtonal Sum Adder Equaton 3.9 [20] gves a mathematcal representaton of a condtonal sum adder ( C ( C 0 m 1 m, S, S 0 1 ) = ADD( A, B, c ) = ADD( A, B, c 0 0 = 0) = 1) (3.9) Here, A, B and S are all m-bt vectors. C 0 m represents the m-bt vector of all the carry outputs assumng c 0 =0. Smlarly, C 1 m represents the m-bt vector of all the carry outputs assumng c 0 =1. S 0 and S 1 represent the m-bt vector consstng of all the sum bts assumng c 0 =0 and c 0 =1 respectvely. Such m-bt condtonal adders are then combned as shown n Fgure 3.6 to obtan a carry-select adder.

34 24 c 0ut A 12:15 B 12:15 A 8:11 B 8:11 A 4:7 B 4:7 A 3:0 B 3: bt c 4-bt c 4-bt 12 8 c Condtonal 4 Condtonal Condtonal Adder Adder Adder 4-bt Adder c S 15:12 S 11:8 S 7:4 S 3:0 Fgure 3.6. Carry-Select Adder From Fgures 3.5 and 3.6, t can be deduced that, each carry - select adder s composed of an ntal full adder (fa), at the LSB poston, a seres of full adders (bfa) that generate 2 sets of results for each possble value of the carry-sgnal. Each group wthn the carry-select adder wll also consst of an ntal full adder (bfa) at the group LSB, and a fnal carry-generator (bcg) at the group MSB. c pb and c tb denote the carry-out of the prevous and the current bloc respectvely. The logcal equatons for a carry select adder are gven as n equatons [58]. Ifa c tb = a0b0 + a0c0 + b0c0 (3.10) bfa g p c c s s s = a b p = c = a b p = g = g = = s 0 pb + p + c s 1 pb (3.11)

35 25 bfa g p c c s s s = a b = c = a b = g = g s 0 pb + p c p c = p c = p c + c 0 1 s 1 pb (3.12) bcg c (3.13) = tb c+ 1 c pbc+ 1 The delay of carry-select adder therefore s represented as n equaton 3.14 [42]. n T = mt + ( 1) T m carry sel FA MUX (3.14) T MUX represents the delay of a multplexer. The advantage of utlzng a carry - select adder s that the delay rses logarthmcally nstead of lnearly. However, t also has a hgh hardware overhead snce t requres double the number of carry-propagate adders. In addton, t also requres a set (n/m-1) set of multplexers. 3.4 Carry Sp Adder The carry - sp adder s obtaned by a modfcaton of the rpple - carry adder. The objectve s to reduce the worst-case delay by reducng the number of full adder cells through whch the carry has to propagate. To acheve ths, the adder s dvded nto groups of m bts and the carry nto group j+1 s determned by one of the followng two condtons [20] 1. The carry s propagated by group j. That s, the carry-out of group j s equal to the carry-n of that group. Ths stuaton occurs only when the sum of the m nputs to that group s equal to 2 1.

36 26 2. The carry s not propagated by the group (that s, t s generated or lled nsde the group). Consequently, to reduce the length of the propagaton of the carry, a sp networ s provded for each group of m bts so that when a carry s propagated by ths group, the sp networ maes the carry bypass the group. The m-bt adder s shown n 3.7, and a networ of these modules mplementng an n-bt adder s ndcated n Fgure 3.8. The analytcal and logcal analyss s presented next. A : B : c 0 +1 CPA 1 c P : S : Fgure 3.7. Carry-Sp Bloc Carry computaton for a sngle bt poston, c +1 = p g + p c can be reformulated for a group of bts as n equaton 3.15 c = P c + Pc ' + 1 : (3.15) A 12:15 B 12:15 A 8:11 B 8:11 A 4:7 B 4:7 A 3:0 B 3: c out 4-bt CSK c 12 4-bt CSK c 8 4-bt CSK c 4 4-bt CSK c 0 S 12:15 S 8:11 S 4:7 S 3:0 Fgure 3.8. Carry-Sp Adder

37 27 where P : denotes the group propagate of the carry-propagate adder and acts as a select sgnal n the multplexer structure and s gven by P : 1 = AND p. c s the carry - out of j= j the partal carry propagate adder. Two cases can be dstngushed [58]: P : =0 : The carry c +1 s generated wthn the carry propagate adder and selected by the multplexer as carry - out c +1. The carry - n c does not propagate through the carry propagate adder to the carry out c +1 P : =0 : The carry - n c propagates through the carry propagate adder to c +1 but s not selected by the multplexer. It sps the carry propagate adder and s drectly selected as the carry - out c +1 nstead. Thus the combnatonal path from the carry - n through the carry-propagate adder s never actvated. In other words, the slow carry - chan path from the carry - n to the carry - out through the carry propagate adder s broen by the adder tself or the multplexer. The resultng carry - sp addton bloc, therefore s a regular carry propagate adder wth a small and constant tme delay from c c + 1 whch s why t speeds up carry propagaton. Note that that the multplexer n ths crcut s logcally redundant,.e., the sgnals c +1 and c +1 are logcally equvalent and dffer only n sgnal delays. The carry - n, c has a reconvergent fan-out. Ths nherent logc-redundancy results n a false longest path whch leads from the carry-n through the carry-propagate adder to the carry-out. Ths poses a problem n automatc logc optmzaton and statc tmng analyss. Redundancy removal technques exst whch are based on duplcaton of the carry-chan n the carrypropagate adder: one carry-chan computes the carry-out wthout a carry-n, whle the other taes the carry-n for calculaton of the sum bts. Ths scheme however sgnfes a consderable amount of addtonal logc compared to the redundant carry-sp scheme.

38 28 The worst case delay of a carry-sp adder from Fgures 3.7 and 3.8 can be formulated as [7] n T CSK = mtc + tmux + 2 t mux + ( m 1) tc + t m s (3.16) Here, T CSK represents the delay of the carry-sp adder, t mux s the propagaton delay of the multplexer, t c s the delay from the nputs of the full adder to the carryoutput and t s s the delay from the nputs to the sum output. m s the sze of the groups nto whch the adder s dvded and n s the sze of the nput operands. As can be seen from equaton 3.16, the delay of the adder depends on the sze of the group m. In order to acheve mnmum delay, the sze of group m can be found by dfferentatng equaton The optmal value of m s represented below [26] m = t / t n (3.17) opt mux 2 c Ths analyss assumes that all groups are of the same sze. However, ths does not produce mnmum delay. Ths s due to the fact that, for nstance, carres generated n the frst group have to traverse more sp networs to get to the last group than carres generated n some nternal group. To determne the worst case, delays of all propagaton chans need to be compared. A partcular chan s ntated n group and termnates n group j, wth j, beng propagated by the j--1 groups n between. Consequently, f group, has sze m, T = max(( m + m 1) t + ( j 1) t ) + t + t CSK, j j c mux mux s (3.18) wth Σm =n. The worst case delay, therefore can be reduced by reducng the sze of the groups close to the begnnng and end, as llustrated n Fgure 3.9.

39 29 Group Sze m M-1 M number of groups 0 Group Fgure 3.9. Optmal Group Szes for Carry-Sp Adder 3.5 Carry Save Adder A row of bnary full adders can be vewed as a mechansm to reduce three numbers to two numbers to ther sum. Fgure 3.10 shows the relatonshp of a rpple-carry adder for the latter reducton and a carry-save adder for the former. The dotted lnes represent the flow of the carry sgnals wthn a rpple carry adder. In the carry-save adder, the carry sgnals are separate outputs whch are then passed on to another level of a carrypropagate adder along wth the S bts to generate the fnal sum [15]. A 3 B 3 A 2 B 2 A 1 A 0 B 0 B 1 C out C 1 C 2 1-bt adder 1-bt adder 1-bt adder 1-bt adder C 3 C n S 0 C 1 C 2 C 3 S 1 S 2 S 3 Fgure Carry-Save Adder The basc dea s to perform an addton of three bnary vectors usng an array of one - bt adders wthout propagatng the carres. The sum s a redundant n - dgt carry - save

40 30 number, consstng of two bnary numbers S (sum bts) and C (carry bts). A carry - save adder accepts three bnary nput operands or, alternatvely, one bnary and one carry - save operand. It has a constant delay ndependent of n. Mathematcally [58], = + = = = = = + = = + 2 0, 1, j j j n j n n n a s c a s c A A A S C (3.19) where =0,1,.,n Parallel Prefx Adders The addton of two bnary numbers can be formulated as a prefx problem. The correspondng parallel-prefx algorthms can be used for speedng up bnary addton and for llustratng and understandng varous addton prncples. Ths secton ntroduces a mathematcal and vsual formalsm for prefx problems and algorthms Prefx Problems. In a prefx problem, n outputs (y n-1, y n-2,., y 0 ) are computed from n nputs (x n-1, x n-2,.,x 0 ) usng an arbtrary assocatve operator as follows [21]: x x x x y x x x y x x y x y n n n = = = = (3.20) The problem can also be formulated recursvely: 1; 0 0 = = y x y x y 1,2,..., 1 = n (3.21)

41 31 In other words, n a prefx problem, every output depends on all nputs of equal or lower magntude, and every nput nfluences all outputs of equal or hgher magntude. Due to the assocatvty of the prefx operator, the ndvdual operatons can be carred out n any order. In partcular, sequences of operatons can be grouped n order to solve the prefx problem partally and n parallel groups (.e., sequences) of nput bts (x, x -1,.,x ), resultng n the group varables Y :. At hgher levels, sequences of group varables can agan be evaluated, yeldng m levels of ntermedate group varable Y l :. It denotes the prefx result of bts (x, x -1,.,x ) at level l. The group varables of the last level m must cover all bts from to 0 (Y m :) and therefore represent the results of the prefx problem [58]. Y Y y 0 : l : = x = Y = Y l 1 : j + 1 m :0 ; l 1 Y j:, = 0,1,..., n 1 j ; l = 1,2,..., m (3.22) Note, that for j=, the group varable Y l-1 : s unchanged (.e., Y l : = Y l-1 :). Snce prefx problems descrbe a combnatonal nput-to output relatonshp, they can be solved by logc networs, whch wll be the major focus n the followng text. Varous seral and parallel algorthms exst for solvng prefx problems, dependng on the bt groupng propertes n equaton They result n very dfferent sze and delay performance measures when mapped onto a logc networ Prefx Algorthms. Two categores of prefx algorthms can be dstngushed; the seral prefx, and the tree-prefx algorthms. Tree-prefx algorthms nclude parallelsm for calculaton speed-up, and therefore form the category of parallel-prefx algorthms.

42 32 Equaton 3.21 represents a seral algorthm for solvng the prefx problem. The seral-prefx algorthm needs a mnmal number of bnary operatons and s nherently slow (O(n)). Accordng to equaton 3.20, all outputs can be computed separately and n parallel. By arrangng the operatons n a tree structure, the computaton tme for each output can be reduced to O(log n). However, the overall number of operatons to be evaluated and wth that the hardware costs grow wth (O(n 2 )) f ndvdual evaluaton trees are used for each output. As a tradeoff, the ndvdual output evaluaton trees can be merged (.e., common sub-expressons be shared) to a certan degree accordng to dfferent tree-prefx algorthms, reducng the area complexty to O(n log n) or even O(n). Bnary addton has been presented as a prefx computaton next. The prefx problem of bnary carry-propagate addton computes the generaton and propagaton of carry sgnals. The ntermedate prefx varables can have three dfferent values.e., generate a carry 0 (or ll a carry 1), generate a carry 1, and propagate the carry-n. The varables are coded by two bts, group generate l G :, and group propagate l P :. The generate/propagate form a sgnal par l l l Y : G:, P : = at level l. The ntal prefx sgnal pars ( o o G P : :, ) correspondng to the bt generate g and bt propagate p sgnals have to be computed from the addton nput operands n a pre processng step. The prefx sgnal pars of level l are then calculated from the sgnals of level l-1 by an arbtrary prefx algorthm usng the bnary operaton [58] l l l 1 l 1 l 1 l 1 ( G, P ) = ( G, P ) ( G, P ) (3.23) : : : j : j j: j:

43 33 = G l 1 l 1 l 1 : j + P : j G j:, P l 1 : j P l 1 j: In the prefx tree, there are n columns, correspondng to the number of nput bts. The gates performng the operaton and whch wor n parallel are arranged n the same row, and smlarly, the same gates connected n seres are placed n consecutve rows. Thus, the number of rows m corresponds to the number of bnary operatons to be evaluated n seres. The outputs of row l are the group varables l l l Y : G:, P : =. The generate/propagate sgnals from the last prefx stage ( m m G P : :, ) are used to compute the carry sgnals c. The sum bts, s are fnally obtaned from a post processng step. The parallel prefx adders therefore can be represented as shown n Fgure 3.11 A n-1:0 B n-1:0 Pre-Preprocessng g n-1:0 p n-1:0 c n-1:0 Prefx Carry Tree Post-processng S n-1:0 Fgure Parallel Prefx Adder Combnng equatons 3.22 and 3.23 yelds the followng generate-propagatebased addton prefx formalsm [58]: g p = a b = a b ; = 0,1,..., n 1 (3.24)

TOPICS MULTIPLIERLESS FILTER DESIGN ELEMENTARY SCHOOL ALGORITHM MULTIPLICATION

TOPICS MULTIPLIERLESS FILTER DESIGN ELEMENTARY SCHOOL ALGORITHM MULTIPLICATION 1 2 MULTIPLIERLESS FILTER DESIGN Realzaton of flters wthout full-fledged multplers Some sldes based on support materal by W. Wolf for hs book Modern VLSI Desgn, 3 rd edton. Partly based on followng papers:

More information

Department of Electrical & Electronic Engineeing Imperial College London. E4.20 Digital IC Design. Median Filter Project Specification

Department of Electrical & Electronic Engineeing Imperial College London. E4.20 Digital IC Design. Median Filter Project Specification Desgn Project Specfcaton Medan Flter Department of Electrcal & Electronc Engneeng Imperal College London E4.20 Dgtal IC Desgn Medan Flter Project Specfcaton A medan flter s used to remove nose from a sampled

More information

Lecture 4: Adders. Computer Systems Laboratory Stanford University

Lecture 4: Adders. Computer Systems Laboratory Stanford University Lecture 4: Adders Computer Systems Laboratory Stanford Unversty horowtz@stanford.edu Copyrght 2004 by Mark Horowtz (w/ Fgures from Hgh-Performance Mcroprocessor Desgn IEEE And Fgures from Bora Nkolc 1

More information

Clock-Gating and Its Application to Low Power Design of Sequential Circuits

Clock-Gating and Its Application to Low Power Design of Sequential Circuits Clock-Gatng and Its Applcaton to Low Power Desgn of Sequental Crcuts ng WU Department of Electrcal Engneerng-Systems, Unversty of Southern Calforna Los Angeles, CA 989, USA, Phone: (23)74-448 Massoud PEDRAM

More information

A New Design of Multiplier using Modified Booth Algorithm and Reversible Gate Logic

A New Design of Multiplier using Modified Booth Algorithm and Reversible Gate Logic Internatonal Journal of Computer Applcatons Technology and Research A New Desgn of Multpler usng Modfed Booth Algorthm and Reversble Gate Logc K.Nagarjun Department of ECE Vardhaman College of Engneerng,

More information

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module 3 LOSSY IMAGE COMPRESSION SYSTEMS Verson ECE IIT, Kharagpur Lesson 6 Theory of Quantzaton Verson ECE IIT, Kharagpur Instructonal Objectves At the end of ths lesson, the students should be able to:

More information

CHAPTER 5 NUMERICAL EVALUATION OF DYNAMIC RESPONSE

CHAPTER 5 NUMERICAL EVALUATION OF DYNAMIC RESPONSE CHAPTER 5 NUMERICAL EVALUATION OF DYNAMIC RESPONSE Analytcal soluton s usually not possble when exctaton vares arbtrarly wth tme or f the system s nonlnear. Such problems can be solved by numercal tmesteppng

More information

NUMERICAL DIFFERENTIATION

NUMERICAL DIFFERENTIATION NUMERICAL DIFFERENTIATION 1 Introducton Dfferentaton s a method to compute the rate at whch a dependent output y changes wth respect to the change n the ndependent nput x. Ths rate of change s called the

More information

Logical Effort of Higher Valency Adders

Logical Effort of Higher Valency Adders Logcal Effort of gher Valency Adders Davd arrs arvey Mudd College E. Twelfth St. Claremont, CA Davd_arrs@hmc.edu Abstract gher valency parallel prefx adders reduce the number of logc levels at the expense

More information

Chapter - 2. Distribution System Power Flow Analysis

Chapter - 2. Distribution System Power Flow Analysis Chapter - 2 Dstrbuton System Power Flow Analyss CHAPTER - 2 Radal Dstrbuton System Load Flow 2.1 Introducton Load flow s an mportant tool [66] for analyzng electrcal power system network performance. Load

More information

Note 10. Modeling and Simulation of Dynamic Systems

Note 10. Modeling and Simulation of Dynamic Systems Lecture Notes of ME 475: Introducton to Mechatroncs Note 0 Modelng and Smulaton of Dynamc Systems Department of Mechancal Engneerng, Unversty Of Saskatchewan, 57 Campus Drve, Saskatoon, SK S7N 5A9, Canada

More information

Speeding up Computation of Scalar Multiplication in Elliptic Curve Cryptosystem

Speeding up Computation of Scalar Multiplication in Elliptic Curve Cryptosystem H.K. Pathak et. al. / (IJCSE) Internatonal Journal on Computer Scence and Engneerng Speedng up Computaton of Scalar Multplcaton n Ellptc Curve Cryptosystem H. K. Pathak Manju Sangh S.o.S n Computer scence

More information

Structure and Drive Paul A. Jensen Copyright July 20, 2003

Structure and Drive Paul A. Jensen Copyright July 20, 2003 Structure and Drve Paul A. Jensen Copyrght July 20, 2003 A system s made up of several operatons wth flow passng between them. The structure of the system descrbes the flow paths from nputs to outputs.

More information

A Novel, Low-Power Array Multiplier Architecture

A Novel, Low-Power Array Multiplier Architecture A Noel, Low-Power Array Multpler Archtecture by Ronak Bajaj, Saransh Chhabra, Sreehar Veeramachanen, MB Srnas n 9th Internatonal Symposum on Communcaton and Informaton Technology 29 (ISCIT 29) Songdo -

More information

Uncertainty in measurements of power and energy on power networks

Uncertainty in measurements of power and energy on power networks Uncertanty n measurements of power and energy on power networks E. Manov, N. Kolev Department of Measurement and Instrumentaton, Techncal Unversty Sofa, bul. Klment Ohrdsk No8, bl., 000 Sofa, Bulgara Tel./fax:

More information

Grover s Algorithm + Quantum Zeno Effect + Vaidman

Grover s Algorithm + Quantum Zeno Effect + Vaidman Grover s Algorthm + Quantum Zeno Effect + Vadman CS 294-2 Bomb 10/12/04 Fall 2004 Lecture 11 Grover s algorthm Recall that Grover s algorthm for searchng over a space of sze wors as follows: consder the

More information

( ) = ( ) + ( 0) ) ( )

( ) = ( ) + ( 0) ) ( ) EETOMAGNETI OMPATIBIITY HANDBOOK 1 hapter 9: Transent Behavor n the Tme Doman 9.1 Desgn a crcut usng reasonable values for the components that s capable of provdng a tme delay of 100 ms to a dgtal sgnal.

More information

Difference Equations

Difference Equations Dfference Equatons c Jan Vrbk 1 Bascs Suppose a sequence of numbers, say a 0,a 1,a,a 3,... s defned by a certan general relatonshp between, say, three consecutve values of the sequence, e.g. a + +3a +1

More information

Continued..& Multiplier

Continued..& Multiplier CS222: Computer Arthmetc : Adder Contnued..& Multpler Dr. A. Sahu Dept of Comp. Sc. & Engg. Indan Insttute of Technology Guwahat 1 Outlne Adder Unversal Use (N bt addton) RppleCarry Adder, Full Adder,

More information

Kernel Methods and SVMs Extension

Kernel Methods and SVMs Extension Kernel Methods and SVMs Extenson The purpose of ths document s to revew materal covered n Machne Learnng 1 Supervsed Learnng regardng support vector machnes (SVMs). Ths document also provdes a general

More information

High Performance Rotation Architectures Based on the Radix-4 CORDIC Algorithm

High Performance Rotation Architectures Based on the Radix-4 CORDIC Algorithm IEEE TRANSACTIONS ON COMPUTERS, VOL. 46, NO. 8, AUGUST 997 855 Hgh Performance Rotaton Archtectures Based on the Radx-4 CORDIC Algorthm Elsardo Antelo, Julo Vllalba, Javer D. Bruguera, and Emlo L. Zapata

More information

Problem Set 9 Solutions

Problem Set 9 Solutions Desgn and Analyss of Algorthms May 4, 2015 Massachusetts Insttute of Technology 6.046J/18.410J Profs. Erk Demane, Srn Devadas, and Nancy Lynch Problem Set 9 Solutons Problem Set 9 Solutons Ths problem

More information

VQ widely used in coding speech, image, and video

VQ widely used in coding speech, image, and video at Scalar quantzers are specal cases of vector quantzers (VQ): they are constraned to look at one sample at a tme (memoryless) VQ does not have such constrant better RD perfomance expected Source codng

More information

College of Computer & Information Science Fall 2009 Northeastern University 20 October 2009

College of Computer & Information Science Fall 2009 Northeastern University 20 October 2009 College of Computer & Informaton Scence Fall 2009 Northeastern Unversty 20 October 2009 CS7880: Algorthmc Power Tools Scrbe: Jan Wen and Laura Poplawsk Lecture Outlne: Prmal-dual schema Network Desgn:

More information

Annexes. EC.1. Cycle-base move illustration. EC.2. Problem Instances

Annexes. EC.1. Cycle-base move illustration. EC.2. Problem Instances ec Annexes Ths Annex frst llustrates a cycle-based move n the dynamc-block generaton tabu search. It then dsplays the characterstcs of the nstance sets, followed by detaled results of the parametercalbraton

More information

Foundations of Arithmetic

Foundations of Arithmetic Foundatons of Arthmetc Notaton We shall denote the sum and product of numbers n the usual notaton as a 2 + a 2 + a 3 + + a = a, a 1 a 2 a 3 a = a The notaton a b means a dvdes b,.e. ac = b where c s an

More information

COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS

COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS Avalable onlne at http://sck.org J. Math. Comput. Sc. 3 (3), No., 6-3 ISSN: 97-537 COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS

More information

Numerical Heat and Mass Transfer

Numerical Heat and Mass Transfer Master degree n Mechancal Engneerng Numercal Heat and Mass Transfer 06-Fnte-Dfference Method (One-dmensonal, steady state heat conducton) Fausto Arpno f.arpno@uncas.t Introducton Why we use models and

More information

A PROBABILITY-DRIVEN SEARCH ALGORITHM FOR SOLVING MULTI-OBJECTIVE OPTIMIZATION PROBLEMS

A PROBABILITY-DRIVEN SEARCH ALGORITHM FOR SOLVING MULTI-OBJECTIVE OPTIMIZATION PROBLEMS HCMC Unversty of Pedagogy Thong Nguyen Huu et al. A PROBABILITY-DRIVEN SEARCH ALGORITHM FOR SOLVING MULTI-OBJECTIVE OPTIMIZATION PROBLEMS Thong Nguyen Huu and Hao Tran Van Department of mathematcs-nformaton,

More information

On balancing multiple video streams with distributed QoS control in mobile communications

On balancing multiple video streams with distributed QoS control in mobile communications On balancng multple vdeo streams wth dstrbuted QoS control n moble communcatons Arjen van der Schaaf, José Angel Lso Arellano, and R. (Inald) L. Lagendjk TU Delft, Mekelweg 4, 68 CD Delft, The Netherlands

More information

Prof. Dr. I. Nasser Phys 630, T Aug-15 One_dimensional_Ising_Model

Prof. Dr. I. Nasser Phys 630, T Aug-15 One_dimensional_Ising_Model EXACT OE-DIMESIOAL ISIG MODEL The one-dmensonal Isng model conssts of a chan of spns, each spn nteractng only wth ts two nearest neghbors. The smple Isng problem n one dmenson can be solved drectly n several

More information

Variability-Driven Module Selection with Joint Design Time Optimization and Post-Silicon Tuning

Variability-Driven Module Selection with Joint Design Time Optimization and Post-Silicon Tuning Asa and South Pacfc Desgn Automaton Conference 2008 Varablty-Drven Module Selecton wth Jont Desgn Tme Optmzaton and Post-Slcon Tunng Feng Wang, Xaoxa Wu, Yuan Xe The Pennsylvana State Unversty Department

More information

Queueing Networks II Network Performance

Queueing Networks II Network Performance Queueng Networks II Network Performance Davd Tpper Assocate Professor Graduate Telecommuncatons and Networkng Program Unversty of Pttsburgh Sldes 6 Networks of Queues Many communcaton systems must be modeled

More information

EEE 241: Linear Systems

EEE 241: Linear Systems EEE : Lnear Systems Summary #: Backpropagaton BACKPROPAGATION The perceptron rule as well as the Wdrow Hoff learnng were desgned to tran sngle layer networks. They suffer from the same dsadvantage: they

More information

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems Numercal Analyss by Dr. Anta Pal Assstant Professor Department of Mathematcs Natonal Insttute of Technology Durgapur Durgapur-713209 emal: anta.bue@gmal.com 1 . Chapter 5 Soluton of System of Lnear Equatons

More information

Suppose that there s a measured wndow of data fff k () ; :::; ff k g of a sze w, measured dscretely wth varable dscretzaton step. It s convenent to pl

Suppose that there s a measured wndow of data fff k () ; :::; ff k g of a sze w, measured dscretely wth varable dscretzaton step. It s convenent to pl RECURSIVE SPLINE INTERPOLATION METHOD FOR REAL TIME ENGINE CONTROL APPLICATIONS A. Stotsky Volvo Car Corporaton Engne Desgn and Development Dept. 97542, HA1N, SE- 405 31 Gothenburg Sweden. Emal: astotsky@volvocars.com

More information

A Hybrid Variational Iteration Method for Blasius Equation

A Hybrid Variational Iteration Method for Blasius Equation Avalable at http://pvamu.edu/aam Appl. Appl. Math. ISSN: 1932-9466 Vol. 10, Issue 1 (June 2015), pp. 223-229 Applcatons and Appled Mathematcs: An Internatonal Journal (AAM) A Hybrd Varatonal Iteraton Method

More information

Winter 2008 CS567 Stochastic Linear/Integer Programming Guest Lecturer: Xu, Huan

Winter 2008 CS567 Stochastic Linear/Integer Programming Guest Lecturer: Xu, Huan Wnter 2008 CS567 Stochastc Lnear/Integer Programmng Guest Lecturer: Xu, Huan Class 2: More Modelng Examples 1 Capacty Expanson Capacty expanson models optmal choces of the tmng and levels of nvestments

More information

Formulas for the Determinant

Formulas for the Determinant page 224 224 CHAPTER 3 Determnants e t te t e 2t 38 A = e t 2te t e 2t e t te t 2e 2t 39 If 123 A = 345, 456 compute the matrx product A adj(a) What can you conclude about det(a)? For Problems 40 43, use

More information

Estimating Delays. Gate Delay Model. Gate Delay. Effort Delay. Computing Logical Effort. Logical Effort

Estimating Delays. Gate Delay Model. Gate Delay. Effort Delay. Computing Logical Effort. Logical Effort Estmatng Delas Would be nce to have a back of the envelope method for szng gates for speed Logcal Effort ook b Sutherland, Sproull, Harrs Chapter s on our web page Gate Dela Model Frst, normalze a model

More information

NP-Completeness : Proofs

NP-Completeness : Proofs NP-Completeness : Proofs Proof Methods A method to show a decson problem Π NP-complete s as follows. (1) Show Π NP. (2) Choose an NP-complete problem Π. (3) Show Π Π. A method to show an optmzaton problem

More information

Appendix B: Resampling Algorithms

Appendix B: Resampling Algorithms 407 Appendx B: Resamplng Algorthms A common problem of all partcle flters s the degeneracy of weghts, whch conssts of the unbounded ncrease of the varance of the mportance weghts ω [ ] of the partcles

More information

CSE4210 Architecture and Hardware for DSP

CSE4210 Architecture and Hardware for DSP 4210 Archtecture and Hardware for DSP Lecture 1 Introducton & Number systems Admnstratve Stuff 4210 Archtecture and Hardware for DSP Text: VLSI Dgtal Sgnal Processng Systems: Desgn and Implementaton. K.

More information

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number

More information

18.1 Introduction and Recap

18.1 Introduction and Recap CS787: Advanced Algorthms Scrbe: Pryananda Shenoy and Shjn Kong Lecturer: Shuch Chawla Topc: Streamng Algorthmscontnued) Date: 0/26/2007 We contnue talng about streamng algorthms n ths lecture, ncludng

More information

Application of Nonbinary LDPC Codes for Communication over Fading Channels Using Higher Order Modulations

Application of Nonbinary LDPC Codes for Communication over Fading Channels Using Higher Order Modulations Applcaton of Nonbnary LDPC Codes for Communcaton over Fadng Channels Usng Hgher Order Modulatons Rong-Hu Peng and Rong-Rong Chen Department of Electrcal and Computer Engneerng Unversty of Utah Ths work

More information

Fundamental loop-current method using virtual voltage sources technique for special cases

Fundamental loop-current method using virtual voltage sources technique for special cases Fundamental loop-current method usng vrtual voltage sources technque for specal cases George E. Chatzaraks, 1 Marna D. Tortorel 1 and Anastasos D. Tzolas 1 Electrcal and Electroncs Engneerng Departments,

More information

Module 9. Lecture 6. Duality in Assignment Problems

Module 9. Lecture 6. Duality in Assignment Problems Module 9 1 Lecture 6 Dualty n Assgnment Problems In ths lecture we attempt to answer few other mportant questons posed n earler lecture for (AP) and see how some of them can be explaned through the concept

More information

Real-Time Systems. Multiprocessor scheduling. Multiprocessor scheduling. Multiprocessor scheduling

Real-Time Systems. Multiprocessor scheduling. Multiprocessor scheduling. Multiprocessor scheduling Real-Tme Systems Multprocessor schedulng Specfcaton Implementaton Verfcaton Multprocessor schedulng -- -- Global schedulng How are tasks assgned to processors? Statc assgnment The processor(s) used for

More information

Inductance Calculation for Conductors of Arbitrary Shape

Inductance Calculation for Conductors of Arbitrary Shape CRYO/02/028 Aprl 5, 2002 Inductance Calculaton for Conductors of Arbtrary Shape L. Bottura Dstrbuton: Internal Summary In ths note we descrbe a method for the numercal calculaton of nductances among conductors

More information

Assortment Optimization under MNL

Assortment Optimization under MNL Assortment Optmzaton under MNL Haotan Song Aprl 30, 2017 1 Introducton The assortment optmzaton problem ams to fnd the revenue-maxmzng assortment of products to offer when the prces of products are fxed.

More information

Global Sensitivity. Tuesday 20 th February, 2018

Global Sensitivity. Tuesday 20 th February, 2018 Global Senstvty Tuesday 2 th February, 28 ) Local Senstvty Most senstvty analyses [] are based on local estmates of senstvty, typcally by expandng the response n a Taylor seres about some specfc values

More information

Measurement Uncertainties Reference

Measurement Uncertainties Reference Measurement Uncertantes Reference Introducton We all ntutvely now that no epermental measurement can be perfect. It s possble to mae ths dea quanttatve. It can be stated ths way: the result of an ndvdual

More information

Interconnect Optimization for Deep-Submicron and Giga-Hertz ICs

Interconnect Optimization for Deep-Submicron and Giga-Hertz ICs Interconnect Optmzaton for Deep-Submcron and Gga-Hertz ICs Le He http://cadlab.cs.ucla.edu/~hele UCLA Computer Scence Department Los Angeles, CA 90095 Outlne Background and overvew LR-based STIS optmzaton

More information

Outline. Communication. Bellman Ford Algorithm. Bellman Ford Example. Bellman Ford Shortest Path [1]

Outline. Communication. Bellman Ford Algorithm. Bellman Ford Example. Bellman Ford Shortest Path [1] DYNAMIC SHORTEST PATH SEARCH AND SYNCHRONIZED TASK SWITCHING Jay Wagenpfel, Adran Trachte 2 Outlne Shortest Communcaton Path Searchng Bellmann Ford algorthm Algorthm for dynamc case Modfcatons to our algorthm

More information

Power Efficient Design and Implementation of a Novel Constant Correction Truncated Multiplier

Power Efficient Design and Implementation of a Novel Constant Correction Truncated Multiplier APSIPA ASC 11 X an Power Effcent Desgn and Implementaton of a Novel Constant Correcton Truncated Multpler Yu Ren, Dong Wang, Lebo Lu, Shouy Yn and Shaojun We Tsnghua Unversty, Bejng E-mal: reneereny@gmal.com

More information

CSci 6974 and ECSE 6966 Math. Tech. for Vision, Graphics and Robotics Lecture 21, April 17, 2006 Estimating A Plane Homography

CSci 6974 and ECSE 6966 Math. Tech. for Vision, Graphics and Robotics Lecture 21, April 17, 2006 Estimating A Plane Homography CSc 6974 and ECSE 6966 Math. Tech. for Vson, Graphcs and Robotcs Lecture 21, Aprl 17, 2006 Estmatng A Plane Homography Overvew We contnue wth a dscusson of the major ssues, usng estmaton of plane projectve

More information

Second Order Analysis

Second Order Analysis Second Order Analyss In the prevous classes we looked at a method that determnes the load correspondng to a state of bfurcaton equlbrum of a perfect frame by egenvalye analyss The system was assumed to

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have

More information

Resource Allocation with a Budget Constraint for Computing Independent Tasks in the Cloud

Resource Allocation with a Budget Constraint for Computing Independent Tasks in the Cloud Resource Allocaton wth a Budget Constrant for Computng Independent Tasks n the Cloud Wemng Sh and Bo Hong School of Electrcal and Computer Engneerng Georga Insttute of Technology, USA 2nd IEEE Internatonal

More information

An Interactive Optimisation Tool for Allocation Problems

An Interactive Optimisation Tool for Allocation Problems An Interactve Optmsaton ool for Allocaton Problems Fredr Bonäs, Joam Westerlund and apo Westerlund Process Desgn Laboratory, Faculty of echnology, Åbo Aadem Unversty, uru 20500, Fnland hs paper presents

More information

Adders. Today: Adders. EE M216A.:. Fall Prof. Dejan Marković Lecture 9. Basic terminology. Adder building blocks

Adders. Today: Adders. EE M216A.:. Fall Prof. Dejan Marković Lecture 9. Basic terminology. Adder building blocks EE M216.:. Fall 21 Lecture 9 dders Prof. Dejan Markovć ee216a@gmal.com Today: dders asc termnology dder buldng blocks asc adder topologes Lnear adders (n < 16) Fast parallel adders (n > 16) Some examples

More information

Energy Storage Elements: Capacitors and Inductors

Energy Storage Elements: Capacitors and Inductors CHAPTER 6 Energy Storage Elements: Capactors and Inductors To ths pont n our study of electronc crcuts, tme has not been mportant. The analyss and desgns we hae performed so far hae been statc, and all

More information

Transfer Functions. Convenient representation of a linear, dynamic model. A transfer function (TF) relates one input and one output: ( ) system

Transfer Functions. Convenient representation of a linear, dynamic model. A transfer function (TF) relates one input and one output: ( ) system Transfer Functons Convenent representaton of a lnear, dynamc model. A transfer functon (TF) relates one nput and one output: x t X s y t system Y s The followng termnology s used: x y nput output forcng

More information

The optimal delay of the second test is therefore approximately 210 hours earlier than =2.

The optimal delay of the second test is therefore approximately 210 hours earlier than =2. THE IEC 61508 FORMULAS 223 The optmal delay of the second test s therefore approxmately 210 hours earler than =2. 8.4 The IEC 61508 Formulas IEC 61508-6 provdes approxmaton formulas for the PF for smple

More information

Linear Approximation with Regularization and Moving Least Squares

Linear Approximation with Regularization and Moving Least Squares Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...

More information

For now, let us focus on a specific model of neurons. These are simplified from reality but can achieve remarkable results.

For now, let us focus on a specific model of neurons. These are simplified from reality but can achieve remarkable results. Neural Networks : Dervaton compled by Alvn Wan from Professor Jtendra Malk s lecture Ths type of computaton s called deep learnng and s the most popular method for many problems, such as computer vson

More information

Time-Varying Systems and Computations Lecture 6

Time-Varying Systems and Computations Lecture 6 Tme-Varyng Systems and Computatons Lecture 6 Klaus Depold 14. Januar 2014 The Kalman Flter The Kalman estmaton flter attempts to estmate the actual state of an unknown dscrete dynamcal system, gven nosy

More information

Digital Signal Processing

Digital Signal Processing Dgtal Sgnal Processng Dscrete-tme System Analyss Manar Mohasen Offce: F8 Emal: manar.subh@ut.ac.r School of IT Engneerng Revew of Precedent Class Contnuous Sgnal The value of the sgnal s avalable over

More information

The Minimum Universal Cost Flow in an Infeasible Flow Network

The Minimum Universal Cost Flow in an Infeasible Flow Network Journal of Scences, Islamc Republc of Iran 17(2): 175-180 (2006) Unversty of Tehran, ISSN 1016-1104 http://jscencesutacr The Mnmum Unversal Cost Flow n an Infeasble Flow Network H Saleh Fathabad * M Bagheran

More information

Some Comments on Accelerating Convergence of Iterative Sequences Using Direct Inversion of the Iterative Subspace (DIIS)

Some Comments on Accelerating Convergence of Iterative Sequences Using Direct Inversion of the Iterative Subspace (DIIS) Some Comments on Acceleratng Convergence of Iteratve Sequences Usng Drect Inverson of the Iteratve Subspace (DIIS) C. Davd Sherrll School of Chemstry and Bochemstry Georga Insttute of Technology May 1998

More information

Generalized Linear Methods

Generalized Linear Methods Generalzed Lnear Methods 1 Introducton In the Ensemble Methods the general dea s that usng a combnaton of several weak learner one could make a better learner. More formally, assume that we have a set

More information

Section 8.3 Polar Form of Complex Numbers

Section 8.3 Polar Form of Complex Numbers 80 Chapter 8 Secton 8 Polar Form of Complex Numbers From prevous classes, you may have encountered magnary numbers the square roots of negatve numbers and, more generally, complex numbers whch are the

More information

Economics 101. Lecture 4 - Equilibrium and Efficiency

Economics 101. Lecture 4 - Equilibrium and Efficiency Economcs 0 Lecture 4 - Equlbrum and Effcency Intro As dscussed n the prevous lecture, we wll now move from an envronment where we looed at consumers mang decsons n solaton to analyzng economes full of

More information

A FAST HEURISTIC FOR TASKS ASSIGNMENT IN MANYCORE SYSTEMS WITH VOLTAGE-FREQUENCY ISLANDS

A FAST HEURISTIC FOR TASKS ASSIGNMENT IN MANYCORE SYSTEMS WITH VOLTAGE-FREQUENCY ISLANDS Shervn Haamn A FAST HEURISTIC FOR TASKS ASSIGNMENT IN MANYCORE SYSTEMS WITH VOLTAGE-FREQUENCY ISLANDS INTRODUCTION Increasng computatons n applcatons has led to faster processng. o Use more cores n a chp

More information

Simultaneous Optimization of Berth Allocation, Quay Crane Assignment and Quay Crane Scheduling Problems in Container Terminals

Simultaneous Optimization of Berth Allocation, Quay Crane Assignment and Quay Crane Scheduling Problems in Container Terminals Smultaneous Optmzaton of Berth Allocaton, Quay Crane Assgnment and Quay Crane Schedulng Problems n Contaner Termnals Necat Aras, Yavuz Türkoğulları, Z. Caner Taşkın, Kuban Altınel Abstract In ths work,

More information

Design and Optimization of Fuzzy Controller for Inverse Pendulum System Using Genetic Algorithm

Design and Optimization of Fuzzy Controller for Inverse Pendulum System Using Genetic Algorithm Desgn and Optmzaton of Fuzzy Controller for Inverse Pendulum System Usng Genetc Algorthm H. Mehraban A. Ashoor Unversty of Tehran Unversty of Tehran h.mehraban@ece.ut.ac.r a.ashoor@ece.ut.ac.r Abstract:

More information

Pulse Coded Modulation

Pulse Coded Modulation Pulse Coded Modulaton PCM (Pulse Coded Modulaton) s a voce codng technque defned by the ITU-T G.711 standard and t s used n dgtal telephony to encode the voce sgnal. The frst step n the analog to dgtal

More information

Analysis of Queuing Delay in Multimedia Gateway Call Routing

Analysis of Queuing Delay in Multimedia Gateway Call Routing Analyss of Queung Delay n Multmeda ateway Call Routng Qwe Huang UTtarcom Inc, 33 Wood Ave. outh Iseln, NJ 08830, U..A Errol Lloyd Computer Informaton cences Department, Unv. of Delaware, Newark, DE 976,

More information

THE SUMMATION NOTATION Ʃ

THE SUMMATION NOTATION Ʃ Sngle Subscrpt otaton THE SUMMATIO OTATIO Ʃ Most of the calculatons we perform n statstcs are repettve operatons on lsts of numbers. For example, we compute the sum of a set of numbers, or the sum of the

More information

NON LINEAR ANALYSIS OF STRUCTURES ACCORDING TO NEW EUROPEAN DESIGN CODE

NON LINEAR ANALYSIS OF STRUCTURES ACCORDING TO NEW EUROPEAN DESIGN CODE October 1-17, 008, Bejng, Chna NON LINEAR ANALYSIS OF SRUCURES ACCORDING O NEW EUROPEAN DESIGN CODE D. Mestrovc 1, D. Czmar and M. Pende 3 1 Professor, Dept. of Structural Engneerng, Faculty of Cvl Engneerng,

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

CONTRAST ENHANCEMENT FOR MIMIMUM MEAN BRIGHTNESS ERROR FROM HISTOGRAM PARTITIONING INTRODUCTION

CONTRAST ENHANCEMENT FOR MIMIMUM MEAN BRIGHTNESS ERROR FROM HISTOGRAM PARTITIONING INTRODUCTION CONTRAST ENHANCEMENT FOR MIMIMUM MEAN BRIGHTNESS ERROR FROM HISTOGRAM PARTITIONING N. Phanthuna 1,2, F. Cheevasuvt 2 and S. Chtwong 2 1 Department of Electrcal Engneerng, Faculty of Engneerng Rajamangala

More information

CHAPTER 4 SPEECH ENHANCEMENT USING MULTI-BAND WIENER FILTER. In real environmental conditions the speech signal may be

CHAPTER 4 SPEECH ENHANCEMENT USING MULTI-BAND WIENER FILTER. In real environmental conditions the speech signal may be 55 CHAPTER 4 SPEECH ENHANCEMENT USING MULTI-BAND WIENER FILTER 4.1 Introducton In real envronmental condtons the speech sgnal may be supermposed by the envronmental nterference. In general, the spectrum

More information

Supplementary Notes for Chapter 9 Mixture Thermodynamics

Supplementary Notes for Chapter 9 Mixture Thermodynamics Supplementary Notes for Chapter 9 Mxture Thermodynamcs Key ponts Nne major topcs of Chapter 9 are revewed below: 1. Notaton and operatonal equatons for mxtures 2. PVTN EOSs for mxtures 3. General effects

More information

Implementation of Parallel Multiplier Accumulator based on Radix-2 Modified Booth Algorithm Shashi Prabha Singh 1 Uma Sharma 2

Implementation of Parallel Multiplier Accumulator based on Radix-2 Modified Booth Algorithm Shashi Prabha Singh 1 Uma Sharma 2 IJSRD - Internatonal Journal for Scentfc Research & Development Vol. 3, Issue 05, 2015 ISSN (onlne): 2321-0613 Implementaton of Parallel Multpler Accumulator based on Radx-2 Modfed Booth Algorthm Shash

More information

Inner Product. Euclidean Space. Orthonormal Basis. Orthogonal

Inner Product. Euclidean Space. Orthonormal Basis. Orthogonal Inner Product Defnton 1 () A Eucldean space s a fnte-dmensonal vector space over the reals R, wth an nner product,. Defnton 2 (Inner Product) An nner product, on a real vector space X s a symmetrc, blnear,

More information

On the correction of the h-index for career length

On the correction of the h-index for career length 1 On the correcton of the h-ndex for career length by L. Egghe Unverstet Hasselt (UHasselt), Campus Depenbeek, Agoralaan, B-3590 Depenbeek, Belgum 1 and Unverstet Antwerpen (UA), IBW, Stadscampus, Venusstraat

More information

Errors for Linear Systems

Errors for Linear Systems Errors for Lnear Systems When we solve a lnear system Ax b we often do not know A and b exactly, but have only approxmatons  and ˆb avalable. Then the best thng we can do s to solve ˆx ˆb exactly whch

More information

A NOTE ON CES FUNCTIONS Drago Bergholt, BI Norwegian Business School 2011

A NOTE ON CES FUNCTIONS Drago Bergholt, BI Norwegian Business School 2011 A NOTE ON CES FUNCTIONS Drago Bergholt, BI Norwegan Busness School 2011 Functons featurng constant elastcty of substtuton CES are wdely used n appled economcs and fnance. In ths note, I do two thngs. Frst,

More information

Solving Nonlinear Differential Equations by a Neural Network Method

Solving Nonlinear Differential Equations by a Neural Network Method Solvng Nonlnear Dfferental Equatons by a Neural Network Method Luce P. Aarts and Peter Van der Veer Delft Unversty of Technology, Faculty of Cvlengneerng and Geoscences, Secton of Cvlengneerng Informatcs,

More information

Lecture 14: Forces and Stresses

Lecture 14: Forces and Stresses The Nuts and Bolts of Frst-Prncples Smulaton Lecture 14: Forces and Stresses Durham, 6th-13th December 2001 CASTEP Developers Group wth support from the ESF ψ k Network Overvew of Lecture Why bother? Theoretcal

More information

Lecture Notes on Linear Regression

Lecture Notes on Linear Regression Lecture Notes on Lnear Regresson Feng L fl@sdueducn Shandong Unversty, Chna Lnear Regresson Problem In regresson problem, we am at predct a contnuous target value gven an nput feature vector We assume

More information

The KMO Method for Solving Non-homogenous, m th Order Differential Equations

The KMO Method for Solving Non-homogenous, m th Order Differential Equations The KMO Method for Solvng Non-homogenous, m th Order Dfferental Equatons Davd Krohn Danel Marño-Johnson John Paul Ouyang March 14, 2013 Abstract Ths paper shows a smple tabular procedure for fndng the

More information

Coarse-Grain MTCMOS Sleep

Coarse-Grain MTCMOS Sleep Coarse-Gran MTCMOS Sleep Transstor Szng Usng Delay Budgetng Ehsan Pakbazna and Massoud Pedram Unversty of Southern Calforna Dept. of Electrcal Engneerng DATE-08 Munch, Germany Leakage n CMOS Technology

More information

Module #6: Combinational Logic Design with VHDL Part 2 (Arithmetic)

Module #6: Combinational Logic Design with VHDL Part 2 (Arithmetic) ECOM4311 Dgtal Systems Desgn : Combnatonal Logc Desgn wth VHDL Part 2 (Arthmetc) - A crcut that compares dgtal values (.e., Equal, Greater Than, Less Than) Agenda 1. 2. Adders (Rpple Carry, Carry-Look-Ahead)

More information

A linear imaging system with white additive Gaussian noise on the observed data is modeled as follows:

A linear imaging system with white additive Gaussian noise on the observed data is modeled as follows: Supplementary Note Mathematcal bacground A lnear magng system wth whte addtve Gaussan nose on the observed data s modeled as follows: X = R ϕ V + G, () where X R are the expermental, two-dmensonal proecton

More information

Implementation and Study of Reversible Binary Comparators

Implementation and Study of Reversible Binary Comparators Implementaton and Study of Reversble Bnary Comparators Harth M, C Vasanthanayak Abstract Reversble logc contans a feature of recoverng bt loss from unque nput-output mappng. Usng tradtonal (rreversble)

More information

Introduction to Vapor/Liquid Equilibrium, part 2. Raoult s Law:

Introduction to Vapor/Liquid Equilibrium, part 2. Raoult s Law: CE304, Sprng 2004 Lecture 4 Introducton to Vapor/Lqud Equlbrum, part 2 Raoult s Law: The smplest model that allows us do VLE calculatons s obtaned when we assume that the vapor phase s an deal gas, and

More information

EEL 6266 Power System Operation and Control. Chapter 3 Economic Dispatch Using Dynamic Programming

EEL 6266 Power System Operation and Control. Chapter 3 Economic Dispatch Using Dynamic Programming EEL 6266 Power System Operaton and Control Chapter 3 Economc Dspatch Usng Dynamc Programmng Pecewse Lnear Cost Functons Common practce many utltes prefer to represent ther generator cost functons as sngle-

More information