Topics: A multiple cycle implementation. Distributed Notes
|
|
- Angelica Baldwin
- 6 years ago
- Views:
Transcription
1 COSC 22: Compter Organization Instrctor: Dr. Amir Asif Department of Compter Science York University Handot # lticycle Implementation of a IPS Processor Topics: A mltiple cycle implementation Distribted Notes
2 Why lticycle? Eample: Assme that the operation times for major fnctional nit in a microprocessor are: emory nit ~ 2ns, ALU and adders ~ 2ns, Register file ~ ns Compare the performance of the following instrction mi Loads: 24%; Stores: 2%; ALU instrctions: 44%; Branches: 8%; Jmps: 2% on the two implementations Implementation I: Each instrction operates in clock cycle Implementation II: Each instrction is as long as it needs to be. Instrction Class Fnctional nits sed (Steps involved) ALU type Instrction fetch Register Access ALU Register Access 6ns Load word Instrction fetch Register Access ALU emory Access Register Access 8ns Store word Instrction fetch Register Access ALU emory Access 7ns Branch Instrction fetch Register Access ALU 5ns Branch Instrction fetch 2ns Average time per instrction: Implementation : ~ 8ns Implementation 2: ~.24(8)+.2(7)+.44(6)+.8(5)+.2(2) = 6.34ns 2
3 lticycle Implementation Instrction: Eection of each instrction is broken into different steps Each step reqires clock cycle Each instrction takes mltiple clock cycles Fnctional Unit: Can be sed more than once in an instrction (bt still only once in a clock cycle) Advantages: Fnctional nits can be shared ALU and adder is combined Single memory is sed for instrctions and data 3
4 lticycle Implementation: Abstract Diagram P C A d d r e s s e m o r y D a t a o r e m o r y D a t a R e g i s t e r # R e g i s t e r s R e g i s t e r # R e g i s t e r # A B A L U A L U O t One ALU is sed for incrementing PC and for arithmetic operations Data memory and Instrction memory are combined 5 additional registers are added. An instrction register (IR) to hold instrctions before distribting data to register file or ALU 2. A memory data register (DR) to hold data before distribting to register file or ALU 3. Regsiters A and B that hold data before the ALU 4. Register ALUot that hold data compted by ALU 4
5 lticycle Implementation: ltipleers added P C A d d r e s s e m o r y e m D a t a [ ] [ 2 6 ] [ 5 ] [ 5 ] e m o r y [ 5 ] 6 2 R e g i s t e r s S i g n e t e n d S h i f t l e f t 2 A B Z e r o A L U A L U r e s l t A L U O t Becase fnctional nits are shared, mltipleers are added to select data between different devices. UX before memory selects either the PC otpt (fetch instrction) or ALU otpt (storing data) 2. UX before write register selects write-register nmber (instrction [5-] or instrction[2-6]) 3. UX before write data selects data from ALUOt (R-type instrction) or emdata (lw instrction) 4. Upper UX before ALU selects PC otpt (increment PC) or Read data (R-type instrction) 5. Lower UX before ALU selects Read data 2, or sign etended instrction[5-] or shift left sign etended instrction[5-], or 4 5
6 lticycle Implementation: Controls added I o r D e m e m I R R e g D s t R e g A L U S r c A P C A d d r e s s e m o r y e m D a t a [ ] [ 2 6 ] [ 5 ] [ 5 ] e m o r y [ 5 ] 6 2 R e g i s t e r s S i g n e t e n d S h i f t l e f t 2 A B A L U c o n t r o l Z e r o A L U A L U r e s l t A L U O t [ 5 ] e m t o R e g A L U S r c B A L U O p Becase fnctional nits are shared, mltipleers are added to select data between different devices. UX before memory selects either the PC otpt (fetch instrction) or ALU otpt (storing data) 2. UX before write register selects write-register nmber (instrction [5-] or instrction[2-6]) 6
7 lticycle Implementation: Control Units added P C A d d r e s s e m o r y e m D a t a [ ] [ 2 6 ] [ 5 ] [ 5 ] e m o r y P C C o n d P C I o r D e m e m e m t o R e g I R O t p t s C o n t r o l O p [ 5 ] [ 5 ] P C S o r c e A L U O p A L U S r c B 6 A L U S r c A R e g R e g D s t [ 25 ] S h i f t l e f t 2 [ 3-26 ] P C [ 3-28 ] 2 R e g i s t e r s 2 S i g n 32 e t e n d S h i f t l e f t 2 A B A L U c o n t r o l Z e r o A L U A L U r e s l t J m p a d d r e s s [ 3 - ] A L U O t 2 [ 5 ] 7
8 Action of -bit Control Signals Control Inpt Effect when Deasserted () Effect when asserted () IorD PC spplies address to memory (instrction fetch) ALUot spplies address to memory (lw/sw) emread None emory content specified by address is placed on emdata o/p (lw/any instrction) emwrite None I/p Write data is stored at specified address (sw) IRWrite None emdata o/p is written on IR (instrction fetch) RegDst Write Register specified by Instrction[2-6] (lw) WriteRegister specified by Instrction[5-] (R-type) RegWrite None Data from WriteData i/p is written on the register specified by WriteRegister nmber ALUSrcA PC is the first operand in ALU (increment PC) Register A is the first operand in ALU emtoreg WriteData of the register file comes from ALUOt WriteData of the register file comes from DR PCWrite PCWriteCond Operation at PC depends on PCWriteCond and zero otpt of ALU Operation at PC depends on PCWrite PC is written; Sorce is determined by PCSorce PC is written if zero o/p of ALU = ; Sorce is determined by PCSorce 8
9 Action of 2-bit Control Signals Control Inpt Vale Effect ALU performs an add operation ALUOp ALUSrcB PCSorce ALU performs a sbtract operation The fnction field of Instrction defines the operation of ALU The second operand of ALU comes from Register B The second operand of ALU = 4 The second operand of ALU is sign etended Instrction[5-] The second operand of ALU is sign etended, 2-bit left shifted Instrction[5-] Otpt of ALU (PC + 4) is sent to PC Contents of ALUOt (branch target address = PC offset) is sent to PC Contents of Instrction[25-], shift left by 2, and concatenated with the SB 4-bits of PC is sent to PC (jmp instrction) 9
10 Breaking the Instrction Eection into Clock Cycles Eection of each instrction is broken into a series of steps Each step is balanced to do almost eqal amont of work Each step takes one clock cycle Each step contains at the most ALU operation, or register file access, or memory access Operations listed in step occrs in parallel in clock cycle Different steps occr in different clock cycles Different steps are:. Instrction fetch step 2. Instrction decode and register fetch step 3. Eection, memory address comptation, or branch completion step 4. emory access of R-type instrction completion step 5. emory read completion step
11 lticycle Implementation: Control Units added P C A d d r e s s e m o r y e m D a t a [ ] [ 2 6 ] [ 5 ] [ 5 ] e m o r y P C C o n d P C I o r D e m e m e m t o R e g I R O t p t s C o n t r o l O p [ 5 ] [ 5 ] P C S o r c e A L U O p A L U S r c B 6 A L U S r c A R e g R e g D s t [ 25 ] S h i f t l e f t 2 [ 3-26 ] P C [ 3-28 ] 2 R e g i s t e r s 2 S i g n 32 e t e n d S h i f t l e f t 2 A B A L U c o n t r o l Z e r o A L U A L U r e s l t J m p a d d r e s s [ 3 - ] A L U O t 2 [ 5 ]
12 Step : Instrction Fetch Fetch instrction from memory and compte the address of net seqential instrction IR = emory[pc]; PC = PC + 4; Operation:. Send PC to the memory as address (IorD = ) 2. Read memory cell defined by PC (emread = ) 3. Copy otpt of memory (edata) into IR (IRwrite = ) 4. Increment PC by 4 (ALUSrcA =, ALUSrcB =, PCSrc = ) 5. Store (PC + 4) into PC (PCWrite = ) 2
13 Step 2: Instrction Decode and Register Fetch Read register rs in register file and store content of rs in register A Read rt in register file and store content of rt from register file Compte branch target address A = Reg[IR[25-2]]; B = Reg[IR[2-6]]; ALUOt = PC + (sign-etend(ir[5-]) << 2); Operation:. Access register file to write rs in A. 2. Access register file to write rt in B. 3. Compte branch target address and store in ALUOt (ALUSrcA = ; ALUSrcB = ) Remember that ALU mst add (ALUOp = ) After this step, one of the for actions are possible: emory reference (lw/sw), R-type, Branch, or Jmp 3
14 Step 3: Eection, emory address Comptation, or Branch Completion emory Reference (sw/lw): ALUOt = A + sign-etend(ir[5-]) ALU adds content of A and sign-etend(ir[5-]) (ALUSrcA =, ALUSrcB = ), (ALUOp = ) R-type (add/sb/or/and): ALUOt = A op B ALU performs specified operation on A and B (ALUSrcA =, ALUSrcB = ), Operation of ALU is determined by the fnction field code (ALUOp = ) Branch (beq): if (A == B) PC = ALUOt; ALU does the eqal comparison operation on A and B (ALUSrcA =, ALUSrcB = ), ALU mst sbtract (ALUOp = ) Update PC with ALUOt if A == B (PCWriteCond =, PCSorce = ). Complete. Jmp (j): PC = PC[3-28] (IR[25-) << 2); PC gets overwritten by otpt of jmp address UX (PCSorce =, PCWrite = ). Complete. 4
15 Step 4: emory Access or R-type Instrction Completion emory Reference (sw/lw): DR = emory[aluot]; (for lw) or emory[aluot] = B; (for sw). Address from ALUOt is applied at address i/p of memory (IorD = ) 2. For sw, emwrite =. For lw, emread =. sw is complete. R-type Instrction (add/sb/or/and): Reg[IR[5-]] = ALUOt; ALUOt is stored into the register specified by IR[5-] (emtoreg =, RegWrite = ). Complete. 5
16 Step 5: emory Read Completion load (lw): Reg[IR[2-6]] = DR; DR is stored into the register specified by IR[2-6] (emtoreg =, RegWrite =, RegDst = ) 6
17 Smmary of Steps sed in different Instrctions Step Name Instrction fetch Instrction decode / Register fetch Action for R-type Instrction emory Reference Instrction Branch Jmp IR = emory[pc]; PC = PC + 4; A = Reg[IR[25-2]]; B = Reg[IR[2-6]]; ALUOt = PC + (sign-etend(ir[5-])<<2); R-type Eection / address comptation / Branch / Jmp ALUOt = A op B ALUOt = A + signetend(ir[5-]) if(a == B) then PC = ALUOt; PC = PC[3-28] (IR[25-)<<2); emory Access / R-type Completion Reg[IR[5-]] = ALUOt; lw: DR = emory[aluot] or sw: emory[aluot] = B emory Read Completion lw: Reg[IR[2-6]]=DR; 7
18 ltipath Datapath Implementation: Control Recall that design of single cycle datapath was based on a combinational circit Design of mlticycle datapath is more complicated. Instrctions are eected in a series of steps 2. Each step mst occr in a seqence 3. Control of mlticycle mst specify both the control signals and the net step The control of a mlticycle datapath is based on a seqential circit referred to as a finite state machine A finite state diagram for a 2-bit conter Each state specifies a set of otpt By defalt, nspecified otpts are assmed disabled The nmber of the arrows identify inpts State State 3 State State 2 8
19 Finite State achine Control of lticycle Datapath () S t a r t f e t c h / d e c o d e a n d f e t c h ( F i g r e ) e m o r y a c c e s s i n s t r c t i o n s ( F i g r e ) R - t y p e i n s t r c t i o n s ( F i g r e ) B r a n c h i n s t r c t i o n ( F i g r e 5 4 ) J m p i n s t r c t i o n ( F i g r e 5. 4 ) High-Level View 9
20 Finite State achine Control of lticycle Datapath (2) S t a r t f e t c h e m A L U S r c A = I o r D = I R A L U S r c B = A L U O p = P C P C S o r c e = d e c o d e / R e g i s t e r f e t c h A L U S r c A = A L U S r c B = A L U O p = ( O p = ' J P ' ) e m o r y r e f e r e n c e F S ( F i g r e ) R - t y p e F S ( F i g r e ) B r a n c h F S ( F i g r e 5. 4 ) J m p F S ( F i g r e 5. 4 ) Fig. 5.37: Steps and 2: Instrction Fetch and Decode Instrctions 2
21 Finite State achine Control of lticycle Datapath (3) 2 F r o m s t a t e A L U S r c A = A L U S r c B = A L U O p = ( O p = ' L W ' ) o r ( O p = ' S W ' ) e m o r y a d d r e s s c o m p t a t i o n 3 ( O p = ' L W ' ) e m o r y a c c e s s 5 e m o r y a c c e s s Fig. 5.38: Finite State achine for emory Reference Instrctions e m I o r D = e m I o r D = 4 - b a c k s t e p R e g e m t o R e g = R e g D s t = T o s t a t e ( F i g r e ) 2
22 Finite State achine Control of lticycle Datapath (4) F r o m s t a t e ( O p = R - t y p e ) 6 E e c t i o n A L U S r c A = A L U S r c B = A L U O p = 7 R - t y p e c o m p l e t i o n R e g D s t = R e g e m t o R e g = T o s t a t e ( F i g r e ) Fig. 5.39: Finite State achines for R-type Instrctions 22
23 Finite State achine Control of lticycle Datapath (5) 8 F r o m s t a t e ( O p = ' B E Q ' ) A L U S r c A = A L U S r c B = A L U O p = P C C o n d P C S o r c e = B r a n c h c o m p l e t i o n 9 F r o m s t a t e ( O p = ' J ' ) P C P C S o r c e = J m p c o m p l e t i o n T o s t a t e ( F i g r e ) Fig. 5.4: Finite State achine for Branch Instrction T o s t a t e ( F i g r e ) Fig. 5.4: Finite State achine for Jmp Instrction 23
24 2 e m o r y a d d r e s s c o m p t a t i o n A L U S r c A = A L U S r c B = A L U O p = S t a r t 6 e m A L U S r c A = I o r D = I R A L U S r c B = A L U O p = P C P C S o r c e = E e c t i o n A L U S r c A = A L U S r c B = A L U O p = f e t c h 8 B r a n c h c o m p l e t i o n A L U S r c A = A L U S r c B = A L U O p = P C C o n d P C S o r c e = d e c o d e / f e t c h 9 A L U S r c A = A L U S r c B = A L U O p = ( O p = ' J ') J m p c o m p l e t i o n P C P C S o r c e = 3 ( O p = ' L W ') e m o r y a c c e s s 5 e m o r y a c c e s s 7 R - t y p e c o m p l e t i o n e m I o r D = e m I o r D = R e g D s t = R e g e m t o R e g = 4 - b a c k s t e p R e g D s t = R e g e m t o R e g = 24
25 lticycle Implementation: Control Units added P C A d d r e s s e m o r y e m D a t a [ ] [ 2 6 ] [ 5 ] [ 5 ] e m o r y P C C o n d P C I o r D e m e m e m t o R e g I R O t p t s C o n t r o l O p [ 5 ] [ 5 ] P C S o r c e A L U O p A L U S r c B 6 A L U S r c A R e g R e g D s t [ 25 ] S h i f t l e f t 2 [ 3-26 ] P C [ 3-28 ] 2 R e g i s t e r s 2 S i g n 32 e t e n d S h i f t l e f t 2 A B A L U c o n t r o l Z e r o A L U A L U r e s l t J m p a d d r e s s [ 3 - ] A L U O t 2 [ 5 ] 25
26 Finite State achine Control of lticycle Datapath (5) P C P C C o n d I o r D e m e m C o n t r o l l o g i c I R e m t o R e g P C S o r c e O t p t s A L U O p A L U S r c B A L U S r c A R e g R e g D s t I n p t s N S 3 N S 2 N S N S O p 5 O p 4 O p 3 O p 2 O p O p S 3 S 2 S S o p c o d e f i e l d S t a t e 26
Designing MIPS Processor
CSE 675.: Introdction to Compter Architectre Designing IPS Processor (lti-cycle) Presentation H Reading Assignment: 5.5,5.6 lti-cycle Design Principles Break p eection of each instrction into steps. The
More informationReview. Combined Datapath
Review Topics:. A single cycle implementation 2. State Diagrams. A mltiple cycle implementation COSC 22: Compter Organization Instrctor: Dr. Amir Asif Department of Compter Science York University Handot
More informationInstruction register. Data. Registers. Register # Memory data register
Where we are headed Single Cycle Problems: what if we had a more complicated instrction like floating point? wastefl of area One Soltion: se a smaller cycle time have different instrctions take different
More informationDesigning Single-Cycle MIPS Processor
CSE 32: Introdction to Compter Architectre Designing Single-Cycle IPS Processor Presentation G Stdy:.-. Gojko Babić 2/9/28 Introdction We're now ready to look at an implementation of the system that incldes
More informationCPU DESIGN The Single-Cycle Implementation
22 ompter Organization Seqential vs. ombinational ircits Digital circits can be classified into two categories: DESIGN The Single-ycle Implementation. ombinational ircits: m, 2. Seqential ircits: flip-flops,
More informationLecture 12: Pipelined Implementations: Control Hazards and Resolutions
18-447 Lectre 12: Pipelined Implementations: Control Hazards and Resoltions S 09 L12-1 James C. Hoe Dept of ECE, CU arch 2, 2009 Annoncements: Spring break net week!! Project 2 de the week after spring
More information中村宏 工学部計数工学科 情報理工学系研究科システム情報学専攻 計算システム論第 2 1
計算システム論第 2 (3 章 ) 中村宏 工学部計数工学科 情報理工学系研究科システム情報学専攻 計算システム論第 2 IPS Assembly Langage ( 抜粋 ) IPS operands Name Eample Comments $s-$s7, $t-$t9, $zero, Fast locations for data. In IPS, data mst be in registers
More informationCSE Computer Architecture I
Execution Sequence Summary CSE 30321 Computer Architecture I Lecture 17 - Multi Cycle Control Michael Niemier Department of Computer Science and Engineering Step name Instruction fetch Instruction decode/register
More informationComputer Engineering Department. CC 311- Computer Architecture. Chapter 4. The Processor: Datapath and Control. Single Cycle
Computer Engineering Department CC 311- Computer Architecture Chapter 4 The Processor: Datapath and Control Single Cycle Introduction The 5 classic components of a computer Processor Input Control Memory
More informationL07-L09 recap: Fundamental lesson(s)!
L7-L9 recap: Fundamental lesson(s)! Over the next 3 lectures (using the IPS ISA as context) I ll explain:! How functions are treated and processed in assembly! How system calls are enabled in assembly!
More informationLecture 9: Control Hazard and Resolution. James C. Hoe Department of ECE Carnegie Mellon University
18 447 Lectre 9: Control Hazard and Resoltion James C. Hoe Department of ECE Carnegie ellon University 18 447 S18 L09 S1, James C. Hoe, CU/ECE/CALC, 2018 Yor goal today Hosekeeping simple control flow
More informationConcepts Introduced. Digital Electronics. Logic Blocks. Truth Tables
Concepts Introdced Digital Electronics trth tables, logic eqations, and gates combinational logic seqential logic Digital electronics operate at either high or low voltage. Compters se a binary representation
More informationComputer Architecture Lecture 5: ISA Wrap-Up and Single-Cycle Microarchitectures
8-447 Compter Architectre Lectre 5: ISA Wrap-Up and Single-Cycle icroarchitectres Prof. Onr tl Carnegie ellon University Spring 22, /25/22 Homework Was de Wednesday! 34 received 2 Reminder: Homeworks for
More informationCPU DESIGN The Single-Cycle Implementation
CSE 202 Computer Organization CPU DESIGN The Single-Cycle Implementation Shakil M. Khan (adapted from Prof. H. Roumani) Dept of CS & Eng, York University Sequential vs. Combinational Circuits Digital circuits
More informationPipeline Datapath. With some slides from: John Lazzaro and Dan Garcia
Pipeline Datapath With some slides from: John Lazzaro and Dan Garcia The single cycle CPU Instrction [25 ] Shift Jmp address [3 ] left 2 26 28 PC+ [3 28] Instrction [3 26] Control RegDst Jmp Branch em
More informationPipelined Datapath. Reading. Sections Practice Problems: 1, 3, 8, 12 (2) Lecture notes from MKP, H. H. Lee and S.
Pipelined Datapath Lectre notes from KP, H. H. Lee and S. Yalamanchili Sections 4.5 4. Practice Problems:, 3, 8, 2 Reading (2) Pipeline Performance Assme time for stages is v ps for register read or write
More informationDesign of Digital Circuits Lecture 14: Microprogramming. Prof. Onur Mutlu ETH Zurich Spring April 2017
Design of Digital Circuits Lecture 4: Microprogramming Prof. Onur Mutlu ETH Zurich Spring 27 7 April 27 Agenda for Today & Next Few Lectures! Single-cycle Microarchitectures! Multi-cycle and Microprogrammed
More informationProcessor Design & ALU Design
3/8/2 Processor Design A. Sahu CSE, IIT Guwahati Please be updated with http://jatinga.iitg.ernet.in/~asahu/c22/ Outline Components of CPU Register, Multiplexor, Decoder, / Adder, substractor, Varity of
More informationProject Two RISC Processor Implementation ECE 485
Project Two RISC Processor Implementation ECE 485 Chenqi Bao Peter Chinetti November 6, 2013 Instructor: Professor Borkar 1 Statement of Problem This project requires the design and test of a RISC processor
More informationProblem Class 4. More State Machines (Problem Sheet 3 con t)
Problem Class 4 More State Machines (Problem Sheet 3 con t) Peter Cheng Department of Electrical & Electronic Engineering Imperial College London URL: www.ee.imperial.ac.k/pcheng/ee2_digital/ E-mail: p.cheng@imperial.ac.k
More informationFast Path-Based Neural Branch Prediction
Fast Path-Based Neral Branch Prediction Daniel A. Jiménez http://camino.rtgers.ed Department of Compter Science Rtgers, The State University of New Jersey Overview The context: microarchitectre Branch
More informationEC 413 Computer Organization
EC 413 Computer Organization rithmetic Logic Unit (LU) and Register File Prof. Michel. Kinsy Computing: Computer Organization The DN of Modern Computing Computer CPU Memory System LU Register File Disks
More informationLecture 13: Sequential Circuits, FSM
Lecture 13: Sequential Circuits, FSM Today s topics: Sequential circuits Finite state machines 1 Clocks A microprocessor is composed of many different circuits that are operating simultaneously if each
More information61C In the News. Processor Design: 5 steps
www.eetimes.com/electronics-news/23235/thailand-floods-take-toll-on--makers The Thai floods have already claimed the lives of hundreds of pele, with tens of thousands more having had to flee their homes
More informationPipelining. Traditional Execution. CS 365 Lecture 12 Prof. Yih Huang. add ld beq CS CS 365 2
Pipelining CS 365 Lecture 12 Prof. Yih Huang CS 365 1 Traditional Execution 1 2 3 4 1 2 3 4 5 1 2 3 add ld beq CS 365 2 1 Pipelined Execution 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5
More informationCSE Computer Architecture I
Single cycle Conrol Implemenaion CSE 332 Compuer Archiecure I l x I Lecure 7 - uli Cycle achines i i [ ] I I r l ichael Niemier Deparmen of Compuer Science and Engineering I ] i X.S. Hu 5- X.S. Hu 5-2
More informationOutcomes. Spiral 1 / Unit 2. Boolean Algebra BOOLEAN ALGEBRA INTRO. Basic Boolean Algebra Logic Functions Decoders Multiplexers
-2. -2.2 piral / Unit 2 Basic Boolean Algebra Logic Functions Decoders Multipleers Mark Redekopp Outcomes I know the difference between combinational and sequential logic and can name eamples of each.
More informationControl. Control. the ALU. ALU control signals 11/4/14. Next: control. We built the instrument. Now we read music and play it...
// CS 2, Fall 2! CS 2, Fall 2! We built the instrument. Now we read music and play it... A simple implementa/on uc/on uct r r 2 Write r Src Src Extend 6 Mem Next: path 7-2 CS 2, Fall 2! signals CS 2, Fall
More informationCprE 281: Digital Logic
CprE 28: Digital Logic Instructor: Alexander Stoytchev http://www.ece.iastate.edu/~alexs/classes/ Simple Processor CprE 28: Digital Logic Iowa State University, Ames, IA Copyright Alexander Stoytchev Digital
More informationDesign. Dr. A. Sahu. Indian Institute of Technology Guwahati
CS222: Processor Design: Multi Cycle Design Dr. A. Sahu Dept of Comp. Sc. & Engg. Indian Institute of Technology Guwahati Mid Semester Exam Multi Cycle design Outline Clock periods in single cycle and
More informationECE290 Fall 2012 Lecture 22. Dr. Zbigniew Kalbarczyk
ECE290 Fall 2012 Lecture 22 Dr. Zbigniew Kalbarczyk Today LC-3 Micro-sequencer (the control store) LC-3 Micro-programmed control memory LC-3 Micro-instruction format LC -3 Micro-sequencer (the circuitry)
More information4. (3) What do we mean when we say something is an N-operand machine?
1. (2) What are the two main ways to define performance? 2. (2) When dealing with control hazards, a prediction is not enough - what else is necessary in order to eliminate stalls? 3. (3) What is an "unbalanced"
More informationPipeline Datapath. With some slides from: John Lazzaro and Dan Garcia
Pipeline path With some slides from: John Lazzaro and Dan Garcia Gotta Do Landry Ann, Brian, Cathy, Dave each have one load of clothes to wash, dry, fold, and pt away Washer takes 3 mintes A B C D Dryer
More informationOn the circuit complexity of the standard and the Karatsuba methods of multiplying integers
On the circit complexity of the standard and the Karatsba methods of mltiplying integers arxiv:1602.02362v1 [cs.ds] 7 Feb 2016 Igor S. Sergeev The goal of the present paper is to obtain accrate estimates
More informationSpiral 1 / Unit 3
-3. Spiral / Unit 3 Minterm and Maxterms Canonical Sums and Products 2- and 3-Variable Boolean Algebra Theorems DeMorgan's Theorem Function Synthesis use Canonical Sums/Products -3.2 Outcomes I know the
More information3. (2) What is the difference between fixed and hybrid instructions?
1. (2 pts) What is a "balanced" pipeline? 2. (2 pts) What are the two main ways to define performance? 3. (2) What is the difference between fixed and hybrid instructions? 4. (2 pts) Clock rates have grown
More information[2] Predicting the direction of a branch is not enough. What else is necessary?
[2] When we talk about the number of operands in an instruction (a 1-operand or a 2-operand instruction, for example), what do we mean? [2] What are the two main ways to define performance? [2] Predicting
More informationFigure 4.9 MARIE s Datapath
Term Control Word Microoperation Hardwired Control Microprogrammed Control Discussion A set of signals that executes a microoperation. A register transfer or other operation that the CPU can execute in
More informationFRÉCHET KERNELS AND THE ADJOINT METHOD
PART II FRÉCHET KERNES AND THE ADJOINT METHOD 1. Setp of the tomographic problem: Why gradients? 2. The adjoint method 3. Practical 4. Special topics (sorce imaging and time reversal) Setp of the tomographic
More information[2] Predicting the direction of a branch is not enough. What else is necessary?
[2] What are the two main ways to define performance? [2] Predicting the direction of a branch is not enough. What else is necessary? [2] The power consumed by a chip has increased over time, but the clock
More informationTEST 1 REVIEW. Lectures 1-5
TEST 1 REVIEW Lectures 1-5 REVIEW Test 1 will cover lectures 1-5. There are 10 questions in total with the last being a bonus question. The questions take the form of short answers (where you are expected
More informationLecture 13: Sequential Circuits, FSM
Lecture 13: Sequential Circuits, FSM Today s topics: Sequential circuits Finite state machines Reminder: midterm on Tue 2/28 will cover Chapters 1-3, App A, B if you understand all slides, assignments,
More informationImplementing the Controller. Harvard-Style Datapath for DLX
6.823, L6--1 Implementing the Controller Laboratory for Computer Science M.I.T. http://www.csg.lcs.mit.edu/6.823 6.823, L6--2 Harvard-Style Datapath for DLX Src1 ( j / ~j ) Src2 ( R / RInd) RegWrite MemWrite
More informationCHAPTER log 2 64 = 6 lines/mux or decoder 9-2.* C = C 8 V = C 8 C * 9-4.* (Errata: Delete 1 after problem number) 9-5.
CHPTER 9 2008 Pearson Education, Inc. 9-. log 2 64 = 6 lines/mux or decoder 9-2.* C = C 8 V = C 8 C 7 Z = F 7 + F 6 + F 5 + F 4 + F 3 + F 2 + F + F 0 N = F 7 9-3.* = S + S = S + S S S S0 C in C 0 dder
More informationUNCERTAINTY FOCUSED STRENGTH ANALYSIS MODEL
8th International DAAAM Baltic Conference "INDUSTRIAL ENGINEERING - 19-1 April 01, Tallinn, Estonia UNCERTAINTY FOCUSED STRENGTH ANALYSIS MODEL Põdra, P. & Laaneots, R. Abstract: Strength analysis is a
More informationChapter 4 Supervised learning:
Chapter 4 Spervised learning: Mltilayer Networks II Madaline Other Feedforward Networks Mltiple adalines of a sort as hidden nodes Weight change follows minimm distrbance principle Adaptive mlti-layer
More informationFormal Verification of Systems-on-Chip
Formal Verification of Systems-on-Chip Wolfgang Kunz Department of Electrical & Computer Engineering University of Kaiserslautern, Germany Slide 1 Industrial Experiences Formal verification of Systems-on-Chip
More informationSection 7.4: Integration of Rational Functions by Partial Fractions
Section 7.4: Integration of Rational Fnctions by Partial Fractions This is abot as complicated as it gets. The Method of Partial Fractions Ecept for a few very special cases, crrently we have no way to
More informationComputer Architecture
Lecture 2: Iakovos Mavroidis Computer Science Department University of Crete 1 Previous Lecture CPU Evolution What is? 2 Outline Measurements and metrics : Performance, Cost, Dependability, Power Guidelines
More information1. (2 )Clock rates have grown by a factor of 1000 while power consumed has only grown by a factor of 30. How was this accomplished?
1. (2 )Clock rates have grown by a factor of 1000 while power consumed has only grown by a factor of 30. How was this accomplished? 2. (2 )What are the two main ways to define performance? 3. (2 )What
More informationA Second Datapath Example YH16
A Second Datapath Example YH16 Lecture 09 Prof. Yih Huang S365 1 A 16-Bit Architecture: YH16 A word is 16 bit wide 32 general purpose registers, 16 bits each Like MIPS, 0 is hardwired zero. 16 bit P 16
More informationBuilding a Computer. Quiz #2 on 10/31, open book and notes. (This is the last lecture covered) I wonder where this goes? L16- Building a Computer 1
Building a Computer I wonder where this goes? B LU MIPS Kit Quiz # on /3, open book and notes (This is the last lecture covered) Comp 4 Fall 7 /4/7 L6- Building a Computer THIS IS IT! Motivating Force
More informationReview: Single-Cycle Processor. Limits on cycle time
Review: Single-Cycle Processor Jump 3:26 5: MemtoReg Control Unit LUControl 2: Op Funct LUSrc RegDst PCJump PC 4 uction + PCPlus4 25:2 2:6 2:6 5: 5: 2 3 WriteReg 4: Src Src LU Zero LUResult Write + PC
More informationBLOOM S TAXONOMY. Following Bloom s Taxonomy to Assess Students
BLOOM S TAXONOMY Topic Following Bloom s Taonomy to Assess Stdents Smmary A handot for stdents to eplain Bloom s taonomy that is sed for item writing and test constrction to test stdents to see if they
More informationClock T FF1 T CL1 T FF2 T T T FF T T FF T CL T FF T CL T FF T T FF T T FF T CL. T cyc T H. Clock T FF T T FF T CL T FF T T FF T CL.
etup TA 60 c, vcc 3v Hold TA 30 c, vcc 5v Tkew TL TL TH FF FF 2 T cyc T H T L Clock TpdFF 2 TpdCL2Tetup FF Tcyc TL 2 2 TpdFF TpdCL Tetup FF2 TH 2 T FF T T FF T CL Hold L cd cd T FF T T FF T CL Hold L cd
More informationCPSC 3300 Spring 2017 Exam 2
CPSC 3300 Spring 2017 Exam 2 Name: 1. Matching. Write the correct term from the list into each blank. (2 pts. each) structural hazard EPIC forwarding precise exception hardwired load-use data hazard VLIW
More informationCMP 334: Seventh Class
CMP 334: Seventh Class Performance HW 5 solution Averages and weighted averages (review) Amdahl's law Ripple-carry adder circuits Binary addition Half-adder circuits Full-adder circuits Subtraction, negative
More informationFormal Verification of Systems-on-Chip Industrial Practices
Formal Verification of Systems-on-Chip Industrial Practices Wolfgang Kunz Department of Electrical & Computer Engineering University of Kaiserslautern, Germany Slide 1 Industrial Experiences Formal verification
More informationLecture 3, Performance
Lecture 3, Performance Repeating some definitions: CPI Clocks Per Instruction MHz megahertz, millions of cycles per second MIPS Millions of Instructions Per Second = MHz / CPI MOPS Millions of Operations
More informationFormal Methods for Deriving Element Equations
Formal Methods for Deriving Element Eqations And the importance of Shape Fnctions Formal Methods In previos lectres we obtained a bar element s stiffness eqations sing the Direct Method to obtain eact
More informationDiscontinuous Fluctuation Distribution for Time-Dependent Problems
Discontinos Flctation Distribtion for Time-Dependent Problems Matthew Hbbard School of Compting, University of Leeds, Leeds, LS2 9JT, UK meh@comp.leeds.ac.k Introdction For some years now, the flctation
More informationDepartment of Industrial Engineering Statistical Quality Control presented by Dr. Eng. Abed Schokry
Department of Indstrial Engineering Statistical Qality Control presented by Dr. Eng. Abed Schokry Department of Indstrial Engineering Statistical Qality Control C and U Chart presented by Dr. Eng. Abed
More informationWorkshop on Understanding and Evaluating Radioanalytical Measurement Uncertainty November 2007
1833-3 Workshop on Understanding and Evalating Radioanalytical Measrement Uncertainty 5-16 November 007 Applied Statistics: Basic statistical terms and concepts Sabrina BARBIZZI APAT - Agenzia per la Protezione
More informationLoad. Load. Load 1 0 MUX B. MB select. Bus A. A B n H select S 2:0 C S. G select 4 V C N Z. unit (ALU) G. Zero Detect.
9- Write D data Load eable A address A select B address B select Load R 2 2 Load Load R R2 UX 2 3 UX 2 3 2 3 Decoder D address 2 Costat i Destiatio select 28 Pearso Educatio, Ic.. orris ao & Charles R.
More informationCalculations involving a single random variable (SRV)
Calclations involving a single random variable (SRV) Example of Bearing Capacity q φ = 0 µ σ c c = 100kN/m = 50kN/m ndrained shear strength parameters What is the relationship between the Factor of Safety
More informationSISD SIMD. Flynn s Classification 8/8/2016. CS528 Parallel Architecture Classification & Single Core Architecture C P M
8/8/26 S528 arallel Architecture lassification & Single ore Architecture arallel Architecture lassification A Sahu Dept of SE, IIT Guwahati A Sahu Flynn s lassification SISD Architecture ategories M SISD
More informationLinear System Theory (Fall 2011): Homework 1. Solutions
Linear System Theory (Fall 20): Homework Soltions De Sep. 29, 20 Exercise (C.T. Chen: Ex.3-8). Consider a linear system with inpt and otpt y. Three experiments are performed on this system sing the inpts
More informationJoint Transfer of Energy and Information in a Two-hop Relay Channel
Joint Transfer of Energy and Information in a Two-hop Relay Channel Ali H. Abdollahi Bafghi, Mahtab Mirmohseni, and Mohammad Reza Aref Information Systems and Secrity Lab (ISSL Department of Electrical
More informationLecture 3, Performance
Repeating some definitions: Lecture 3, Performance CPI MHz MIPS MOPS Clocks Per Instruction megahertz, millions of cycles per second Millions of Instructions Per Second = MHz / CPI Millions of Operations
More informationCSCI-564 Advanced Computer Architecture
CSCI-564 Advanced Computer Architecture Lecture 8: Handling Exceptions and Interrupts / Superscalar Bo Wu Colorado School of Mines Branch Delay Slots (expose control hazard to software) Change the ISA
More informationDecoder Error Probability of MRD Codes
Decoder Error Probability of MRD Codes Maximilien Gadolea Department of Electrical and Compter Engineering Lehigh University Bethlehem, PA 18015 USA E-mail: magc@lehighed Zhiyan Yan Department of Electrical
More informationCOVER SHEET: Problem#: Points
EEL 4712 Midterm 3 Spring 2017 VERSION 1 Name: UFID: Sign here to give permission for your test to be returned in class, where others might see your score: IMPORTANT: Please be neat and write (or draw)
More informationLogic and Computer Design Fundamentals. Chapter 8 Sequencing and Control
Logic and Computer Design Fundamentals Chapter 8 Sequencing and Control Datapath and Control Datapath - performs data transfer and processing operations Control Unit - Determines enabling and sequencing
More informationLecture: Pipelining Basics
Lecture: Pipelining Basics Topics: Performance equations wrap-up, Basic pipelining implementation Video 1: What is pipelining? Video 2: Clocks and latches Video 3: An example 5-stage pipeline Video 4:
More informationClassify by number of ports and examine the possible structures that result. Using only one-port elements, no more than two elements can be assembled.
Jnction elements in network models. Classify by nmber of ports and examine the possible strctres that reslt. Using only one-port elements, no more than two elements can be assembled. Combining two two-ports
More informationLecture Notes: Finite Element Analysis, J.E. Akin, Rice University
9. TRUSS ANALYSIS... 1 9.1 PLANAR TRUSS... 1 9. SPACE TRUSS... 11 9.3 SUMMARY... 1 9.4 EXERCISES... 15 9. Trss analysis 9.1 Planar trss: The differential eqation for the eqilibrim of an elastic bar (above)
More informationChapter 2 Difficulties associated with corners
Chapter Difficlties associated with corners This chapter is aimed at resolving the problems revealed in Chapter, which are cased b corners and/or discontinos bondar conditions. The first section introdces
More informationSimple Instruction-Pipelining (cont.) Pipelining Jumps
6.823, L9--1 Simple ruction-pipelining (cont.) + Interrupts Updated March 6, 2000 Laboratory for Computer Science M.I.T. http://www.csg.lcs.mit.edu/6.823 Src1 ( j / ~j ) Src2 ( / Ind) Pipelining Jumps
More informationUNIVERSITY OF WISCONSIN MADISON
CS/ECE 252: INTRODUCTION TO COMPUTER ENGINEERING UNIVERSITY OF WISCONSIN MADISON Prof. Gurindar Sohi TAs: Minsub Shin, Lisa Ossian, Sujith Surendran Midterm Examination 2 In Class (50 minutes) Friday,
More informationHybrid modelling and model reduction for control & optimisation
Hybrid modelling and model redction for control & optimisation based on research done by RWTH-Aachen and TU Delft presented by Johan Grievink Models for control and optimiation market and environmental
More informationDecoder Error Probability of MRD Codes
Decoder Error Probability of MRD Codes Maximilien Gadolea Department of Electrical and Compter Engineering Lehigh University Bethlehem, PA 18015 USA E-mail: magc@lehigh.ed Zhiyan Yan Department of Electrical
More informationDesign at the Register Transfer Level
Week-7 Design at the Register Transfer Level Algorithmic State Machines Algorithmic State Machine (ASM) q Our design methodologies do not scale well to real-world problems. q 232 - Logic Design / Algorithmic
More informationLecture: Corporate Income Tax - Unlevered firms
Lectre: Corporate Income Tax - Unlevered firms Ltz Krschwitz & Andreas Löffler Disconted Cash Flow, Section 2.1, Otline 2.1 Unlevered firms Similar companies Notation 2.1.1 Valation eqation 2.1.2 Weak
More informationLecture Notes On THEORY OF COMPUTATION MODULE - 2 UNIT - 2
BIJU PATNAIK UNIVERSITY OF TECHNOLOGY, ODISHA Lectre Notes On THEORY OF COMPUTATION MODULE - 2 UNIT - 2 Prepared by, Dr. Sbhend Kmar Rath, BPUT, Odisha. Tring Machine- Miscellany UNIT 2 TURING MACHINE
More informationECE 3401 Lecture 23. Pipeline Design. State Table for 2-Cycle Instructions. Control Unit. ISA: Instruction Specifications (for reference)
ECE 3401 Lecture 23 Pipeline Design Control State Register Combinational Control Logic New/ Modified Control Word ISA: Instruction Specifications (for reference) P C P C + 1 I N F I R M [ P C ] E X 0 PC
More informationEXAMPLES 4/12/2018. The MIPS Pipeline. Hazard Summary. Show the pipeline diagram. Show the pipeline diagram. Pipeline Datapath and Control
The MIPS Pipeline CSCI206 - Computer Organization & Programming Pipeline Datapath and Control zybook: 11.6 Developed and maintained by the Bucknell University Computer Science Department - 2017 Hazard
More informationReflections on a mismatched transmission line Reflections.doc (4/1/00) Introduction The transmission line equations are given by
Reflections on a mismatched transmission line Reflections.doc (4/1/00) Introdction The transmission line eqations are given by, I z, t V z t l z t I z, t V z, t c z t (1) (2) Where, c is the per-nit-length
More informationIntrodction Finite elds play an increasingly important role in modern digital commnication systems. Typical areas of applications are cryptographic sc
A New Architectre for a Parallel Finite Field Mltiplier with Low Complexity Based on Composite Fields Christof Paar y IEEE Transactions on Compters, Jly 996, vol 45, no 7, pp 856-86 Abstract In this paper
More informationCSc 256 Midterm 2 Fall 2010
CSc 256 Midterm 2 Fall 2010 NAME: 1a)YouaregivenaMIPSbranchinstruction: x:beq$12,$0,y Theaddressofthelabel"y"is0x40013c.Thememorylocationat"x"contains: addresscontents 0x4002080001000110000000????????????????...whichrepresentsthebeqinstruction;the????...????arethe
More informationSimple Instruction-Pipelining. Pipelined Harvard Datapath
6.823, L8--1 Simple ruction-pipelining Updated March 6, 2000 Laboratory for Computer Science M.I.T. http://www.csg.lcs.mit.edu/6.823 Pipelined Harvard path 6.823, L8--2. fetch decode & eg-fetch execute
More informationAssignment Fall 2014
Assignment 5.086 Fall 04 De: Wednesday, 0 December at 5 PM. Upload yor soltion to corse website as a zip file YOURNAME_ASSIGNMENT_5 which incldes the script for each qestion as well as all Matlab fnctions
More informationSimple Instruction-Pipelining. Pipelined Harvard Datapath
6.823, L8--1 Simple ruction-pipelining Laboratory for Computer Science M.I.T. http://www.csg.lcs.mit.edu/6.823 Pipelined Harvard path 6.823, L8--2. I fetch decode & eg-fetch execute memory Clock period
More informationBasic Computer Organization and Design Part 3/3
Basic Computer Organization and Design Part 3/3 Adapted by Dr. Adel Ammar Computer Organization Interrupt Initiated Input/Output Open communication only when some data has to be passed --> interrupt. The
More informationLecture: Corporate Income Tax
Lectre: Corporate Income Tax Ltz Krschwitz & Andreas Löffler Disconted Cash Flow, Section 2.1, Otline 2.1 Unlevered firms Similar companies Notation 2.1.1 Valation eqation 2.1.2 Weak atoregressive cash
More informationMicroprocessor Power Analysis by Labeled Simulation
Microprocessor Power Analysis by Labeled Simulation Cheng-Ta Hsieh, Kevin Chen and Massoud Pedram University of Southern California Dept. of EE-Systems Los Angeles CA 989 Outline! Introduction! Problem
More informationOptimization in Predictive Control Algorithm
Latest rends in Circits, Systems, Sinal Processin and Atomatic Control Optimization in Predictive Control Alorithm JAN ANOŠ, MAREK KUBALČÍK omas Bata University in Zlín, Faclty of Applied Informatics Nám..
More informationDigital Image Processing. Lecture 8 (Enhancement in the Frequency domain) Bu-Ali Sina University Computer Engineering Dep.
Digital Image Processing Lectre 8 Enhancement in the Freqenc domain B-Ali Sina Uniersit Compter Engineering Dep. Fall 009 Image Enhancement In The Freqenc Domain Otline Jean Baptiste Joseph Forier The
More informationSetting The K Value And Polarization Mode Of The Delta Undulator
LCLS-TN-4- Setting The Vale And Polarization Mode Of The Delta Undlator Zachary Wolf, Heinz-Dieter Nhn SLAC September 4, 04 Abstract This note provides the details for setting the longitdinal positions
More informationChapter 3 MATHEMATICAL MODELING OF DYNAMIC SYSTEMS
Chapter 3 MATHEMATICAL MODELING OF DYNAMIC SYSTEMS 3. System Modeling Mathematical Modeling In designing control systems we mst be able to model engineered system dynamics. The model of a dynamic system
More informationBayes and Naïve Bayes Classifiers CS434
Bayes and Naïve Bayes Classifiers CS434 In this lectre 1. Review some basic probability concepts 2. Introdce a sefl probabilistic rle - Bayes rle 3. Introdce the learning algorithm based on Bayes rle (ths
More information