Software Defined Radio A High Performance Challenge
|
|
- Bathsheba Eaton
- 6 years ago
- Views:
Transcription
1 Software efined adio A High Performance Challenge PFL, Lausanne, Switzerland, September 2006 Yuan Lin 1, Hyunseok Lee 1, Mark oh 1, Yoav Harel 1, Sangwon Seo 1, ob Mullenix 1 Scott Mahlke 1, Trevor Mudge 1, Chaitali Chakrabarti 2, Krisztian Flautner 3 1 Advanced Computer Architecture Lab, University of Michigan 2 epartment of lectrical ngineering, Arizona State University 3 AM, Ltd. 1
2 Anatomy of 3G Cellular Phone luetooth SP+ASCs GPS SP+ASCs GPS Analog ASCs -CMA luetooth Analog Analog ASCs S aseband Frontend Processor -CMA ASCs GPP+SP+ASCs GPP+SPs Analog ASCs Software efined adio (S): Use of software routines instead of ASCs for physical layer operations of wireless protocol systems Analog ASCs Power Manager Power Manager Application Processor GPP+SP GPP Transport Network Link MAC Camera Keyboard isplay Speaker SP+ASCs SPs Microphone PHY 2
3 Advantages of Software efined adio Multi-mode operations Lower costs Faster time to market Prototyping and bug fixes Chip volumes Longevity of platforms Protocol complexity favors software dominated solutions nables future wireless communication innovations Cognitive radio 3
4 hy is S Challenging? 1000 S esign Objectives for 3G and ifi Throughput requirements 40Gops 100 peak throughput Peak Performance (Gops) Power budget 10 Mobile S equirements 100 Mops/m 10 Mops/m etter Power fficiency 100m~500m peak power T C6x mbedded SPs 1 Mops/m General Purpose Processors Pentium M M Cell High-end SPs Power (atts) 4
5 The Anatomy of ireless Protocols -CMA Physical Layer Processing Fron tend / A A / LPF-Tx scrambler spreader nterleaver searcher LPF-x descrambler despreader. descrambler despreader combiner Transmitter Channel encoder eceiver Channel deinteleaver decoder (turbo/viterbi) Upp er layers 1. Filtering: suppress signals outside frequency band 2. Modulation: map source information onto signal waveforms 3. Channel stimation: stimate channel condition for transceivers 4. rror Correction: correct errors induced by noisy channel 5
6 Spreader/espreader Allow the transmission of several signals at the same time. (x[n] and y[n] in the below diagram) t is based on the orthogonality between spreading codes c i [n] c i [n] N! 1 1 N & n= 0 " 1, if i = j ci[ n] # c j[ n] = $ % 0, otherwize <orthogonality between codes> x[n] y[n] c j [n] c j [n]!! x[n] y[n] spreader despreader 6
7 Scrambler/escrambler andomize the output signal by multiplying pseudo random sequence so called scrambling code. Allow multiple terminals to communicate at the same time. Amount of operation : ~ 3 Gops c sc,i [n] c* sc,i [n] x[n] Complex multiplication Complex multiplication x[n] y[n] Complex multiplication Complex multiplication y[n] c sc,j [n] Terminal 1, with scrambling code n Scrambler c* sc,j [n] Terminal 2, with scrambling code m escrambler 7
8 ake eceiver Multipath fading ake receiver mitigates multipath fading effect Multipath fading is a major cause of unreliable wireless channel characteristic x(t) y(t) = a 0 x(t)+a 1 x(t-d 1 )+a 2 x(t-d 2 ) 8
9 ake eceiver - Functions deally the function of rake receiver is to aggregate the signal terms with proper delay compensation y(t) = a 0 x(t)+a 1 x(t-d 1 )+a 2 x(t-d 2 ) ake receiver r(t) = a 0 x(t)+a 1 x(t-d 1 -d est1 )+a 2 x(t-d 2 -d est2 ) = (a 0 +a 1 +a 2 ) * x(t)! 0! 0 e need to know delay spread of received signal that randomly varies 9
10 ake eceiver etect elay Spread Scan the received signal in frame buffer while computing correlation with scrambling code sequence. Correlation window Correlation esult eceived signal y[ n] = a x[ n] + a x[ n! d ] + a x[ n! d ] a 1 a 2 a 0 0 d 1 d 2 10
11 S Application Specific esign ireless protocols are systems of SP algorithms System-level xample: Specification of -CMA CH channel Algorithm-level xample: mplementation of a 64 point FFT 11
12 System Level esign ecisions System Characteristics 1. Algorithm macro-pipelining with streaming computation 2. Multiple periodic real-time deadlines 3. Low streaming throughput between algorithms SOA Architectural ecisions 1. Multi-core system 2. Communication through MA 3. eterministic hardware behavior 4. Compile-time algorithm mapping and scheduling 5. Low throughput interconnect 4. Heterogeneous interalgorithm communication 6. Multi-level scratchpad memories 12
13 SOA System Architecture 4 Ps static kernel mapping and scheduling SM+Scalar units 1 AM GPP controller scalar algorithms and protocol controls AM System Architecture Local Local Local Mem Mem Mem P P P xecution xecution xecution Unit Unit Unit MA scalar scalar MM SM MM tos & Sto Scalar SM Global Mem Local Mem P xecution Unit SM 13
14 SOA Memory Organization 2-Level scratchpad memories 12K Local scratchpad memory for stream queues 64K global scratchpad memory for large buffers Low-throughput shared bus 200MHz 32-bit bus inter-p communication using MA AM scalar Local Mem P xecution Unit scalar MM Scalar System Architecture MA tos & Sto Local Mem P xecution Unit SM MM Local Mem P xecution Unit SM SM Global Mem Local Mem P xecution Unit 14
15 SP Algorithm Characteristics 8 to 16-bit precision Vector operations Algorithms Type of Computation -CMA Vector idth long vectors constant vector size Static data movement patterns Scalar operations Filter Modulation Channel st. rror Correction Filter Modulation (FFT) Channel st. rror Correction Vector Vector Vector Mixed a Vector Vector Mixed Mixed or
16 SOA P Architecture P SM pipeline SM Memory (8K) 32x 2 issue L (400MHz) - SM + (Scalar or AGU) MA: - mem-to-mem transfer - access global memory 32-way SM S S N nst. Mem. 4K Scalar pipeline AGU pipeline 12bit Vector to Scalar Stage 1 Scalar Memory (4K) AGU Vector to Scalar Stage 2 Scalar to Vector 512bit MA Scalar US AGU 16
17 SOA P SM Pipeline P SM pipeline SM Memory (8K) 32x 32-way SM S S N nst. Mem. 4K AGU pipeline 12bit 16-bit 16 entries 2 read/ 1 write port Scalar pipeline Vector to Scalar Stage 1 Scalar Memory (4K) AGU Vector to Scalar Stage 2 Scalar to Vector 16- bit 512bit MA 16-bit 40-bit ACC Scalar US AGU 17
18 SOA P SM Pipeline P SM pipeline SM Memory (8K) 32x 32-way SM S S N nst. Mem. 4K Scalar pipeline AGU pipeline 12bit Vector to Scalar Stage 1 Vector to Scalar Stage 2 Scalar to Vector SM: Scalar Memory (4K) - 32 AGU wide - predicated exec. - predicated neg. 18 MA Scalar 512bit Memory: US - 512bit AGU port - 1 read port - 1 write port - 8 Kytes
19 SOA P SM Shuffle Network P SM pipeline SM Memory (8K) 32x 32-way SM S S N nst. Mem. 4K Scalar pipeline AGU pipeline 12bit Vector to Scalar Stage 1 Scalar Memory (4K) AGU Vector to Scalar Stage 2 Scalar to Vector 512bit MA Scalar nverse Shuffle SM xchange terative Shuffle xchange Feedback Only xchange Network () (S) (S) US AGU 19
20 Floorplan 20
21 SOA P Scalar Pipeline P 32-way SM SM pipeline S S N SM Memory (8K) 32x Scalar: - One 16-bit datapath - No mult unit Scalar memory: - port - 1 read/write port - 4 Kytes Scalar-to-Vector Vector-to-Scalar nst. Mem. 4K Scalar pipeline AGU pipeline 12bit Vector to Scalar Stage 1 Scalar Memory (4K) AGU Vector to Scalar Stage 2 Scalar to Vector 512bit MA Scalar US AGU 21
22 -CMA Mapping On SOA -CMA Physical Layer Processing Frontend / A A / LPF-Tx scrambler spreader nterleaver searcher LPF-x descrambler despreader. descrambler despreader combiner Transmitter Channel encoder eceiver Channel deinteleaver decoder (turbo/viterbi) Upper layers AM P P P P Global Memory espreader einterleaver Combiner PN Code uffer T/ (1K ytes) uffer uffer (2K ytes) (1K ytes) 2 LPF-x Misc. Turbo Searcher Control ecoder uffer uffer (1K ytes) (10 ytes) e- Power scrambler Control uffer (1K ytes) Turbo ncoder nterleaver Spreader Scrambler 4 LPF-x FFO Queue (12.5 Kytes) uffer (20 Kytes) uffer (20 Kytes) CMA eceiver CMA Transmitter 22
23 S Performance istribution Computations (Mcycles) filter modulation channel estimation error correction -CMA (2Mbps) a (24Mbps) a has higher number of total computational cycles -CMA requires higher computational cycles per bit 23
24 Power Consumption at 180nm 1400 Power (m) in 180nm P ata Memory P SM P SM s P SM Pipeline P Others Global Memory System Others -CMA (2Mbps) a (24Mbps) ide SM requires higher number of pipeline registers a consumes higher power than -CMA 8-bit -CMA computation versus 16-bit a computation 24
25 Summary of Prototype Hardware S Hardware equirements Comp. requirements: 10 ~ 100 GOPS Sub-watt power budget: ~ 0.2 att for cellular phones SOA esults -CMA & a: 1.3 ~ 2 GOPS (with SOA L ops) 180nm: ~ 3 atts (area: 26.6mm 2 ) 90nm (est.): ~ 0.5 att (6.7 mm 2 ) Key features of SOA Multi-P with scratchpad memories Low throughput shared bus 2-issue L: SM+(Scalar or AGU) 32-wide SM processing SM shuffle network 25
26 Further Power Saving Opportunities Terminal operation modes dle mode >95% of the operation time active mode idle mode active mode voice call web browsing t awake period 50msec x sleep period 1~4 sec x t 26
27 Generalized Operation Flow pulse shaping synchronization channel quality measurement pulse shaping channel estimation demodulation cell selection channel decoding 27
28 Computation Patterns x[n] Z - 1 Z - 1 Z - 1 c 0 c 1... c L- 1 y[n] 28
29 SOA P Architecture P SM pipeline SM Memory (8K) 32x Move the SSN out of the critical path Voltage scaling longer compute times 10s of n possible 32-way SM S S N nst. Mem. 4K Scalar pipeline AGU pipeline 12bit Vector to Scalar Stage 1 Scalar Memory (4K) AGU Vector to Scalar Stage 2 Scalar to Vector 512bit MA Scalar US AGU 29
30 SOA Programming Model Hardware Multi-P organization Software managed memory ide SM arithmetic units Software/SP Algorithms eal-time computation Concurrent algorithm executions Vector computations 30
31 -CMA Mapping On SOA -CMA Physical Layer Processing Frontend / A A / LPF-Tx scrambler spreader nterleaver searcher LPF-x descrambler despreader. descrambler despreader combiner Transmitter Channel encoder eceiver Channel deinteleaver decoder (turbo/viterbi) Upper layers AM P P P P Global Memory espreader einterleaver Combiner PN Code uffer T/ (1K ytes) uffer uffer (2K ytes) (1K ytes) 2 LPF-x Misc. Turbo Searcher Control ecoder uffer uffer (1K ytes) (10 ytes) e- Power scrambler Control uffer (1K ytes) Turbo ncoder nterleaver Spreader Scrambler 4 LPF-x FFO Queue (12.5 Kytes) uffer (20 Kytes) uffer (20 Kytes) CMA eceiver CMA Transmitter 31
32 SP: Programming Language Two-tier model Protocol system level description Concurrent kernel objects Communication through channels Algorithm kernel level description First-class vector objects and operations Flexible data permutation patterns 32
33 System-level Compilation Control/data separation Control processor: AM ata engines: Ps Control processor Non-preemptive kernel scheduling MA scheduling ata engines Kernel executions 33
34 System-level Compilation Static kernel-to-p assignment Channel variables MA operations for inter-p communications Memory copies for intra-p communications eal-time kernel scheduling by control processor Kernel executions Non-preemptive Software managed kernel contexts 34
35 Kernel-level Compilation Transforming vector computation into SM operations Mapping data movements with SM permutation operations Scheduling AGU operations for memory access 35
36 Prototype completed in August ehavioral synthesis Language by ecember eferences Future ork H. Lee, Y. Lin, Y. Harel, M. oh, S. Mahlke, T. Mudge, K. Flautner. Software defined radio - A high performance embedded challenge. Proc. 1st nt. Conf. High Performance mbedded Architectures and Compilers (HiPAC'05), arcelona, Spain, Nov ds. T. Conte, N. Navarro,.. Hwu, M. Valero, T. Ungerer. Lecture Notes in Computer Science, Springer-Verlag GmbH, vol / 2005, pp H. Lee, T. Mudge. A dual processor solution for the MAC layer of a software defined radio terminal. Proc. Conf. Compiler and Architecture Support for mbedded Systems (CASS'05), Sep. 2005, CA, pp Y. Lin, H. Lee, Y. Harel, M. oh, N. aron, S. Mahlke, T. Mudge, K. Flautner. A system solution for highperformance, low-power S. Proc S Technical Conf., Annaheim, CA, Nov. 2005, 6 pp. Y. Lin, H. Lee, M. oh, Y. Harel, S. Mahlke, T. Mudge, C. Chakrabarti, K Flautner. SOA: A low-power architecture for software radio. Proc. 33rd Ann. nt. Symp. on Computer Architecture, oston, MA USA, June H. Lee, C. Chakrabarti, T. Mudge. educing idle mode power in software defined radio terminals. Proc. nt. Symp. Low Power lectronic esign (SLP), to appear. 36
37 Questions? 37
38 ackup Slides 38
39 ifferent Levels of Software adio Tier Tier 0 Tier 1 Tier 2 Tier 3 Tier 4 Name Hardware adio (H) Software Controlled adio (SC) Software efined adio (S) deal Software adio (S) Ultimate Software adio (US) escription mplemented using hardware components. Cannot be modified Only control functions are implemented in software: inter-connects, power levels, etc. Software control of a variety of modulation techniques, wide-band or narrow-band operation, security functions, etc. Programmability extends to the entire system with analog conversion only at the antenna. efined for comparison purposes only <source: 39
40 Power Methodology Our flow sequence was esign Compiler and Silicon nsemble For nitial Floorplan stimation Physical Compiler For placement and Optimization Silicon nsemble outing e optimized for power and delay locks like memory were generated with Artisan Memory Generators e used the Synopsys P locks as much as possible to get better compiled blocks 40
41 Time 1 -CMA frame (15 slots), 10 msec Power Control eal-time Critical Path (0.67 msec) Searcher eal-time Critical Path (5 msec) PN Code <0.1 msec Power Control <0.1 msec einterleaver 0.2 msec A M P P N C P N P P N C P P N C P N P C P P N C P N P C P P N C P P N C P N P C P P N C P N P C P P N C P P N C P N P C P 1 F M O F M O F M O F M O F M O F M O F M O F M O F M O F M O F M O F M O F M O F M O P 2 Searcher 5 msec F(x) 0.3 msec emodulation 0.1 msec P 3 Turbo ecoder 10 msec P 4 N C Modulation F Turbo ncoder 0.2 msec nterleaver 0.2 msec Modulation 1 msec F(Tx) 8 msec nput from A/ AM: PN/Power Ctrl. P1 F/Mod. P2 Searcher P3 Turbo P4 T P N SYNC P1, AM F SYNC P1, P2, AM P N M F P N 1 slot 0.67 msec SYNC P1, AM M F P N SYNC P1, AM M Searcher F P N SYNC P1, AM M SYNC P1, AM P C 41
CS434/534: Topics in Networked (Networking) Systems
CS434/534: Topics in Networked (Networking) Systems Network OS Abstraction: From Data to Function Store; Wireless Foundation: Frequency-Domain Analysis Yang (Richard) Yang Computer Science Department Yale
More informationWireless Information Transmission System Lab. Channel Estimation. Institute of Communications Engineering. National Sun Yat-sen University
Wireless Information Transmission System Lab. Channel Estimation Institute of Communications Engineering National Sun Yat-sen University Table of Contents Introduction to Channel Estimation Generic Pilot
More informationNCU EE -- DSP VLSI Design. Tsung-Han Tsai 1
NCU EE -- DSP VLSI Design. Tsung-Han Tsai 1 Multi-processor vs. Multi-computer architecture µp vs. DSP RISC vs. DSP RISC Reduced-instruction-set Register-to-register operation Higher throughput by using
More informationL15: Custom and ASIC VLSI Integration
L15: Custom and ASIC VLSI Integration Average Cost of one transistor 10 1 0.1 0.01 0.001 0.0001 0.00001 $ 0.000001 Gordon Moore, Keynote Presentation at ISSCC 2003 0.0000001 '68 '70 '72 '74 '76 '78 '80
More informationEmbedded Systems Design: Optimization Challenges. Paul Pop Embedded Systems Lab (ESLAB) Linköping University, Sweden
of /4 4 Embedded Systems Design: Optimization Challenges Paul Pop Embedded Systems Lab (ESLAB) Linköping University, Sweden Outline! Embedded systems " Example area: automotive electronics " Embedded systems
More informationPerformance, Power & Energy. ELEC8106/ELEC6102 Spring 2010 Hayden Kwok-Hay So
Performance, Power & Energy ELEC8106/ELEC6102 Spring 2010 Hayden Kwok-Hay So Recall: Goal of this class Performance Reconfiguration Power/ Energy H. So, Sp10 Lecture 3 - ELEC8106/6102 2 PERFORMANCE EVALUATION
More informationPerformance, Power & Energy
Recall: Goal of this class Performance, Power & Energy ELE8106/ELE6102 Performance Reconfiguration Power/ Energy Spring 2010 Hayden Kwok-Hay So H. So, Sp10 Lecture 3 - ELE8106/6102 2 What is good performance?
More informationTopic 3. Design of Sequences with Low Correlation
Topic 3. Design of Sequences with Low Correlation M-sequences and Quadratic Residue Sequences 2 Multiple Trace Term Sequences and WG Sequences 3 Gold-pair, Kasami Sequences, and Interleaved Sequences 4
More informationChe-Wei Chang Department of Computer Science and Information Engineering, Chang Gung University
Che-Wei Chang chewei@mail.cgu.edu.tw Department of Computer Science and Information Engineering, Chang Gung University } 2017/11/15 Midterm } 2017/11/22 Final Project Announcement 2 1. Introduction 2.
More informationProfessor Fearing EECS150/Problem Set Solution Fall 2013 Due at 10 am, Thu. Oct. 3 (homework box under stairs)
Professor Fearing EECS50/Problem Set Solution Fall 203 Due at 0 am, Thu. Oct. 3 (homework box under stairs). (25 pts) List Processor Timing. The list processor as discussed in lecture is described in RT
More informationA High-Yield Area-Power Efficient DWT Hardware for Implantable Neural Interface Applications
Neural Engineering 27 A High-Yield Area-Power Efficient DWT Hardware for Implantable Neural Interface Applications Awais M. Kamboh, Andrew Mason, Karim Oweiss {Kambohaw, Mason, Koweiss} @msu.edu Department
More informationCSE370: Introduction to Digital Design
CSE370: Introduction to Digital Design Course staff Gaetano Borriello, Brian DeRenzi, Firat Kiyak Course web www.cs.washington.edu/370/ Make sure to subscribe to class mailing list (cse370@cs) Course text
More informationVLSI Signal Processing
VLSI Signal Processing Lecture 1 Pipelining & Retiming ADSP Lecture1 - Pipelining & Retiming (cwliu@twins.ee.nctu.edu.tw) 1-1 Introduction DSP System Real time requirement Data driven synchronized by data
More informationOptimizing Loop Operation and Dataflow in FPGA Acceleration of Deep Convolutional Neural Networks
2017 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays Optimizing Loop Operation and Dataflow in FPGA Acceleration of Deep Convolutional Neural Networks Yufei Ma, Yu Cao, Sarma Vrudhula,
More informationDigital Signal Processing for Embedded Communications and Biomedical Systems
igital Signal Processing for Embedded Communications and Biomedical Systems Keshab K. Parhi istinguished McKnight University Professor University of Minnesota, Minneapolis http://www.ece.umn.edu/users/parhi
More informationGPU Acceleration of Cutoff Pair Potentials for Molecular Modeling Applications
GPU Acceleration of Cutoff Pair Potentials for Molecular Modeling Applications Christopher Rodrigues, David J. Hardy, John E. Stone, Klaus Schulten, Wen-Mei W. Hwu University of Illinois at Urbana-Champaign
More informationSchool of EECS Seoul National University
4!4 07$ 8902808 3 School of EECS Seoul National University Introduction Low power design 3974/:.9 43 Increasing demand on performance and integrity of VLSI circuits Popularity of portable devices Low power
More informationDSP Design Lecture 7. Unfolding cont. & Folding. Dr. Fredrik Edman.
SP esign Lecture 7 Unfolding cont. & Folding r. Fredrik Edman fredrik.edman@eit.lth.se Unfolding Unfolding creates a program with more than one iteration, J=unfolding factor Unfolding is a structured way
More informationCIS 371 Computer Organization and Design
CIS 371 Computer Organization and Design Unit 13: Power & Energy Slides developed by Milo Mar0n & Amir Roth at the University of Pennsylvania with sources that included University of Wisconsin slides by
More informationPipelined Viterbi Decoder Using FPGA
Research Journal of Applied Sciences, Engineering and Technology 5(4): 1362-1372, 2013 ISSN: 2040-7459; e-issn: 2040-7467 Maxwell Scientific Organization, 2013 Submitted: July 05, 2012 Accepted: August
More informationAN IMPROVED LOW LATENCY SYSTOLIC STRUCTURED GALOIS FIELD MULTIPLIER
Indian Journal of Electronics and Electrical Engineering (IJEEE) Vol.2.No.1 2014pp1-6 available at: www.goniv.com Paper Received :05-03-2014 Paper Published:28-03-2014 Paper Reviewed by: 1. John Arhter
More informationMapper & De-Mapper System Document
Mapper & De-Mapper System Document Mapper / De-Mapper Table of Contents. High Level System and Function Block. Mapper description 2. Demodulator Function block 2. Decoder block 2.. De-Mapper 2..2 Implementation
More informationComparative Analysis of Equalization Methods for SC-FDMA
Comparative Analysis of Equalization Methods for SC-MA Anton ogadaev, Alexander Kozlov SUA Russia Email: {dak, akozlov}@vu.spb.ru Ann Ukhanova U otonik enmark Email: annuk@fotonik.dtu.dk Abstract n this
More informationN E T W O R K S T A N D A R D S UNIT 5. Ethernet LANs & P R O T O C O L S DPW
UI 5 thernet s onna arren opics for this Unit thernet History ypical network components functionality versus technologies and specifications he advantages and disadvantages of thernet technology in s dvantages
More informationLecture 2. Fading Channel
1 Lecture 2. Fading Channel Characteristics of Fading Channels Modeling of Fading Channels Discrete-time Input/Output Model 2 Radio Propagation in Free Space Speed: c = 299,792,458 m/s Isotropic Received
More informationECEN 651: Microprogrammed Control of Digital Systems Department of Electrical and Computer Engineering Texas A&M University
ECEN 651: Microprogrammed Control of Digital Systems Department of Electrical and Computer Engineering Texas A&M University Prof. Mi Lu TA: Ehsan Rohani Laboratory Exercise #4 MIPS Assembly and Simulation
More information9. Datapath Design. Jacob Abraham. Department of Electrical and Computer Engineering The University of Texas at Austin VLSI Design Fall 2017
9. Datapath Design Jacob Abraham Department of Electrical and Computer Engineering The University of Texas at Austin VLSI Design Fall 2017 October 2, 2017 ECE Department, University of Texas at Austin
More informationTracking of Spread Spectrum Signals
Chapter 7 Tracking of Spread Spectrum Signals 7. Introduction As discussed in the last chapter, there are two parts to the synchronization process. The first stage is often termed acquisition and typically
More informationChannel Coding and Interleaving
Lecture 6 Channel Coding and Interleaving 1 LORA: Future by Lund www.futurebylund.se The network will be free for those who want to try their products, services and solutions in a precommercial stage.
More informationEEC 216 Lecture #2: Metrics and Logic Level Power Estimation. Rajeevan Amirtharajah University of California, Davis
EEC 216 Lecture #2: Metrics and Logic Level Power Estimation Rajeevan Amirtharajah University of California, Davis Announcements PS1 available online tonight R. Amirtharajah, EEC216 Winter 2008 2 Outline
More informationSP-CNN: A Scalable and Programmable CNN-based Accelerator. Dilan Manatunga Dr. Hyesoon Kim Dr. Saibal Mukhopadhyay
SP-CNN: A Scalable and Programmable CNN-based Accelerator Dilan Manatunga Dr. Hyesoon Kim Dr. Saibal Mukhopadhyay Motivation Power is a first-order design constraint, especially for embedded devices. Certain
More informationFPGA Implementation of Pseudo Noise Sequences based on Quadratic Residue Theory
FPGA Implementation of Pseudo Noise Sequences based on Quadratic Residue Theory A. Rajagopal Dept. of E&C, K.L. Sudha Dept.of E&C, Dundi Ajay Dept. of E&C, ABSTRACT Pseudo Noise (PN) sequences are defined
More informationMultiple-Input Multiple-Output Systems
Multiple-Input Multiple-Output Systems What is the best way to use antenna arrays? MIMO! This is a totally new approach ( paradigm ) to wireless communications, which has been discovered in 95-96. Performance
More informationGiuseppe Bianchi, Ilenia Tinnirello
Capacity of WLAN Networs Summary Per-node throughput in case of: Full connected networs each node sees all the others Generic networ topology not all nodes are visible Performance Analysis of single-hop
More informationIssues on Timing and Clocking
ECE152B TC 1 Issues on Timing and Clocking X Combinational Logic Z... clock clock clock period ECE152B TC 2 Latch and Flip-Flop L CK CK 1 L1 1 L2 2 CK CK CK ECE152B TC 3 Clocking X Combinational Logic...
More informationCS 700: Quantitative Methods & Experimental Design in Computer Science
CS 700: Quantitative Methods & Experimental Design in Computer Science Sanjeev Setia Dept of Computer Science George Mason University Logistics Grade: 35% project, 25% Homework assignments 20% midterm,
More informationTransformation Techniques for Real Time High Speed Implementation of Nonlinear Algorithms
International Journal of Electronics and Communication Engineering. ISSN 0974-66 Volume 4, Number (0), pp.83-94 International Research Publication House http://www.irphouse.com Transformation Techniques
More informationIntroduction to Wireless & Mobile Systems. Chapter 4. Channel Coding and Error Control Cengage Learning Engineering. All Rights Reserved.
Introduction to Wireless & Mobile Systems Chapter 4 Channel Coding and Error Control 1 Outline Introduction Block Codes Cyclic Codes CRC (Cyclic Redundancy Check) Convolutional Codes Interleaving Information
More informationEPOC Upstream Mapping Part 2
EPOC Upstream Mapping Part 2 Ed Boyd, Broadcom Burst Anatomy UPSTREAM BURST MARKERS EPON Review The EPON OLT locks to the upstream without information from the MAC. The is able to determine 1G or 10G EPON
More informationNumbering Systems. Computational Platforms. Scaling and Round-off Noise. Special Purpose. here that is dedicated architecture
Computational Platforms Numbering Systems Basic Building Blocks Scaling and Round-off Noise Computational Platforms Viktor Öwall viktor.owall@eit.lth.seowall@eit lth Standard Processors or Special Purpose
More informationPrinciples of Communications Lecture 8: Baseband Communication Systems. Chih-Wei Liu 劉志尉 National Chiao Tung University
Principles of Communications Lecture 8: Baseband Communication Systems Chih-Wei Liu 劉志尉 National Chiao Tung University cwliu@twins.ee.nctu.edu.tw Outlines Introduction Line codes Effects of filtering Pulse
More informationReal-Time Scheduling and Resource Management
ARTIST2 Summer School 2008 in Europe Autrans (near Grenoble), France September 8-12, 2008 Real-Time Scheduling and Resource Management Lecturer: Giorgio Buttazzo Full Professor Scuola Superiore Sant Anna
More informationComparative Performance of Three DSSS/Rake Modems Over Mobile UWB Dense Multipath Channels
MTR 6B5 MITRE TECHNICAL REPORT Comparative Performance of Three DSSS/Rake Modems Over Mobile UWB Dense Multipath Channels June 5 Phillip A. Bello Sponsor: Contract No.: FA871-5-C-1 Dept. No.: E53 Project
More informationLecture 4 Capacity of Wireless Channels
Lecture 4 Capacity of Wireless Channels I-Hsiang Wang ihwang@ntu.edu.tw 3/0, 014 What we have learned So far: looked at specific schemes and techniques Lecture : point-to-point wireless channel - Diversity:
More informationDirect-Sequence Spread-Spectrum
Chapter 3 Direct-Sequence Spread-Spectrum In this chapter we consider direct-sequence spread-spectrum systems. Unlike frequency-hopping, a direct-sequence signal occupies the entire bandwidth continuously.
More informationEEC 216 Lecture #3: Power Estimation, Interconnect, & Architecture. Rajeevan Amirtharajah University of California, Davis
EEC 216 Lecture #3: Power Estimation, Interconnect, & Architecture Rajeevan Amirtharajah University of California, Davis Outline Announcements Review: PDP, EDP, Intersignal Correlations, Glitching, Top
More informationTiming Issues. Digital Integrated Circuits A Design Perspective. Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolić. January 2003
Digital Integrated Circuits A Design Perspective Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolić Timing Issues January 2003 1 Synchronous Timing CLK In R Combinational 1 R Logic 2 C in C out Out 2
More informationArithmetic Building Blocks
rithmetic uilding locks Datapath elements dder design Static adder Dynamic adder Multiplier design rray multipliers Shifters, Parity circuits ECE 261 Krish Chakrabarty 1 Generic Digital Processor Input-Output
More informationGMU, ECE 680 Physical VLSI Design 1
ECE680: Physical VLSI Design Chapter VII Timing Issues in Digital Circuits (chapter 10 in textbook) GMU, ECE 680 Physical VLSI Design 1 Synchronous Timing (Fig. 10 1) CLK In R Combinational 1 R Logic 2
More informationEstimation of the Capacity of Multipath Infrared Channels
Estimation of the Capacity of Multipath Infrared Channels Jeffrey B. Carruthers Department of Electrical and Computer Engineering Boston University jbc@bu.edu Sachin Padma Department of Electrical and
More informationDigital Systems Roberto Muscedere Images 2013 Pearson Education Inc. 1
Digital Systems Digital systems have such a prominent role in everyday life The digital age The technology around us is ubiquitous, that is we don t even notice it anymore Digital systems are used in:
More informationL16: Power Dissipation in Digital Systems. L16: Spring 2007 Introductory Digital Systems Laboratory
L16: Power Dissipation in Digital Systems 1 Problem #1: Power Dissipation/Heat Power (Watts) 100000 10000 1000 100 10 1 0.1 4004 80088080 8085 808686 386 486 Pentium proc 18KW 5KW 1.5KW 500W 1971 1974
More informationDigital Integrated Circuits A Design Perspective. Arithmetic Circuits. Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic.
Digital Integrated Circuits A Design Perspective Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic Arithmetic Circuits January, 2003 1 A Generic Digital Processor MEM ORY INPUT-OUTPUT CONTROL DATAPATH
More informationTHE UNIVERSITY OF MICHIGAN. Faster Static Timing Analysis via Bus Compression
Faster Static Timing Analysis via Bus Compression by David Van Campenhout and Trevor Mudge CSE-TR-285-96 THE UNIVERSITY OF MICHIGAN Computer Science and Engineering Division Department of Electrical Engineering
More informationIEEE abc-01/22r1
Project IEEE 802.16 Broadband Wireless Access Working Group Title An ARQ proposal for consideration by TG3/4 MAC group Date Submitted 2001-08 -31 Source(s) Yigal Leiba Itzik Kitroser
More informationDelay and Energy Consumption Analysis of Conventional SRAM
World Academy of Science, Engineering and Technology 13 8 Delay and Energy Consumption Analysis of Conventional SAM Arash Azizi-Mazreah, Mohammad T. Manzuri Shalmani, Hamid Barati, and Ali Barati Abstract
More informationMark Redekopp, All rights reserved. Lecture 1 Slides. Intro Number Systems Logic Functions
Lecture Slides Intro Number Systems Logic Functions EE 0 in Context EE 0 EE 20L Logic Design Fundamentals Logic Design, CAD Tools, Lab tools, Project EE 357 EE 457 Computer Architecture Using the logic
More informationAdministrivia. Course Objectives. Overview. Lecture Notes Week markem/cs333/ 2. Staff. 3. Prerequisites. 4. Grading. 1. Theory and application
Administrivia 1. markem/cs333/ 2. Staff 3. Prerequisites 4. Grading Course Objectives 1. Theory and application 2. Benefits 3. Labs TAs Overview 1. What is a computer system? CPU PC ALU System bus Memory
More informationDSP Design Lecture 5. Dr. Fredrik Edman.
SP esign SP esign Lecture 5 Retiming r. Fredrik Edman fredrik.edman@eit.lth.se Fredrik Edman, ept. of Electrical and Information Technology, Lund University, Sweden-www.eit.lth.se SP esign Repetition Critical
More informationPerformance Metrics for Computer Systems. CASS 2018 Lavanya Ramapantulu
Performance Metrics for Computer Systems CASS 2018 Lavanya Ramapantulu Eight Great Ideas in Computer Architecture Design for Moore s Law Use abstraction to simplify design Make the common case fast Performance
More informationMethods and tools to optimize the trade-off performance versus complexity of error control codes architectures.
Methods and tools to optimize the trade-off performance versus complexity of error control codes architectures. Emmanuel Boutillon CNRS, UMR 6285, Lab-STICC Centre de Recherche - BP 92116 F-56321 Lorient
More informationThe Linear-Feedback Shift Register
EECS 141 S02 Timing Project 2: A Random Number Generator R R R S 0 S 1 S 2 1 0 0 0 1 0 1 0 1 1 1 0 1 1 1 0 1 1 0 0 1 1 0 0 The Linear-Feedback Shift Register 1 Project Goal Design a 4-bit LFSR SPEED, SPEED,
More informationJoint FEC Encoder and Linear Precoder Design for MIMO Systems with Antenna Correlation
Joint FEC Encoder and Linear Precoder Design for MIMO Systems with Antenna Correlation Chongbin Xu, Peng Wang, Zhonghao Zhang, and Li Ping City University of Hong Kong 1 Outline Background Mutual Information
More informationRetiming. delay elements in a circuit without affecting the input/output characteristics of the circuit.
Chapter Retiming NCU EE -- SP VLSI esign. Chap. Tsung-Han Tsai 1 Retiming & A transformation techniques used to change the locations of delay elements in a circuit without affecting the input/output characteristics
More information! Inherent parallelism in hardware " Parallel computation beyond 61C. ! Counterpoint to software design " Furthering our understanding of computation
Welcome to EECS 150: Components and Design Techniques for Digital Systems! Course staff Randy Katz (Instructor), Jeff Kalvass (dmin Head T), llen Lee/Neil Warren (Project Co-Head Ts) Teaching ssistants:
More informationENERGY EFFICIENT TASK SCHEDULING OF SEND- RECEIVE TASK GRAPHS ON DISTRIBUTED MULTI- CORE PROCESSORS WITH SOFTWARE CONTROLLED DYNAMIC VOLTAGE SCALING
ENERGY EFFICIENT TASK SCHEDULING OF SEND- RECEIVE TASK GRAPHS ON DISTRIBUTED MULTI- CORE PROCESSORS WITH SOFTWARE CONTROLLED DYNAMIC VOLTAGE SCALING Abhishek Mishra and Anil Kumar Tripathi Department of
More informationEE216B: VLSI Signal Processing. FFT Processors. Prof. Dejan Marković FFT: Background
4/30/0 EE6B: VLSI Signal Processing FFT Processors Prof. Dejan Marković ee6b@gmail.com FFT: Background A bit of history 805 - algorithm first described by Gauss 965 - algorithm rediscovered (not for the
More informationSerial Parallel Multiplier Design in Quantum-dot Cellular Automata
Serial Parallel Multiplier Design in Quantum-dot Cellular Automata Heumpil Cho and Earl E. Swartzlander, Jr. Application Specific Processor Group Department of Electrical and Computer Engineering The University
More informationApproximate Queueing Model for Multi-rate Multi-user MIMO systems.
An Approximate Queueing Model for Multi-rate Multi-user MIMO systems Boris Bellalta,Vanesa Daza, Miquel Oliver Abstract A queueing model for Multi-rate Multi-user MIMO systems is presented. The model is
More informationAdministrative Stuff
EE141- Spring 2004 Digital Integrated Circuits Lecture 30 PERSPECTIVES 1 Administrative Stuff Homework 10 posted just for practice. No need to turn in (hw 9 due today). Normal office hours next week. HKN
More informationCoding theory: Applications
INF 244 a) Textbook: Lin and Costello b) Lectures (Tu+Th 12.15-14) covering roughly Chapters 1,9-12, and 14-18 c) Weekly exercises: For your convenience d) Mandatory problem: Programming project (counts
More informationToday. ESE532: System-on-a-Chip Architecture. Energy. Message. Preclass Challenge: Power. Energy Today s bottleneck What drives Efficiency of
ESE532: System-on-a-Chip Architecture Day 20: November 8, 2017 Energy Today Energy Today s bottleneck What drives Efficiency of Processors, FPGAs, accelerators How does parallelism impact energy? 1 2 Message
More informationEECS 579: Logic and Fault Simulation. Simulation
EECS 579: Logic and Fault Simulation Simulation: Use of computer software models to verify correctness Fault Simulation: Use of simulation for fault analysis and ATPG Circuit description Input data for
More informationEnergy-Efficient Real-Time Task Scheduling in Multiprocessor DVS Systems
Energy-Efficient Real-Time Task Scheduling in Multiprocessor DVS Systems Jian-Jia Chen *, Chuan Yue Yang, Tei-Wei Kuo, and Chi-Sheng Shih Embedded Systems and Wireless Networking Lab. Department of Computer
More informationFPGA-based Niederreiter Cryptosystem using Binary Goppa Codes
FPGA-based Niederreiter Cryptosystem using Binary Goppa Codes Wen Wang 1, Jakub Szefer 1, and Ruben Niederhagen 2 1. Yale University, USA 2. Fraunhofer Institute SIT, Germany April 9, 2018 PQCrypto 2018
More informationChapter 8. Low-Power VLSI Design Methodology
VLSI Design hapter 8 Low-Power VLSI Design Methodology Jin-Fu Li hapter 8 Low-Power VLSI Design Methodology Introduction Low-Power Gate-Level Design Low-Power Architecture-Level Design Algorithmic-Level
More informationAn Ultrafast Optical Digital Technology
An Ultrafast Optical Digital Technology Smart Light Stanford University EE380 Corporation 2.28.07 Overview Overview Background What and How methodology optical state machines Why smart speed power size
More informationEE241 - Spring 2006 Advanced Digital Integrated Circuits
EE241 - Spring 2006 Advanced Digital Integrated Circuits Lecture 20: Asynchronous & Synchronization Self-timed and Asynchronous Design Functions of clock in synchronous design 1) Acts as completion signal
More informationAnnotation Integration and Trade-off Analysis for Multimedia Applications
Annotation Integration and Trade-off Analysis for Multimedia Applications Radu Cornea, Alex Nicolau, Nikil Dutt School of Information & Computer Science University of California, Irvine Introduction and
More informationBit Error Rate Estimation for a Joint Detection Receiver in the Downlink of UMTS/TDD
in Proc. IST obile & Wireless Comm. Summit 003, Aveiro (Portugal), June. 003, pp. 56 60. Bit Error Rate Estimation for a Joint Detection Receiver in the Downlink of UTS/TDD K. Kopsa, G. atz, H. Artés,
More informationPerformance Analysis of Spread Spectrum CDMA systems
1 Performance Analysis of Spread Spectrum CDMA systems 16:33:546 Wireless Communication Technologies Spring 5 Instructor: Dr. Narayan Mandayam Summary by Liang Xiao lxiao@winlab.rutgers.edu WINLAB, Department
More informationTU Wien. Energy Efficiency. H. Kopetz 26/11/2009 Model of a Real-Time System
TU Wien 1 Energy Efficiency Outline 2 Basic Concepts Energy Estimation Hardware Scaling Power Gating Software Techniques Energy Sources 3 Why are Energy and Power Awareness Important? The widespread use
More informationBlind Source Separation with a Time-Varying Mixing Matrix
Blind Source Separation with a Time-Varying Mixing Matrix Marcus R DeYoung and Brian L Evans Department of Electrical and Computer Engineering The University of Texas at Austin 1 University Station, Austin,
More informationWireless Communication Technologies 16:332:559 (Advanced Topics in Communications) Lecture #17 and #18 (April 1, April 3, 2002)
Wireless Communication echnologies Lecture 7 & 8 Wireless Communication echnologies 6:33:559 (Advanced opics in Communications) Lecture #7 and #8 (April, April 3, ) Instructor rof. arayan Mandayam Summarized
More informationCSCI-564 Advanced Computer Architecture
CSCI-564 Advanced Computer Architecture Lecture 8: Handling Exceptions and Interrupts / Superscalar Bo Wu Colorado School of Mines Branch Delay Slots (expose control hazard to software) Change the ISA
More informationCircuit Modeling for Practical Many-core Architecture Design Exploration
Circuit Modeling for Practical Many-core Architecture Design Exploration Redefining design abstractions Dean Truong Bevan Baas VLSI Computation Lab University of California, Davis Outline Motivation Circuit
More informationGiuseppe Bianchi, Ilenia Tinnirello
Capacity of WLAN Networs Summary Ł Ł Ł Ł Arbitrary networ capacity [Gupta & Kumar The Capacity of Wireless Networs ] Ł! Ł "! Receiver Model Ł Ł # Ł $%&% Ł $% '( * &%* r (1+ r Ł + 1 / n 1 / n log n Area
More informationECS455: Chapter 5 OFDM. ECS455: Chapter 5 OFDM. OFDM: Overview. OFDM Applications. Dr.Prapun Suksompong prapun.com/ecs455
ECS455: Chapter 5 OFDM OFDM: Overview Let S = (S 1, S 2,, S ) contains the information symbols. S IFFT FFT Inverse fast Fourier transform Fast Fourier transform 1 Dr.Prapun Suksompong prapun.com/ecs455
More informationCell Switch Off Technique Combined with Coordinated Multi-Point (CoMP) Transmission for Energy Efficiency in Beyond-LTE Cellular Networks
Cell Switch Off Technique Combined with Coordinated Multi-Point (CoMP) Transmission for Energy Efficiency in Beyond-LTE Cellular Networks Gencer Cili, Halim Yanikomeroglu, and F. Richard Yu Department
More informationBurst Markers in EPoC Syed Rahman, Huawei Nicola Varanese, Qualcomm
Burst Markers in EPoC Syed Rahman, Huawei Nicola Varanese, Qualcomm Page 1 Introduction Burst markers are used to indicate the start and end of each burst in EPoC burst mode The same marker delimits the
More informationCSE140L: Components and Design Techniques for Digital Systems Lab. FSMs. Instructor: Mohsen Imani. Slides from Tajana Simunic Rosing
CSE140L: Components and Design Techniques for Digital Systems Lab FSMs Instructor: Mohsen Imani Slides from Tajana Simunic Rosing Source: Vahid, Katz 1 FSM design example Moore vs. Mealy Remove one 1 from
More informationLECTURE 16 AND 17. Digital signaling on frequency selective fading channels. Notes Prepared by: Abhishek Sood
ECE559:WIRELESS COMMUNICATION TECHNOLOGIES LECTURE 16 AND 17 Digital signaling on frequency selective fading channels 1 OUTLINE Notes Prepared by: Abhishek Sood In section 2 we discuss the receiver design
More informationWelcome to EECS 150: Components and Design Techniques for Digital Systems
Welcome to EECS 150: Components and Design Techniques for Digital Systems Course staff Randy Katz (Instructor), Po-Kai Chen (Head TA) Teaching Assistants: Bryan Brady, Jay Chen, Brian Gawalt, Jack Tzeng
More informationItanium TM Processor Clock Design
Itanium TM Processor Design Utpal Desai 1, Simon Tam, Robert Kim, Ji Zhang, Stefan Rusu Intel Corporation, M/S SC12-502, 2200 Mission College Blvd, Santa Clara, CA 95052 ABSTRACT The Itanium processor
More informationENEE350 Lecture Notes-Weeks 14 and 15
Pipelining & Amdahl s Law ENEE350 Lecture Notes-Weeks 14 and 15 Pipelining is a method of processing in which a problem is divided into a number of sub problems and solved and the solu8ons of the sub problems
More informationNeuromorphic architectures: challenges and opportunites in the years to come
Neuromorphic architectures: challenges and opportunites in the years to come Andreas G. Andreou andreou@jhu.edu Electrical and Computer Engineering Center for Language and Speech Processing Johns Hopkins
More informationINF Models of concurrency
INF4140 - Models of concurrency RPC and Rendezvous INF4140 Lecture 15. Nov. 2017 RPC and Rendezvous Outline More on asynchronous message passing interacting processes with different patterns of communication
More informationScheduling of Frame-based Embedded Systems with Rechargeable Batteries
Scheduling of Frame-based Embedded Systems with Rechargeable Batteries André Allavena Computer Science Department Cornell University Ithaca, NY 14853 andre@cs.cornell.edu Daniel Mossé Department of Computer
More informationAn Implementation of an Address Generator Using Hash Memories
An Implementation of an Address Generator Using Memories Tsutomu Sasao and Munehiro Matsuura Department of Computer Science and Electronics, Kyushu Institute of Technology, Iizuka 820-8502, Japan Abstract
More informationConstruction of a reconfigurable dynamic logic cell
PRAMANA c Indian Academy of Sciences Vol. 64, No. 3 journal of March 2005 physics pp. 433 441 Construction of a reconfigurable dynamic logic cell K MURALI 1, SUDESHNA SINHA 2 and WILLIAM L DITTO 3 1 Department
More information