Hardware Acceleration of the Pair HMM Algorithm for DNA Variant Calling

Size: px
Start display at page:

Download "Hardware Acceleration of the Pair HMM Algorithm for DNA Variant Calling"

Transcription

1 Hardware Acceleration o the Pair H Algorithm or NA Variant Calling Sitao Huang, Gowthami Jayashri anikandan, Anand Ramachandran, Kyle Rupnow, Wen-mei W. Hwu, eming Chen University o llinois at Urbana-Champaign, USA Advanced igital Sciences Center, Singapore

2 Genomic Variation and utations Humans have two sets o billion bases in their genomes No two humans have identical genome sequences About 0. % o genomes are not identical These dierences lead to people Having dierent susceptibility or resistance to diseases Responding dierently to the same medication There are also somatic variations that lead to cancer

3 The mportance o utations and Variant Calling The study o mutations is important (e.g. in cancer study) They create cancer They enable cancer to survive They enable cancer to spread They enable cancer to kill Variant calling is a set o analytics that tries to identiy mutations in a sequenced genome compared to a standard reerence Variant Calling is critical in cancer research and clinical applications GATK's HaplotypeCaller is one o the most popular variant calling tools available today.

4 Accelerating the Pair H in GATK Why Pair H Needs to Be Accelerated? Pair H computations constitute the bottleneck o HaplotypeCaller The ull HaplotypeCaller is time consuming Full HaplotypeCaller run on 80xWGS PCR-Free NA878 dataset: days on single CPU Traversal + Assembly Genotyping 598s 79s (.86%) (6.7%) Pair-H (orward algorithm) 45s (70.4%) Why Using Hardware (FPGA)? Proiling result o a typical HaplotypeCaller run on CPU Parallelism in pair H could be better utilized by the ine-grained processing elements in FPGA FPGA is good at processing streaming applications (alignment algorithms nature)

5 Pair H (candidate to be veriied) (data rom sequencing machine) nput: two sequences SS h and SS rr (SS h : haplotype SS rr : read) Goal: ind a similarity score o SS h and SS rr SS rr GTAA SS h AGGTC One possible alignment: SS rr SS h {aa tt } --GTAA AGGTC- Another possible alignment: SS rr -G-TAA SS h AGGT-C {aa tt } There are many action sequences mapping SS rr to SS hh. Similarity score is deined over a pair Hidden arkov odel 4

6 Pair H ynamic Programming Similarly: ( i, j) = a ( i, j) = a Coeicients rom model ( i, j) = prior ( i, j ) + a ( i, j) + a ( i, j ) ( i, j) ( a ( i, j ) + a ( i, j ) + a ( i, j ) ) 0 Read Sequence C G G A Output: Complexity: ssssssssss SS h, SS rr = NN h, NN rr + NN h, NN rr + NN h, NN rr O ( h r Nh Nr ) # haplotype sequences # read sequences 5

7 How to Accelerate? : Processing Elements Process rontier elements at the same time to maximize parallelism Number o s Needed = atrix Height What i matrix height is larger than number o s FPGA can host? 6

8 Ring* Haplotype Sequence A G G T A C 0 Connects the irst and the last with ivide matrix rows to groups Processing Element () Ring: Read Sequence C G G G A A N * Zilong Wang, Sitao Huang, Lanjun Wang, Hao Li, Yu Wang, and Huazhong Yang. Accelerating subsequence similarity search based on dynamic time warping distance with FPGA. n Proceedings o the AC/SGA international symposium on Field Programmable Gate Arrays, pages 5 6. AC, 0. 7

9 Ring* Connects the irst and the last with ivide matrix rows to groups Processing Element () Ring: Haplotype Sequence A G G T A C 0 Read Sequence C G G G A A 4 Busy N dle * Zilong Wang, Sitao Huang, Lanjun Wang, Hao Li, Yu Wang, and Huazhong Yang. Accelerating subsequence similarity search based on dynamic time warping distance with FPGA. n Proceedings o the AC/SGA international symposium on Field Programmable Gate Arrays, pages 5 6. AC, 0. 8

10 Ring* Connects the irst and the last with ivide matrix rows to groups Haplotype Sequence A G G T A C 0 Read Sequence C G G G A A 4 Processing Element () Ring: N * Zilong Wang, Sitao Huang, Lanjun Wang, Hao Li, Yu Wang, and Huazhong Yang. Accelerating subsequence similarity search based on dynamic time warping distance with FPGA. n Proceedings o the AC/SGA international symposium on Field Programmable Gate Arrays, pages 5 6. AC, 0. 8

11 Ring* Connects the irst and the last with ivide matrix rows to groups Haplotype Sequence A G G T A C 0 Read Sequence C G G G A A 4 Processing Element () Ring: N * Zilong Wang, Sitao Huang, Lanjun Wang, Hao Li, Yu Wang, and Huazhong Yang. Accelerating subsequence similarity search based on dynamic time warping distance with FPGA. n Proceedings o the AC/SGA international symposium on Field Programmable Gate Arrays, pages 5 6. AC, 0. 8

12 Ring* Connects the irst and the last with ivide matrix rows to groups Processing Element () Ring: Haplotype Sequence A G G T A C 4 0 Read Sequence C G G G A A N * Zilong Wang, Sitao Huang, Lanjun Wang, Hao Li, Yu Wang, and Huazhong Yang. Accelerating subsequence similarity search based on dynamic time warping distance with FPGA. n Proceedings o the AC/SGA international symposium on Field Programmable Gate Arrays, pages 5 6. AC, 0. 8

13 Ring* Connects the irst and the last with ivide matrix rows to groups Processing Element () Ring: Haplotype Sequence A G G T A C 4 0 Read Sequence C G G G A A N * Zilong Wang, Sitao Huang, Lanjun Wang, Hao Li, Yu Wang, and Huazhong Yang. Accelerating subsequence similarity search based on dynamic time warping distance with FPGA. n Proceedings o the AC/SGA international symposium on Field Programmable Gate Arrays, pages 5 6. AC, 0. 8

14 Ring* Connects the irst and the last with ivide matrix rows to groups Processing Element () Ring: Haplotype Sequence A G G T A C 4 0 Read Sequence C G G G A A N * Zilong Wang, Sitao Huang, Lanjun Wang, Hao Li, Yu Wang, and Huazhong Yang. Accelerating subsequence similarity search based on dynamic time warping distance with FPGA. n Proceedings o the AC/SGA international symposium on Field Programmable Gate Arrays, pages 5 6. AC, 0. 8

15 Ring* Haplotype Sequence A G G T A C 0 Connects the irst and the last with ivide matrix rows to groups 4 Read Sequence C G G G A A Processing Element () Ring: N * Zilong Wang, Sitao Huang, Lanjun Wang, Hao Li, Yu Wang, and Huazhong Yang. Accelerating subsequence similarity search based on dynamic time warping distance with FPGA. n Proceedings o the AC/SGA international symposium on Field Programmable Gate Arrays, pages 5 6. AC, 0. 8

16 Ring* Haplotype Sequence A G G T A C 0 Connects the irst and the last with ivide matrix rows to groups Processing Element () Ring: 4 Read Sequence C G G G A A N * Zilong Wang, Sitao Huang, Lanjun Wang, Hao Li, Yu Wang, and Huazhong Yang. Accelerating subsequence similarity search based on dynamic time warping distance with FPGA. n Proceedings o the AC/SGA international symposium on Field Programmable Gate Arrays, pages 5 6. AC, 0. 8

17 Ring* Haplotype Sequence A G G T A C 0 Connects the irst and the last with ivide matrix rows to groups Processing Element () Ring: 4 Read Sequence C G G G A A N * Zilong Wang, Sitao Huang, Lanjun Wang, Hao Li, Yu Wang, and Huazhong Yang. Accelerating subsequence similarity search based on dynamic time warping distance with FPGA. n Proceedings o the AC/SGA international symposium on Field Programmable Gate Arrays, pages 5 6. AC, 0. 8

18 Ring* Haplotype Sequence A G G T A C 0 Connects the irst and the last with ivide matrix rows to groups Processing Element () Ring: Read Sequence C G G G A A N 4 * Zilong Wang, Sitao Huang, Lanjun Wang, Hao Li, Yu Wang, and Huazhong Yang. Accelerating subsequence similarity search based on dynamic time warping distance with FPGA. n Proceedings o the AC/SGA international symposium on Field Programmable Gate Arrays, pages 5 6. AC, 0. 8

19 Ring* Haplotype Sequence A G G T A C 0 Connects the irst and the last with ivide matrix rows to groups Processing Element () Ring: Read Sequence C G G G A A N 4 * Zilong Wang, Sitao Huang, Lanjun Wang, Hao Li, Yu Wang, and Huazhong Yang. Accelerating subsequence similarity search based on dynamic time warping distance with FPGA. n Proceedings o the AC/SGA international symposium on Field Programmable Gate Arrays, pages 5 6. AC, 0. 8

20 Ring* Haplotype Sequence A G G T A C 0 Connects the irst and the last with ivide matrix rows to groups Processing Element () Ring: Read Sequence C G G G A A N 4 * Zilong Wang, Sitao Huang, Lanjun Wang, Hao Li, Yu Wang, and Huazhong Yang. Accelerating subsequence similarity search based on dynamic time warping distance with FPGA. n Proceedings o the AC/SGA international symposium on Field Programmable Gate Arrays, pages 5 6. AC, 0. 8

21 Ring* Haplotype Sequence A G G T A C 0 Connects the irst and the last with ivide matrix rows to groups Processing Element () Ring: N 4 Read Sequence C G G G A A * Zilong Wang, Sitao Huang, Lanjun Wang, Hao Li, Yu Wang, and Huazhong Yang. Accelerating subsequence similarity search based on dynamic time warping distance with FPGA. n Proceedings o the AC/SGA international symposium on Field Programmable Gate Arrays, pages 5 6. AC, 0. 8

22 Ring* Haplotype Sequence A G G T A C 0 Connects the irst and the last with ivide matrix rows to groups Processing Element () Ring: N * Zilong Wang, Sitao Huang, Lanjun Wang, Hao Li, Yu Wang, and Huazhong Yang. Accelerating subsequence similarity search based on dynamic time warping distance with FPGA. n Proceedings o the AC/SGA international symposium on Field Programmable Gate Arrays, pages 5 6. AC, 0. 4 Read Sequence C G G G A A 8

23 Challenges in esigning or Pair H v ( i- ) v ( i- ) structure is designed according to the data dependencies in the algorithm Each passes its intermediate computing result to the next v( i ) v ( i ) 9

24 Challenges in esigning or Pair H Cont. Floating point operations Long latency Need sophisticated FS Complicated arithmetic operations in P Elements in three P matrices depend on each other ( i, j) = a ( i, j) = a ( i, j) = prior ( i, j ) + a ( i, j) + a ( i, j ) ( i, j) i, j + i, j adm + amm i, j prior i, j ami i, j + aii i, j (i-, j) i, j (i, j) amd i, j add Arithmetic Operations Within a (Original) ( a ( i, j ) + a ( i, j ) + a ( i, j ) ) (i-, j-) (i, j-) + i, j i, j 0

25 Optimization : Shorten critical path in arithmetic operations (i-, j) (i-, j-) Using dierent input operands i, j + i, j adm a t i, j amm b t i, j a t i, j + b t i, j i, j prior i, j i, j + i, j adm + amm i, j i, j prior ami i, j + aii i, j i, j (i, j) amd (i, j-) i, j add + i, j i, j Optimized Calculation Arithmetic Operations Within a (Original)

26 Optimization : Pipelining and resource sharing Hide the loating-point arithmetic operations latency mprove throughput

27 Optimization : Tuning ring size and number o rings Same amount o HW resource can accommodate more shorter rings (calculating multiple matrices) Shorter rings have ewer idle s Only /N dle s!

28 Experiment Result : Comparison to Other mplementations Compared to CPU, vector processor, GPU, multicore, previous FPGA implementations Using 0s dataset Arria 0 has more logic and SP resources. t also has hard loating-point SP block * * Theoretical runtime lower bound (assuming no idle ) or 64 s: 4.7ms * Altera. Accelerating genomics research with OpenCL and FPGAs, 06. 4

29 Experiment Result : mpact o Ring Size Shorter rings beneit rom higher utilization and smaller initialization overhead 8s/ring x 8 rings 6s/ring x 4 rings s/ring x rings 64s/ring x ring 5

30 Summary Pair H orward algorithm is computationintensive. t is the bottleneck o HaplotypeCaller. Ring-based hardware structure exhibits lexibility in coniguration and high data reuse. ring structure based pair H implementation can achieve signiicant speedup compared to the sotware implementation, and it also outperorms the published best hardware implementation. 6

31 HaplotypeCaller BACKUP SLES

32 Emission and Transition Probabilities BACKUP SLES Q base : Base Error Rate Q i : Base nsertion Probability Q d : Base eletion Probability Q g : Gap Continuation Penalty

33 What s in? (i-, j) (i-, j-) BACKUP SLES (i, j) (i, j-) i, j + i, j adm amm a t i, j + b t i, j i, j prior ami i, j + aii i, j amd i, j add + i, j a t i, j b t i, j i, j i, j i, j

34 Why sequence alignment? BACKUP SLES Comparing genes or regions rom dierent species to ind important regions determine unction uncover evolutionary orces Assembling ragments to sequence NA Compare individuals to looking or mutations

35 Problem Statement BACKUP SLES nput: two sequences SS h and SS rr (SS h : haplotype SS rr : read) Goal: ind a similarity score o SS h and SS rr (candidate to be veriied) (data rom sequencing machine) SS rr SS h GTAA AGGTC Similarity score is deined over a pair Hidden arkov odel

36 Pair H Action Sequence BACKUP SLES Action(elete, nsert, atch/ismatch) sequence {aa tt } s.t. SS rr {aa tt } SSh One possible alignment: SS rr SS h {aa tt } --GTAA AGGTC- Another possible alignment: SS rr -G-TAA SS h AGGT-C {aa tt } There are many action sequences mapping SS rr to SS hh.

37 Pair H - Probability BACKUP SLES Each action sequence is associated with a probability: PP aa tt = pp(aa tt aa tt ) tt atch/ ismatch

38 BACKUP SLES Similarity score ynamic Programming ssssssssss SS h, SS rr = PP aa tt = NN h, NN rr {aa tt } aa tt : SS h SSrr + NN h, NN rr + NN h, NN rr Last action: delete Last action: match / mismatch Last action: insert SS rr [0: jj ] SS h [0: ii] aa tt SS rr G?T?AA SS h AGGTA- {aa tt }????? ( i, j) = a + a + a ( i, j ) ( i, j ) ( i, j ) probability dependency (arkov)

39 Recursion Similarly: ( i, j) = a ( i, j) = a ( i, j) = prior ( i, j ) + a ( i, j) + a ( i, j ) ( i, j) ( a ( i, j ) + a ( i, j ) + a ( i, j ) ) BACKUP SLES 0 Read Sequence C G G A Output: Complexity: ssssssssss SS h, SS rr = NN h, NN rr + NN h, NN rr + NN h, NN rr O ( h r Nh Nr ) # haplotype sequences # read sequences

GPU Acceleration of Cutoff Pair Potentials for Molecular Modeling Applications

GPU Acceleration of Cutoff Pair Potentials for Molecular Modeling Applications GPU Acceleration of Cutoff Pair Potentials for Molecular Modeling Applications Christopher Rodrigues, David J. Hardy, John E. Stone, Klaus Schulten, Wen-Mei W. Hwu University of Illinois at Urbana-Champaign

More information

Parallel Numerics. Scope: Revise standard numerical methods considering parallel computations!

Parallel Numerics. Scope: Revise standard numerical methods considering parallel computations! Parallel Numerics Scope: Revise standard numerical methods considering parallel computations! Required knowledge: Numerics Parallel Programming Graphs Literature: Dongarra, Du, Sorensen, van der Vorst:

More information

Binary Multipliers. Reading: Study Chapter 3. The key trick of multiplication is memorizing a digit-to-digit table Everything else was just adding

Binary Multipliers. Reading: Study Chapter 3. The key trick of multiplication is memorizing a digit-to-digit table Everything else was just adding Binary Multipliers The key trick of multiplication is memorizing a digit-to-digit table Everything else was just adding 2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 2 2 4 6 8 2 4 6 8 3 3 6 9 2 5 8 2 24 27 4 4 8 2 6

More information

AH 2700A. Attenuator Pair Ratio for C vs Frequency. Option-E 50 Hz-20 khz Ultra-precision Capacitance/Loss Bridge

AH 2700A. Attenuator Pair Ratio for C vs Frequency. Option-E 50 Hz-20 khz Ultra-precision Capacitance/Loss Bridge 0 E ttenuator Pair Ratio or vs requency NEEN-ERLN 700 Option-E 0-0 k Ultra-precision apacitance/loss ridge ttenuator Ratio Pair Uncertainty o in ppm or ll Usable Pairs o Taps 0 0 0. 0. 0. 07/08/0 E E E

More information

Numerical Methods - Lecture 2. Numerical Methods. Lecture 2. Analysis of errors in numerical methods

Numerical Methods - Lecture 2. Numerical Methods. Lecture 2. Analysis of errors in numerical methods Numerical Methods - Lecture 1 Numerical Methods Lecture. Analysis o errors in numerical methods Numerical Methods - Lecture Why represent numbers in loating point ormat? Eample 1. How a number 56.78 can

More information

Word-length Optimization and Error Analysis of a Multivariate Gaussian Random Number Generator

Word-length Optimization and Error Analysis of a Multivariate Gaussian Random Number Generator Word-length Optimization and Error Analysis of a Multivariate Gaussian Random Number Generator Chalermpol Saiprasert, Christos-Savvas Bouganis and George A. Constantinides Department of Electrical & Electronic

More information

CHAPTER 1: INTRODUCTION. 1.1 Inverse Theory: What It Is and What It Does

CHAPTER 1: INTRODUCTION. 1.1 Inverse Theory: What It Is and What It Does Geosciences 567: CHAPTER (RR/GZ) CHAPTER : INTRODUCTION Inverse Theory: What It Is and What It Does Inverse theory, at least as I choose to deine it, is the ine art o estimating model parameters rom data

More information

Welcome to MCS 572. content and organization expectations of the course. definition and classification

Welcome to MCS 572. content and organization expectations of the course. definition and classification Welcome to MCS 572 1 About the Course content and organization expectations of the course 2 Supercomputing definition and classification 3 Measuring Performance speedup and efficiency Amdahl s Law Gustafson

More information

Introduction. Outline. CSC/ECE 574 Computer and Network Security. Secret Keys or Secret Algorithms? Secrets? (Cont d) Secret Key Cryptography

Introduction. Outline. CSC/ECE 574 Computer and Network Security. Secret Keys or Secret Algorithms? Secrets? (Cont d) Secret Key Cryptography Outline CSC/ECE 574 Computer and Network Security Introductory Remarks Feistel Cipher DES AES Topic 3.1 Secret Key Cryptography Algorithms CSC/ECE 574 Dr. Peng Ning 1 CSC/ECE 574 Dr. Peng Ning 2 Secret

More information

Practical Free-Start Collision Attacks on 76-step SHA-1

Practical Free-Start Collision Attacks on 76-step SHA-1 Practical Free-Start Collision Attacks on 76-step SHA-1 Inria and École polytechnique, France Nanyang Technological University, Singapore Joint work with Thomas Peyrin and Marc Stevens CWI, Amsterdam 2015

More information

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Jorge González-Domínguez*, Bertil Schmidt*, Jan C. Kässens**, Lars Wienbrandt** *Parallel and Distributed Architectures

More information

Accelerating linear algebra computations with hybrid GPU-multicore systems.

Accelerating linear algebra computations with hybrid GPU-multicore systems. Accelerating linear algebra computations with hybrid GPU-multicore systems. Marc Baboulin INRIA/Université Paris-Sud joint work with Jack Dongarra (University of Tennessee and Oak Ridge National Laboratory)

More information

Lecture 3, Performance

Lecture 3, Performance Lecture 3, Performance Repeating some definitions: CPI Clocks Per Instruction MHz megahertz, millions of cycles per second MIPS Millions of Instructions Per Second = MHz / CPI MOPS Millions of Operations

More information

Scoring functions. Talk Overview. Eran Eyal. Scoring functions what and why

Scoring functions. Talk Overview. Eran Eyal. Scoring functions what and why Scoring unctions Talk Overview Scoring unctions what and why Force ields based on approximation o molecular orces as we understand them Knowledge-based potentials let the data speak May 2011 Eran Eyal

More information

AP Calculus Notes: Unit 1 Limits & Continuity. Syllabus Objective: 1.1 The student will calculate limits using the basic limit theorems.

AP Calculus Notes: Unit 1 Limits & Continuity. Syllabus Objective: 1.1 The student will calculate limits using the basic limit theorems. Syllabus Objective:. The student will calculate its using the basic it theorems. LIMITS how the outputs o a unction behave as the inputs approach some value Finding a Limit Notation: The it as approaches

More information

Predictive Genome Analysis Using Partial DNA Sequencing Data

Predictive Genome Analysis Using Partial DNA Sequencing Data Predictive Genome Analysis Using Partial DNA Sequencing Data Nauman Ahmed, Koen Bertels and Zaid Al-Ars Computer Engineering Lab, Delft University of Technology, Delft, The Netherlands {n.ahmed, k.l.m.bertels,

More information

Compile-Time Dynamic Voltage Scaling Settings: Opportunities and Limits

Compile-Time Dynamic Voltage Scaling Settings: Opportunities and Limits Compile-Time Dynamic Voltage Scaling Settings: Opportunities and Limits Fen Xie Department o Electrical Engineering Princeton University Princeton, NJ xie@ee.princeton.edu Margaret Martonosi Department

More information

INF2270 Spring Philipp Häfliger. Lecture 8: Superscalar CPUs, Course Summary/Repetition (1/2)

INF2270 Spring Philipp Häfliger. Lecture 8: Superscalar CPUs, Course Summary/Repetition (1/2) INF2270 Spring 2010 Philipp Häfliger Summary/Repetition (1/2) content From Scalar to Superscalar Lecture Summary and Brief Repetition Binary numbers Boolean Algebra Combinational Logic Circuits Encoder/Decoder

More information

Algorithms in Bioinformatics FOUR Pairwise Sequence Alignment. Pairwise Sequence Alignment. Convention: DNA Sequences 5. Sequence Alignment

Algorithms in Bioinformatics FOUR Pairwise Sequence Alignment. Pairwise Sequence Alignment. Convention: DNA Sequences 5. Sequence Alignment Algorithms in Bioinformatics FOUR Sami Khuri Department of Computer Science San José State University Pairwise Sequence Alignment Homology Similarity Global string alignment Local string alignment Dot

More information

The achievable limits of operational modal analysis. * Siu-Kui Au 1)

The achievable limits of operational modal analysis. * Siu-Kui Au 1) The achievable limits o operational modal analysis * Siu-Kui Au 1) 1) Center or Engineering Dynamics and Institute or Risk and Uncertainty, University o Liverpool, Liverpool L69 3GH, United Kingdom 1)

More information

DNPU: An Energy-Efficient Deep Neural Network Processor with On-Chip Stereo Matching

DNPU: An Energy-Efficient Deep Neural Network Processor with On-Chip Stereo Matching DNPU: An Energy-Efficient Deep Neural Network Processor with On-hip Stereo atching Dongjoo Shin, Jinmook Lee, Jinsu Lee, Juhyoung Lee, and Hoi-Jun Yoo Semiconductor System Laboratory School of EE, KAIST

More information

2. Accelerated Computations

2. Accelerated Computations 2. Accelerated Computations 2.1. Bent Function Enumeration by a Circular Pipeline Implemented on an FPGA Stuart W. Schneider Jon T. Butler 2.1.1. Background A naive approach to encoding a plaintext message

More information

Fragment Processor. Textures

Fragment Processor. Textures Cg: A system for programming graph ic s hh ard ww are in a C-l ik e ll angu age U n i v e r s i t y o f T e x a s a t A u s t i n S t e v e n G v N V I D I A K u A y N V I D I A a n d S t a n f o r d U

More information

GPU acceleration of Newton s method for large systems of polynomial equations in double double and quad double arithmetic

GPU acceleration of Newton s method for large systems of polynomial equations in double double and quad double arithmetic GPU acceleration of Newton s method for large systems of polynomial equations in double double and quad double arithmetic Jan Verschelde joint work with Xiangcheng Yu University of Illinois at Chicago

More information

Improvement of Sparse Computation Application in Power System Short Circuit Study

Improvement of Sparse Computation Application in Power System Short Circuit Study Volume 44, Number 1, 2003 3 Improvement o Sparse Computation Application in Power System Short Circuit Study A. MEGA *, M. BELKACEMI * and J.M. KAUFFMANN ** * Research Laboratory LEB, L2ES Department o

More information

Lecture 3, Performance

Lecture 3, Performance Repeating some definitions: Lecture 3, Performance CPI MHz MIPS MOPS Clocks Per Instruction megahertz, millions of cycles per second Millions of Instructions Per Second = MHz / CPI Millions of Operations

More information

Biplots in Practice MICHAEL GREENACRE. Professor of Statistics at the Pompeu Fabra University. Chapter 6 Offprint

Biplots in Practice MICHAEL GREENACRE. Professor of Statistics at the Pompeu Fabra University. Chapter 6 Offprint Biplots in Practice MICHAEL GREENACRE Proessor o Statistics at the Pompeu Fabra University Chapter 6 Oprint Principal Component Analysis Biplots First published: September 010 ISBN: 978-84-93846-8-6 Supporting

More information

Local Alignment: Smith-Waterman algorithm

Local Alignment: Smith-Waterman algorithm Local Alignment: Smith-Waterman algorithm Example: a shared common domain of two protein sequences; extended sections of genomic DNA sequence. Sensitive to detect similarity in highly diverged sequences.

More information

Design and Optimal Configuration of Full-Duplex MAC Protocol for Cognitive Radio Networks Considering Self-Interference

Design and Optimal Configuration of Full-Duplex MAC Protocol for Cognitive Radio Networks Considering Self-Interference Received November 8, 015, accepted December 10, 015, date o publication December 17, 015, date o current version December 8, 015. Digital Object Identiier 10.1109/ACCE.015.509449 Design and Optimal Coniguration

More information

Asymptote. 2 Problems 2 Methods

Asymptote. 2 Problems 2 Methods Asymptote Problems Methods Problems Assume we have the ollowing transer unction which has a zero at =, a pole at = and a pole at =. We are going to look at two problems: problem is where >> and problem

More information

Optimizing Loop Operation and Dataflow in FPGA Acceleration of Deep Convolutional Neural Networks

Optimizing Loop Operation and Dataflow in FPGA Acceleration of Deep Convolutional Neural Networks 2017 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays Optimizing Loop Operation and Dataflow in FPGA Acceleration of Deep Convolutional Neural Networks Yufei Ma, Yu Cao, Sarma Vrudhula,

More information

Cost/Performance Tradeoff of n-select Square Root Implementations

Cost/Performance Tradeoff of n-select Square Root Implementations Australian Computer Science Communications, Vol.22, No.4, 2, pp.9 6, IEEE Comp. Society Press Cost/Performance Tradeoff of n-select Square Root Implementations Wanming Chu and Yamin Li Computer Architecture

More information

Pairwise & Multiple sequence alignments

Pairwise & Multiple sequence alignments Pairwise & Multiple sequence alignments Urmila Kulkarni-Kale Bioinformatics Centre 411 007 urmila@bioinfo.ernet.in Basis for Sequence comparison Theory of evolution: gene sequences have evolved/derived

More information

Special types of Riemann sums

Special types of Riemann sums Roberto s Notes on Subject Chapter 4: Deinite integrals and the FTC Section 3 Special types o Riemann sums What you need to know already: What a Riemann sum is. What you can learn here: The key types o

More information

9. Datapath Design. Jacob Abraham. Department of Electrical and Computer Engineering The University of Texas at Austin VLSI Design Fall 2017

9. Datapath Design. Jacob Abraham. Department of Electrical and Computer Engineering The University of Texas at Austin VLSI Design Fall 2017 9. Datapath Design Jacob Abraham Department of Electrical and Computer Engineering The University of Texas at Austin VLSI Design Fall 2017 October 2, 2017 ECE Department, University of Texas at Austin

More information

Sequence Alignment (chapter 6)

Sequence Alignment (chapter 6) Sequence lignment (chapter 6) he biological problem lobal alignment Local alignment Multiple alignment Introduction to bioinformatics, utumn 6 Background: comparative genomics Basic question in biology:

More information

Origami: Folding Warps for Energy Efficient GPUs

Origami: Folding Warps for Energy Efficient GPUs Origami: Folding Warps for Energy Efficient GPUs Mohammad Abdel-Majeed*, Daniel Wong, Justin Huang and Murali Annavaram* * University of Southern alifornia University of alifornia, Riverside Stanford University

More information

Review D: Potential Energy and the Conservation of Mechanical Energy

Review D: Potential Energy and the Conservation of Mechanical Energy MSSCHUSETTS INSTITUTE OF TECHNOLOGY Department o Physics 8. Spring 4 Review D: Potential Energy and the Conservation o Mechanical Energy D.1 Conservative and Non-conservative Force... D.1.1 Introduction...

More information

Example: When describing where a function is increasing, decreasing or constant we use the x- axis values.

Example: When describing where a function is increasing, decreasing or constant we use the x- axis values. Business Calculus Lecture Notes (also Calculus With Applications or Business Math II) Chapter 3 Applications o Derivatives 31 Increasing and Decreasing Functions Inormal Deinition: A unction is increasing

More information

Dynamic Scheduling for Work Agglomeration on Heterogeneous Clusters

Dynamic Scheduling for Work Agglomeration on Heterogeneous Clusters Dynamic Scheduling for Work Agglomeration on Heterogeneous Clusters Jonathan Lifflander, G. Carl Evans, Anshu Arya, Laxmikant Kale University of Illinois Urbana-Champaign May 25, 2012 Work is overdecomposed

More information

Practical Free-Start Collision Attacks on full SHA-1

Practical Free-Start Collision Attacks on full SHA-1 Practical Free-Start Collision Attacks on full SHA-1 Inria and École polytechnique, France Nanyang Technological University, Singapore Joint work with Thomas Peyrin and Marc Stevens Séminaire Cryptologie

More information

Statistical Machine Learning Methods for Biomedical Informatics II. Hidden Markov Model for Biological Sequences

Statistical Machine Learning Methods for Biomedical Informatics II. Hidden Markov Model for Biological Sequences Statistical Machine Learning Methods for Biomedical Informatics II. Hidden Markov Model for Biological Sequences Jianlin Cheng, PhD William and Nancy Thompson Missouri Distinguished Professor Department

More information

EEO 401 Digital Signal Processing Prof. Mark Fowler

EEO 401 Digital Signal Processing Prof. Mark Fowler EEO 401 Digital Signal Processing Pro. Mark Fowler Note Set #10 Fourier Analysis or DT Signals eading Assignment: Sect. 4.2 & 4.4 o Proakis & Manolakis Much o Ch. 4 should be review so you are expected

More information

Genotype Imputation. Biostatistics 666

Genotype Imputation. Biostatistics 666 Genotype Imputation Biostatistics 666 Previously Hidden Markov Models for Relative Pairs Linkage analysis using affected sibling pairs Estimation of pairwise relationships Identity-by-Descent Relatives

More information

Tools and Algorithms in Bioinformatics

Tools and Algorithms in Bioinformatics Tools and Algorithms in Bioinformatics GCBA815, Fall 2015 Week-4 BLAST Algorithm Continued Multiple Sequence Alignment Babu Guda, Ph.D. Department of Genetics, Cell Biology & Anatomy Bioinformatics and

More information

Algebraic Dynamic Programming. Dynamic Programming, Old Country Style

Algebraic Dynamic Programming. Dynamic Programming, Old Country Style Algebraic Dynamic Programming Session 2 Dynamic Programming, Old Country Style Robert Giegerich (Lecture) Stefan Janssen (Exercises) Faculty of Technology Summer 2013 http://www.techfak.uni-bielefeld.de/ags/pi/lehre/adp

More information

Pairwise sequence alignment and pair hidden Markov models

Pairwise sequence alignment and pair hidden Markov models Pairwise sequence alignment and pair hidden Markov models Martin C. Frith April 13, 2012 ntroduction Pairwise alignment and pair hidden Markov models (phmms) are basic textbook fare [2]. However, there

More information

Chapter 6 Reliability-based design and code developments

Chapter 6 Reliability-based design and code developments Chapter 6 Reliability-based design and code developments 6. General Reliability technology has become a powerul tool or the design engineer and is widely employed in practice. Structural reliability analysis

More information

Introduction to Simulation - Lecture 2. Equation Formulation Methods. Jacob White. Thanks to Deepak Ramaswamy, Michal Rewienski, and Karen Veroy

Introduction to Simulation - Lecture 2. Equation Formulation Methods. Jacob White. Thanks to Deepak Ramaswamy, Michal Rewienski, and Karen Veroy Introduction to Simulation - Lecture Equation Formulation Methods Jacob White Thanks to Deepak Ramaswamy, Michal Rewienski, and Karen Veroy Outline Formulating Equations rom Schematics Struts and Joints

More information

FPGA Implementation of a Predictive Controller

FPGA Implementation of a Predictive Controller FPGA Implementation of a Predictive Controller SIAM Conference on Optimization 2011, Darmstadt, Germany Minisymposium on embedded optimization Juan L. Jerez, George A. Constantinides and Eric C. Kerrigan

More information

PAM-1 Matrix 10,000. From: Ala Arg Asn Asp Cys Gln Glu To:

PAM-1 Matrix 10,000. From: Ala Arg Asn Asp Cys Gln Glu To: 119-1 atrix 10,000 rom: la rg sn sp ys ln lu o: la 9867 2 9 10 3 8 17 rg 1 9913 1 0 1 10 0 sn 4 1 9822 36 0 4 6 sp 6 0 42 9859 0 6 53 ys 1 1 0 0 9973 0 0 ln 3 9 4 5 0 9876 27 lu 10 0 7 56 0 35 9865 120

More information

Sequence analysis and comparison

Sequence analysis and comparison The aim with sequence identification: Sequence analysis and comparison Marjolein Thunnissen Lund September 2012 Is there any known protein sequence that is homologous to mine? Are there any other species

More information

EECS730: Introduction to Bioinformatics

EECS730: Introduction to Bioinformatics EECS730: Introduction to Bioinformatics Lecture 03: Edit distance and sequence alignment Slides adapted from Dr. Shaojie Zhang (University of Central Florida) KUMC visit How many of you would like to attend

More information

ABHELSINKI UNIVERSITY OF TECHNOLOGY

ABHELSINKI UNIVERSITY OF TECHNOLOGY On Repeated Squarings in Binary Fields Kimmo Järvinen Helsinki University of Technology August 14, 2009 K. Järvinen On Repeated Squarings in Binary Fields 1/1 Introduction Repeated squaring Repeated squaring:

More information

Thu June 16 Lecture Notes: Lattice Exercises I

Thu June 16 Lecture Notes: Lattice Exercises I Thu June 6 ecture Notes: attice Exercises I T. Satogata: June USPAS Accelerator Physics Most o these notes ollow the treatment in the class text, Conte and MacKay, Chapter 6 on attice Exercises. The portions

More information

An Ensemble Kalman Smoother for Nonlinear Dynamics

An Ensemble Kalman Smoother for Nonlinear Dynamics 1852 MONTHLY WEATHER REVIEW VOLUME 128 An Ensemble Kalman Smoother or Nonlinear Dynamics GEIR EVENSEN Nansen Environmental and Remote Sensing Center, Bergen, Norway PETER JAN VAN LEEUWEN Institute or Marine

More information

An Improved Expression for a Classical Type of Explicit Approximation of the Colebrook White Equation with Only One Internal Iteration

An Improved Expression for a Classical Type of Explicit Approximation of the Colebrook White Equation with Only One Internal Iteration International Journal o Hydraulic Engineering 06, 5(): 9-3 DOI: 0.593/j.ijhe.06050.03 An Improved Expression or a Classical Type o Explicit Approximation o the Colebrook White Equation with Only One Internal

More information

Moreover, the circular logic

Moreover, the circular logic Moreover, the circular logic How do we know what is the right distance without a good alignment? And how do we construct a good alignment without knowing what substitutions were made previously? ATGCGT--GCAAGT

More information

COMPARISON OF THERMAL CHARACTERISTICS BETWEEN THE PLATE-FIN AND PIN-FIN HEAT SINKS IN NATURAL CONVECTION

COMPARISON OF THERMAL CHARACTERISTICS BETWEEN THE PLATE-FIN AND PIN-FIN HEAT SINKS IN NATURAL CONVECTION HEFAT014 10 th International Conerence on Heat Transer, Fluid Mechanics and Thermodynamics 14 6 July 014 Orlando, Florida COMPARISON OF THERMA CHARACTERISTICS BETWEEN THE PATE-FIN AND PIN-FIN HEAT SINKS

More information

Caching Policy Optimization for Video on Demand

Caching Policy Optimization for Video on Demand Caching Policy Optimization or Video on Demand Hao Wang, Shengqian Han, and Chenyang Yang School o Electronics and Inormation Engineering, Beihang University, China, 100191 Email: {wang.hao, sqhan, cyyang}@buaa.edu.cn

More information

Novel Devices and Circuits for Computing

Novel Devices and Circuits for Computing Novel Devices and Circuits for Computing UCSB 594BB Winter 2013 Lecture 4: Resistive switching: Logic Class Outline Material Implication logic Stochastic computing Reconfigurable logic Material Implication

More information

Whole Genome Alignments and Synteny Maps

Whole Genome Alignments and Synteny Maps Whole Genome Alignments and Synteny Maps IINTRODUCTION It was not until closely related organism genomes have been sequenced that people start to think about aligning genomes and chromosomes instead of

More information

A method to determine relative stroke detection efficiencies from multiplicity distributions

A method to determine relative stroke detection efficiencies from multiplicity distributions A ethod to deterine relative stroke detection eiciencies ro ultiplicity distributions Schulz W. and Cuins K. 2. Austrian Lightning Detection and Inoration Syste (ALDIS), Kahlenberger Str.2A, 90 Vienna,

More information

Biochemistry 324 Bioinformatics. Pairwise sequence alignment

Biochemistry 324 Bioinformatics. Pairwise sequence alignment Biochemistry 324 Bioinformatics Pairwise sequence alignment How do we compare genes/proteins? When we have sequenced a genome, we try and identify the function of unknown genes by finding a similar gene

More information

Least-Squares Spectral Analysis Theory Summary

Least-Squares Spectral Analysis Theory Summary Least-Squares Spectral Analysis Theory Summary Reerence: Mtamakaya, J. D. (2012). Assessment o Atmospheric Pressure Loading on the International GNSS REPRO1 Solutions Periodic Signatures. Ph.D. dissertation,

More information

Sara C. Madeira. Universidade da Beira Interior. (Thanks to Ana Teresa Freitas, IST for useful resources on this subject)

Sara C. Madeira. Universidade da Beira Interior. (Thanks to Ana Teresa Freitas, IST for useful resources on this subject) Bioinformática Sequence Alignment Pairwise Sequence Alignment Universidade da Beira Interior (Thanks to Ana Teresa Freitas, IST for useful resources on this subject) 1 16/3/29 & 23/3/29 27/4/29 Outline

More information

1. Cyclone III Device Family Overview

1. Cyclone III Device Family Overview 1. Cyclone III Device Family Overview CIII51001-1.3 Lowest System-Cost FPGAs The Cyclone III device amily oered by Altera is a cost-optimized, memory-rich FPGA amily. Cyclone III FPGAs are built on Taiwan

More information

Sparse LU Factorization on GPUs for Accelerating SPICE Simulation

Sparse LU Factorization on GPUs for Accelerating SPICE Simulation Nano-scale Integrated Circuit and System (NICS) Laboratory Sparse LU Factorization on GPUs for Accelerating SPICE Simulation Xiaoming Chen PhD Candidate Department of Electronic Engineering Tsinghua University,

More information

Background: comparative genomics. Sequence similarity. Homologs. Similarity vs homology (2) Similarity vs homology. Sequence Alignment (chapter 6)

Background: comparative genomics. Sequence similarity. Homologs. Similarity vs homology (2) Similarity vs homology. Sequence Alignment (chapter 6) Sequence lignment (chapter ) he biological problem lobal alignment Local alignment Multiple alignment Background: comparative genomics Basic question in biology: what properties are shared among organisms?

More information

Ultra Fast Calculation of Temperature Profiles of VLSI ICs in Thermal Packages Considering Parameter Variations

Ultra Fast Calculation of Temperature Profiles of VLSI ICs in Thermal Packages Considering Parameter Variations Ultra Fast Calculation o Temperature Proiles o VLSI ICs in Thermal Packages Considering Parameter Variations Je-Hyoung Park, Virginia Martín Hériz, Ali Shakouri, and Sung-Mo Kang Dept. o Electrical Engineering,

More information

Matrix Computations: Direct Methods II. May 5, 2014 Lecture 11

Matrix Computations: Direct Methods II. May 5, 2014 Lecture 11 Matrix Computations: Direct Methods II May 5, 2014 ecture Summary You have seen an example of how a typical matrix operation (an important one) can be reduced to using lower level BS routines that would

More information

Bioinformatics (GLOBEX, Summer 2015) Pairwise sequence alignment

Bioinformatics (GLOBEX, Summer 2015) Pairwise sequence alignment Bioinformatics (GLOBEX, Summer 2015) Pairwise sequence alignment Substitution score matrices, PAM, BLOSUM Needleman-Wunsch algorithm (Global) Smith-Waterman algorithm (Local) BLAST (local, heuristic) E-value

More information

11 Parallel programming models

11 Parallel programming models 237 // Program Design 10.3 Assessing parallel programs 11 Parallel programming models Many different models for expressing parallelism in programming languages Actor model Erlang Scala Coordination languages

More information

TESTING TIMED FINITE STATE MACHINES WITH GUARANTEED FAULT COVERAGE

TESTING TIMED FINITE STATE MACHINES WITH GUARANTEED FAULT COVERAGE TESTING TIMED FINITE STATE MACHINES WITH GUARANTEED FAULT COVERAGE Khaled El-Fakih 1, Nina Yevtushenko 2 *, Hacene Fouchal 3 1 American University o Sharjah, PO Box 26666, UAE kelakih@aus.edu 2 Tomsk State

More information

Lecture 23: Illusiveness of Parallel Performance. James C. Hoe Department of ECE Carnegie Mellon University

Lecture 23: Illusiveness of Parallel Performance. James C. Hoe Department of ECE Carnegie Mellon University 18 447 Lecture 23: Illusiveness of Parallel Performance James C. Hoe Department of ECE Carnegie Mellon University 18 447 S18 L23 S1, James C. Hoe, CMU/ECE/CALCM, 2018 Your goal today Housekeeping peel

More information

A Simple Architectural Enhancement for Fast and Flexible Elliptic Curve Cryptography over Binary Finite Fields GF(2 m )

A Simple Architectural Enhancement for Fast and Flexible Elliptic Curve Cryptography over Binary Finite Fields GF(2 m ) A Simple Architectural Enhancement for Fast and Flexible Elliptic Curve Cryptography over Binary Finite Fields GF(2 m ) Stefan Tillich, Johann Großschädl Institute for Applied Information Processing and

More information

Micro-canonical ensemble model of particles obeying Bose-Einstein and Fermi-Dirac statistics

Micro-canonical ensemble model of particles obeying Bose-Einstein and Fermi-Dirac statistics Indian Journal o Pure & Applied Physics Vol. 4, October 004, pp. 749-757 Micro-canonical ensemble model o particles obeying Bose-Einstein and Fermi-Dirac statistics Y K Ayodo, K M Khanna & T W Sakwa Department

More information

Driving Point Impedance Computation Applying Parallel Processing Techniques

Driving Point Impedance Computation Applying Parallel Processing Techniques 7th WSEAS Int. Con. on MATHEMATICAL METHODS and COMPUTATIONAL TECHNIQUES IN ELECTRICAL ENGINEERING, Soia, 27-29/10/05 (pp229-234) Driving Point Impedance Computation Applying Parallel Processing Techniques

More information

Pairwise Alignment. Guan-Shieng Huang. Dept. of CSIE, NCNU. Pairwise Alignment p.1/55

Pairwise Alignment. Guan-Shieng Huang. Dept. of CSIE, NCNU. Pairwise Alignment p.1/55 Pairwise Alignment Guan-Shieng Huang shieng@ncnu.edu.tw Dept. of CSIE, NCNU Pairwise Alignment p.1/55 Approach 1. Problem definition 2. Computational method (algorithms) 3. Complexity and performance Pairwise

More information

OPTIMAL PLACEMENT AND UTILIZATION OF PHASOR MEASUREMENTS FOR STATE ESTIMATION

OPTIMAL PLACEMENT AND UTILIZATION OF PHASOR MEASUREMENTS FOR STATE ESTIMATION OPTIMAL PLACEMENT AND UTILIZATION OF PHASOR MEASUREMENTS FOR STATE ESTIMATION Xu Bei, Yeo Jun Yoon and Ali Abur Teas A&M University College Station, Teas, U.S.A. abur@ee.tamu.edu Abstract This paper presents

More information

The effect of strong purifying selection on genetic diversity

The effect of strong purifying selection on genetic diversity Genetics: Early Online, published on May 9, 08 as 0.534/genetics.8.30058 The eect o strong puriying selection on genetic diversity Ivana Cvijović,4, Benjamin H. Good,4, and Michael M. Desai,3,4 Department

More information

Protein Bioinformatics. Rickard Sandberg Dept. of Cell and Molecular Biology Karolinska Institutet sandberg.cmb.ki.

Protein Bioinformatics. Rickard Sandberg Dept. of Cell and Molecular Biology Karolinska Institutet sandberg.cmb.ki. Protein Bioinformatics Rickard Sandberg Dept. of Cell and Molecular Biology Karolinska Institutet rickard.sandberg@ki.se sandberg.cmb.ki.se Outline Protein features motifs patterns profiles signals 2 Protein

More information

3.5 Analysis of Members under Flexure (Part IV)

3.5 Analysis of Members under Flexure (Part IV) 3.5 Analysis o Members under Flexure (Part IV) This section covers the ollowing topics. Analysis o a Flanged Section 3.5.1 Analysis o a Flanged Section Introduction A beam can have langes or lexural eiciency.

More information

Reduced-Area Constant-Coefficient and Multiple-Constant Multipliers for Xilinx FPGAs with 6-Input LUTs

Reduced-Area Constant-Coefficient and Multiple-Constant Multipliers for Xilinx FPGAs with 6-Input LUTs Article Reduced-Area Constant-Coefficient and Multiple-Constant Multipliers for Xilinx FPGAs with 6-Input LUTs E. George Walters III Department of Electrical and Computer Engineering, Penn State Erie,

More information

CS 361 Meeting 28 11/14/18

CS 361 Meeting 28 11/14/18 CS 361 Meeting 28 11/14/18 Announcements 1. Homework 9 due Friday Computation Histories 1. Some very interesting proos o undecidability rely on the technique o constructing a language that describes the

More information

Statistical Machine Learning Methods for Bioinformatics II. Hidden Markov Model for Biological Sequences

Statistical Machine Learning Methods for Bioinformatics II. Hidden Markov Model for Biological Sequences Statistical Machine Learning Methods for Bioinformatics II. Hidden Markov Model for Biological Sequences Jianlin Cheng, PhD Department of Computer Science University of Missouri 2008 Free for Academic

More information

An Algorithm for a Two-Disk Fault-Tolerant Array with (Prime 1) Disks

An Algorithm for a Two-Disk Fault-Tolerant Array with (Prime 1) Disks An Algorithm for a Two-Disk Fault-Tolerant Array with (Prime 1) Disks Sanjeeb Nanda and Narsingh Deo School of Computer Science University of Central Florida Orlando, Florida 32816-2362 sanjeeb@earthlink.net,

More information

SP-CNN: A Scalable and Programmable CNN-based Accelerator. Dilan Manatunga Dr. Hyesoon Kim Dr. Saibal Mukhopadhyay

SP-CNN: A Scalable and Programmable CNN-based Accelerator. Dilan Manatunga Dr. Hyesoon Kim Dr. Saibal Mukhopadhyay SP-CNN: A Scalable and Programmable CNN-based Accelerator Dilan Manatunga Dr. Hyesoon Kim Dr. Saibal Mukhopadhyay Motivation Power is a first-order design constraint, especially for embedded devices. Certain

More information

Outcomes. Spiral 1 / Unit 3. The Problem SYNTHESIZING LOGIC FUNCTIONS

Outcomes. Spiral 1 / Unit 3. The Problem SYNTHESIZING LOGIC FUNCTIONS -3. -3.2 Outcomes Spiral / Unit 3 Minterm and Materms Canonical Sums and Products 2 and 3 Variable oolean lgebra Theorems emorgan's Theorem unction Snthesis use Canonical Sums/Products Mark Redekopp I

More information

Multivariate Gaussian Random Number Generator Targeting Specific Resource Utilization in an FPGA

Multivariate Gaussian Random Number Generator Targeting Specific Resource Utilization in an FPGA Multivariate Gaussian Random Number Generator Targeting Specific Resource Utilization in an FPGA Chalermpol Saiprasert, Christos-Savvas Bouganis and George A. Constantinides Department of Electrical &

More information

A Faster Randomized Algorithm for Root Counting in Prime Power Rings

A Faster Randomized Algorithm for Root Counting in Prime Power Rings A Faster Randomized Algorithm or Root Counting in Prime Power Rings Leann Kopp and Natalie Randall July 17, 018 Abstract Let p be a prime and Z[x] a polynomial o degree d such that is not identically zero

More information

Comptes rendus de l Academie bulgare des Sciences, Tome 59, 4, 2006, p POSITIVE DEFINITE RANDOM MATRICES. Evelina Veleva

Comptes rendus de l Academie bulgare des Sciences, Tome 59, 4, 2006, p POSITIVE DEFINITE RANDOM MATRICES. Evelina Veleva Comtes rendus de l Academie bulgare des ciences Tome 59 4 6 353 36 POITIVE DEFINITE RANDOM MATRICE Evelina Veleva Abstract: The aer begins with necessary and suicient conditions or ositive deiniteness

More information

Analysis and Design of Algorithms Dynamic Programming

Analysis and Design of Algorithms Dynamic Programming Analysis and Design of Algorithms Dynamic Programming Lecture Notes by Dr. Wang, Rui Fall 2008 Department of Computer Science Ocean University of China November 6, 2009 Introduction 2 Introduction..................................................................

More information

Accelerating Quantum Chromodynamics Calculations with GPUs

Accelerating Quantum Chromodynamics Calculations with GPUs Accelerating Quantum Chromodynamics Calculations with GPUs Guochun Shi, Steven Gottlieb, Aaron Torok, Volodymyr Kindratenko NCSA & Indiana University National Center for Supercomputing Applications University

More information

BeiHang Short Course, Part 7: HW Acceleration: It s about Performance, Energy and Power

BeiHang Short Course, Part 7: HW Acceleration: It s about Performance, Energy and Power BeiHang Short Course, Part 7: HW Acceleration: It s about Performance, Energy and Power James C. Hoe Department of ECE Carnegie Mellon niversity Eric S. Chung, et al., Single chip Heterogeneous Computing:

More information

An Algorithmic Framework of Large-Scale Circuit Simulation Using Exponential Integrators

An Algorithmic Framework of Large-Scale Circuit Simulation Using Exponential Integrators An Algorithmic Framework of Large-Scale Circuit Simulation Using Exponential Integrators Hao Zhuang 1, Wenjian Yu 2, Ilgweon Kang 1, Xinan Wang 1, and Chung-Kuan Cheng 1 1. University of California, San

More information

Non-newtonian Rabinowitsch Fluid Effects on the Lubrication Performances of Sine Film Thrust Bearings

Non-newtonian Rabinowitsch Fluid Effects on the Lubrication Performances of Sine Film Thrust Bearings International Journal o Mechanical Engineering and Applications 7; 5(): 6-67 http://www.sciencepublishinggroup.com/j/ijmea doi:.648/j.ijmea.75.4 ISSN: -X (Print); ISSN: -48 (Online) Non-newtonian Rabinowitsch

More information

The Derivation of a Drag Coefficient Formula from Velocity-Voidage Correlations

The Derivation of a Drag Coefficient Formula from Velocity-Voidage Correlations 1 The Derivation o a Drag Coeicient Formula rom Velocity-Voidage Correlations By M. Syamlal EG&G, T.S.W.V, Inc. P.O. Box 880 Morgantown, West Virginia 6507-0880 T.J. O Brien U.S. Department o Energy Morgantown

More information

Evolution. CT Amemiya et al. Nature 496, (2013) doi: /nature12027

Evolution. CT Amemiya et al. Nature 496, (2013) doi: /nature12027 Sequence Alignment Evolution CT Amemiya et al. Nature 496, 311-316 (2013) doi:10.1038/nature12027 Evolutionary Rates next generation OK OK OK X X Still OK? Sequence conservation implies function Alignment

More information

PERFORMANCE METRICS. Mahdi Nazm Bojnordi. CS/ECE 6810: Computer Architecture. Assistant Professor School of Computing University of Utah

PERFORMANCE METRICS. Mahdi Nazm Bojnordi. CS/ECE 6810: Computer Architecture. Assistant Professor School of Computing University of Utah PERFORMANCE METRICS Mahdi Nazm Bojnordi Assistant Professor School of Computing University of Utah CS/ECE 6810: Computer Architecture Overview Announcement Jan. 17 th : Homework 1 release (due on Jan.

More information