Hardware Acceleration of the Pair HMM Algorithm for DNA Variant Calling
|
|
- Anthony Owen
- 6 years ago
- Views:
Transcription
1 Hardware Acceleration o the Pair H Algorithm or NA Variant Calling Sitao Huang, Gowthami Jayashri anikandan, Anand Ramachandran, Kyle Rupnow, Wen-mei W. Hwu, eming Chen University o llinois at Urbana-Champaign, USA Advanced igital Sciences Center, Singapore
2 Genomic Variation and utations Humans have two sets o billion bases in their genomes No two humans have identical genome sequences About 0. % o genomes are not identical These dierences lead to people Having dierent susceptibility or resistance to diseases Responding dierently to the same medication There are also somatic variations that lead to cancer
3 The mportance o utations and Variant Calling The study o mutations is important (e.g. in cancer study) They create cancer They enable cancer to survive They enable cancer to spread They enable cancer to kill Variant calling is a set o analytics that tries to identiy mutations in a sequenced genome compared to a standard reerence Variant Calling is critical in cancer research and clinical applications GATK's HaplotypeCaller is one o the most popular variant calling tools available today.
4 Accelerating the Pair H in GATK Why Pair H Needs to Be Accelerated? Pair H computations constitute the bottleneck o HaplotypeCaller The ull HaplotypeCaller is time consuming Full HaplotypeCaller run on 80xWGS PCR-Free NA878 dataset: days on single CPU Traversal + Assembly Genotyping 598s 79s (.86%) (6.7%) Pair-H (orward algorithm) 45s (70.4%) Why Using Hardware (FPGA)? Proiling result o a typical HaplotypeCaller run on CPU Parallelism in pair H could be better utilized by the ine-grained processing elements in FPGA FPGA is good at processing streaming applications (alignment algorithms nature)
5 Pair H (candidate to be veriied) (data rom sequencing machine) nput: two sequences SS h and SS rr (SS h : haplotype SS rr : read) Goal: ind a similarity score o SS h and SS rr SS rr GTAA SS h AGGTC One possible alignment: SS rr SS h {aa tt } --GTAA AGGTC- Another possible alignment: SS rr -G-TAA SS h AGGT-C {aa tt } There are many action sequences mapping SS rr to SS hh. Similarity score is deined over a pair Hidden arkov odel 4
6 Pair H ynamic Programming Similarly: ( i, j) = a ( i, j) = a Coeicients rom model ( i, j) = prior ( i, j ) + a ( i, j) + a ( i, j ) ( i, j) ( a ( i, j ) + a ( i, j ) + a ( i, j ) ) 0 Read Sequence C G G A Output: Complexity: ssssssssss SS h, SS rr = NN h, NN rr + NN h, NN rr + NN h, NN rr O ( h r Nh Nr ) # haplotype sequences # read sequences 5
7 How to Accelerate? : Processing Elements Process rontier elements at the same time to maximize parallelism Number o s Needed = atrix Height What i matrix height is larger than number o s FPGA can host? 6
8 Ring* Haplotype Sequence A G G T A C 0 Connects the irst and the last with ivide matrix rows to groups Processing Element () Ring: Read Sequence C G G G A A N * Zilong Wang, Sitao Huang, Lanjun Wang, Hao Li, Yu Wang, and Huazhong Yang. Accelerating subsequence similarity search based on dynamic time warping distance with FPGA. n Proceedings o the AC/SGA international symposium on Field Programmable Gate Arrays, pages 5 6. AC, 0. 7
9 Ring* Connects the irst and the last with ivide matrix rows to groups Processing Element () Ring: Haplotype Sequence A G G T A C 0 Read Sequence C G G G A A 4 Busy N dle * Zilong Wang, Sitao Huang, Lanjun Wang, Hao Li, Yu Wang, and Huazhong Yang. Accelerating subsequence similarity search based on dynamic time warping distance with FPGA. n Proceedings o the AC/SGA international symposium on Field Programmable Gate Arrays, pages 5 6. AC, 0. 8
10 Ring* Connects the irst and the last with ivide matrix rows to groups Haplotype Sequence A G G T A C 0 Read Sequence C G G G A A 4 Processing Element () Ring: N * Zilong Wang, Sitao Huang, Lanjun Wang, Hao Li, Yu Wang, and Huazhong Yang. Accelerating subsequence similarity search based on dynamic time warping distance with FPGA. n Proceedings o the AC/SGA international symposium on Field Programmable Gate Arrays, pages 5 6. AC, 0. 8
11 Ring* Connects the irst and the last with ivide matrix rows to groups Haplotype Sequence A G G T A C 0 Read Sequence C G G G A A 4 Processing Element () Ring: N * Zilong Wang, Sitao Huang, Lanjun Wang, Hao Li, Yu Wang, and Huazhong Yang. Accelerating subsequence similarity search based on dynamic time warping distance with FPGA. n Proceedings o the AC/SGA international symposium on Field Programmable Gate Arrays, pages 5 6. AC, 0. 8
12 Ring* Connects the irst and the last with ivide matrix rows to groups Processing Element () Ring: Haplotype Sequence A G G T A C 4 0 Read Sequence C G G G A A N * Zilong Wang, Sitao Huang, Lanjun Wang, Hao Li, Yu Wang, and Huazhong Yang. Accelerating subsequence similarity search based on dynamic time warping distance with FPGA. n Proceedings o the AC/SGA international symposium on Field Programmable Gate Arrays, pages 5 6. AC, 0. 8
13 Ring* Connects the irst and the last with ivide matrix rows to groups Processing Element () Ring: Haplotype Sequence A G G T A C 4 0 Read Sequence C G G G A A N * Zilong Wang, Sitao Huang, Lanjun Wang, Hao Li, Yu Wang, and Huazhong Yang. Accelerating subsequence similarity search based on dynamic time warping distance with FPGA. n Proceedings o the AC/SGA international symposium on Field Programmable Gate Arrays, pages 5 6. AC, 0. 8
14 Ring* Connects the irst and the last with ivide matrix rows to groups Processing Element () Ring: Haplotype Sequence A G G T A C 4 0 Read Sequence C G G G A A N * Zilong Wang, Sitao Huang, Lanjun Wang, Hao Li, Yu Wang, and Huazhong Yang. Accelerating subsequence similarity search based on dynamic time warping distance with FPGA. n Proceedings o the AC/SGA international symposium on Field Programmable Gate Arrays, pages 5 6. AC, 0. 8
15 Ring* Haplotype Sequence A G G T A C 0 Connects the irst and the last with ivide matrix rows to groups 4 Read Sequence C G G G A A Processing Element () Ring: N * Zilong Wang, Sitao Huang, Lanjun Wang, Hao Li, Yu Wang, and Huazhong Yang. Accelerating subsequence similarity search based on dynamic time warping distance with FPGA. n Proceedings o the AC/SGA international symposium on Field Programmable Gate Arrays, pages 5 6. AC, 0. 8
16 Ring* Haplotype Sequence A G G T A C 0 Connects the irst and the last with ivide matrix rows to groups Processing Element () Ring: 4 Read Sequence C G G G A A N * Zilong Wang, Sitao Huang, Lanjun Wang, Hao Li, Yu Wang, and Huazhong Yang. Accelerating subsequence similarity search based on dynamic time warping distance with FPGA. n Proceedings o the AC/SGA international symposium on Field Programmable Gate Arrays, pages 5 6. AC, 0. 8
17 Ring* Haplotype Sequence A G G T A C 0 Connects the irst and the last with ivide matrix rows to groups Processing Element () Ring: 4 Read Sequence C G G G A A N * Zilong Wang, Sitao Huang, Lanjun Wang, Hao Li, Yu Wang, and Huazhong Yang. Accelerating subsequence similarity search based on dynamic time warping distance with FPGA. n Proceedings o the AC/SGA international symposium on Field Programmable Gate Arrays, pages 5 6. AC, 0. 8
18 Ring* Haplotype Sequence A G G T A C 0 Connects the irst and the last with ivide matrix rows to groups Processing Element () Ring: Read Sequence C G G G A A N 4 * Zilong Wang, Sitao Huang, Lanjun Wang, Hao Li, Yu Wang, and Huazhong Yang. Accelerating subsequence similarity search based on dynamic time warping distance with FPGA. n Proceedings o the AC/SGA international symposium on Field Programmable Gate Arrays, pages 5 6. AC, 0. 8
19 Ring* Haplotype Sequence A G G T A C 0 Connects the irst and the last with ivide matrix rows to groups Processing Element () Ring: Read Sequence C G G G A A N 4 * Zilong Wang, Sitao Huang, Lanjun Wang, Hao Li, Yu Wang, and Huazhong Yang. Accelerating subsequence similarity search based on dynamic time warping distance with FPGA. n Proceedings o the AC/SGA international symposium on Field Programmable Gate Arrays, pages 5 6. AC, 0. 8
20 Ring* Haplotype Sequence A G G T A C 0 Connects the irst and the last with ivide matrix rows to groups Processing Element () Ring: Read Sequence C G G G A A N 4 * Zilong Wang, Sitao Huang, Lanjun Wang, Hao Li, Yu Wang, and Huazhong Yang. Accelerating subsequence similarity search based on dynamic time warping distance with FPGA. n Proceedings o the AC/SGA international symposium on Field Programmable Gate Arrays, pages 5 6. AC, 0. 8
21 Ring* Haplotype Sequence A G G T A C 0 Connects the irst and the last with ivide matrix rows to groups Processing Element () Ring: N 4 Read Sequence C G G G A A * Zilong Wang, Sitao Huang, Lanjun Wang, Hao Li, Yu Wang, and Huazhong Yang. Accelerating subsequence similarity search based on dynamic time warping distance with FPGA. n Proceedings o the AC/SGA international symposium on Field Programmable Gate Arrays, pages 5 6. AC, 0. 8
22 Ring* Haplotype Sequence A G G T A C 0 Connects the irst and the last with ivide matrix rows to groups Processing Element () Ring: N * Zilong Wang, Sitao Huang, Lanjun Wang, Hao Li, Yu Wang, and Huazhong Yang. Accelerating subsequence similarity search based on dynamic time warping distance with FPGA. n Proceedings o the AC/SGA international symposium on Field Programmable Gate Arrays, pages 5 6. AC, 0. 4 Read Sequence C G G G A A 8
23 Challenges in esigning or Pair H v ( i- ) v ( i- ) structure is designed according to the data dependencies in the algorithm Each passes its intermediate computing result to the next v( i ) v ( i ) 9
24 Challenges in esigning or Pair H Cont. Floating point operations Long latency Need sophisticated FS Complicated arithmetic operations in P Elements in three P matrices depend on each other ( i, j) = a ( i, j) = a ( i, j) = prior ( i, j ) + a ( i, j) + a ( i, j ) ( i, j) i, j + i, j adm + amm i, j prior i, j ami i, j + aii i, j (i-, j) i, j (i, j) amd i, j add Arithmetic Operations Within a (Original) ( a ( i, j ) + a ( i, j ) + a ( i, j ) ) (i-, j-) (i, j-) + i, j i, j 0
25 Optimization : Shorten critical path in arithmetic operations (i-, j) (i-, j-) Using dierent input operands i, j + i, j adm a t i, j amm b t i, j a t i, j + b t i, j i, j prior i, j i, j + i, j adm + amm i, j i, j prior ami i, j + aii i, j i, j (i, j) amd (i, j-) i, j add + i, j i, j Optimized Calculation Arithmetic Operations Within a (Original)
26 Optimization : Pipelining and resource sharing Hide the loating-point arithmetic operations latency mprove throughput
27 Optimization : Tuning ring size and number o rings Same amount o HW resource can accommodate more shorter rings (calculating multiple matrices) Shorter rings have ewer idle s Only /N dle s!
28 Experiment Result : Comparison to Other mplementations Compared to CPU, vector processor, GPU, multicore, previous FPGA implementations Using 0s dataset Arria 0 has more logic and SP resources. t also has hard loating-point SP block * * Theoretical runtime lower bound (assuming no idle ) or 64 s: 4.7ms * Altera. Accelerating genomics research with OpenCL and FPGAs, 06. 4
29 Experiment Result : mpact o Ring Size Shorter rings beneit rom higher utilization and smaller initialization overhead 8s/ring x 8 rings 6s/ring x 4 rings s/ring x rings 64s/ring x ring 5
30 Summary Pair H orward algorithm is computationintensive. t is the bottleneck o HaplotypeCaller. Ring-based hardware structure exhibits lexibility in coniguration and high data reuse. ring structure based pair H implementation can achieve signiicant speedup compared to the sotware implementation, and it also outperorms the published best hardware implementation. 6
31 HaplotypeCaller BACKUP SLES
32 Emission and Transition Probabilities BACKUP SLES Q base : Base Error Rate Q i : Base nsertion Probability Q d : Base eletion Probability Q g : Gap Continuation Penalty
33 What s in? (i-, j) (i-, j-) BACKUP SLES (i, j) (i, j-) i, j + i, j adm amm a t i, j + b t i, j i, j prior ami i, j + aii i, j amd i, j add + i, j a t i, j b t i, j i, j i, j i, j
34 Why sequence alignment? BACKUP SLES Comparing genes or regions rom dierent species to ind important regions determine unction uncover evolutionary orces Assembling ragments to sequence NA Compare individuals to looking or mutations
35 Problem Statement BACKUP SLES nput: two sequences SS h and SS rr (SS h : haplotype SS rr : read) Goal: ind a similarity score o SS h and SS rr (candidate to be veriied) (data rom sequencing machine) SS rr SS h GTAA AGGTC Similarity score is deined over a pair Hidden arkov odel
36 Pair H Action Sequence BACKUP SLES Action(elete, nsert, atch/ismatch) sequence {aa tt } s.t. SS rr {aa tt } SSh One possible alignment: SS rr SS h {aa tt } --GTAA AGGTC- Another possible alignment: SS rr -G-TAA SS h AGGT-C {aa tt } There are many action sequences mapping SS rr to SS hh.
37 Pair H - Probability BACKUP SLES Each action sequence is associated with a probability: PP aa tt = pp(aa tt aa tt ) tt atch/ ismatch
38 BACKUP SLES Similarity score ynamic Programming ssssssssss SS h, SS rr = PP aa tt = NN h, NN rr {aa tt } aa tt : SS h SSrr + NN h, NN rr + NN h, NN rr Last action: delete Last action: match / mismatch Last action: insert SS rr [0: jj ] SS h [0: ii] aa tt SS rr G?T?AA SS h AGGTA- {aa tt }????? ( i, j) = a + a + a ( i, j ) ( i, j ) ( i, j ) probability dependency (arkov)
39 Recursion Similarly: ( i, j) = a ( i, j) = a ( i, j) = prior ( i, j ) + a ( i, j) + a ( i, j ) ( i, j) ( a ( i, j ) + a ( i, j ) + a ( i, j ) ) BACKUP SLES 0 Read Sequence C G G A Output: Complexity: ssssssssss SS h, SS rr = NN h, NN rr + NN h, NN rr + NN h, NN rr O ( h r Nh Nr ) # haplotype sequences # read sequences
GPU Acceleration of Cutoff Pair Potentials for Molecular Modeling Applications
GPU Acceleration of Cutoff Pair Potentials for Molecular Modeling Applications Christopher Rodrigues, David J. Hardy, John E. Stone, Klaus Schulten, Wen-Mei W. Hwu University of Illinois at Urbana-Champaign
More informationParallel Numerics. Scope: Revise standard numerical methods considering parallel computations!
Parallel Numerics Scope: Revise standard numerical methods considering parallel computations! Required knowledge: Numerics Parallel Programming Graphs Literature: Dongarra, Du, Sorensen, van der Vorst:
More informationBinary Multipliers. Reading: Study Chapter 3. The key trick of multiplication is memorizing a digit-to-digit table Everything else was just adding
Binary Multipliers The key trick of multiplication is memorizing a digit-to-digit table Everything else was just adding 2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 2 2 4 6 8 2 4 6 8 3 3 6 9 2 5 8 2 24 27 4 4 8 2 6
More informationAH 2700A. Attenuator Pair Ratio for C vs Frequency. Option-E 50 Hz-20 khz Ultra-precision Capacitance/Loss Bridge
0 E ttenuator Pair Ratio or vs requency NEEN-ERLN 700 Option-E 0-0 k Ultra-precision apacitance/loss ridge ttenuator Ratio Pair Uncertainty o in ppm or ll Usable Pairs o Taps 0 0 0. 0. 0. 07/08/0 E E E
More informationNumerical Methods - Lecture 2. Numerical Methods. Lecture 2. Analysis of errors in numerical methods
Numerical Methods - Lecture 1 Numerical Methods Lecture. Analysis o errors in numerical methods Numerical Methods - Lecture Why represent numbers in loating point ormat? Eample 1. How a number 56.78 can
More informationWord-length Optimization and Error Analysis of a Multivariate Gaussian Random Number Generator
Word-length Optimization and Error Analysis of a Multivariate Gaussian Random Number Generator Chalermpol Saiprasert, Christos-Savvas Bouganis and George A. Constantinides Department of Electrical & Electronic
More informationCHAPTER 1: INTRODUCTION. 1.1 Inverse Theory: What It Is and What It Does
Geosciences 567: CHAPTER (RR/GZ) CHAPTER : INTRODUCTION Inverse Theory: What It Is and What It Does Inverse theory, at least as I choose to deine it, is the ine art o estimating model parameters rom data
More informationWelcome to MCS 572. content and organization expectations of the course. definition and classification
Welcome to MCS 572 1 About the Course content and organization expectations of the course 2 Supercomputing definition and classification 3 Measuring Performance speedup and efficiency Amdahl s Law Gustafson
More informationIntroduction. Outline. CSC/ECE 574 Computer and Network Security. Secret Keys or Secret Algorithms? Secrets? (Cont d) Secret Key Cryptography
Outline CSC/ECE 574 Computer and Network Security Introductory Remarks Feistel Cipher DES AES Topic 3.1 Secret Key Cryptography Algorithms CSC/ECE 574 Dr. Peng Ning 1 CSC/ECE 574 Dr. Peng Ning 2 Secret
More informationPractical Free-Start Collision Attacks on 76-step SHA-1
Practical Free-Start Collision Attacks on 76-step SHA-1 Inria and École polytechnique, France Nanyang Technological University, Singapore Joint work with Thomas Peyrin and Marc Stevens CWI, Amsterdam 2015
More informationHybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS
Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS Jorge González-Domínguez*, Bertil Schmidt*, Jan C. Kässens**, Lars Wienbrandt** *Parallel and Distributed Architectures
More informationAccelerating linear algebra computations with hybrid GPU-multicore systems.
Accelerating linear algebra computations with hybrid GPU-multicore systems. Marc Baboulin INRIA/Université Paris-Sud joint work with Jack Dongarra (University of Tennessee and Oak Ridge National Laboratory)
More informationLecture 3, Performance
Lecture 3, Performance Repeating some definitions: CPI Clocks Per Instruction MHz megahertz, millions of cycles per second MIPS Millions of Instructions Per Second = MHz / CPI MOPS Millions of Operations
More informationScoring functions. Talk Overview. Eran Eyal. Scoring functions what and why
Scoring unctions Talk Overview Scoring unctions what and why Force ields based on approximation o molecular orces as we understand them Knowledge-based potentials let the data speak May 2011 Eran Eyal
More informationAP Calculus Notes: Unit 1 Limits & Continuity. Syllabus Objective: 1.1 The student will calculate limits using the basic limit theorems.
Syllabus Objective:. The student will calculate its using the basic it theorems. LIMITS how the outputs o a unction behave as the inputs approach some value Finding a Limit Notation: The it as approaches
More informationPredictive Genome Analysis Using Partial DNA Sequencing Data
Predictive Genome Analysis Using Partial DNA Sequencing Data Nauman Ahmed, Koen Bertels and Zaid Al-Ars Computer Engineering Lab, Delft University of Technology, Delft, The Netherlands {n.ahmed, k.l.m.bertels,
More informationCompile-Time Dynamic Voltage Scaling Settings: Opportunities and Limits
Compile-Time Dynamic Voltage Scaling Settings: Opportunities and Limits Fen Xie Department o Electrical Engineering Princeton University Princeton, NJ xie@ee.princeton.edu Margaret Martonosi Department
More informationINF2270 Spring Philipp Häfliger. Lecture 8: Superscalar CPUs, Course Summary/Repetition (1/2)
INF2270 Spring 2010 Philipp Häfliger Summary/Repetition (1/2) content From Scalar to Superscalar Lecture Summary and Brief Repetition Binary numbers Boolean Algebra Combinational Logic Circuits Encoder/Decoder
More informationAlgorithms in Bioinformatics FOUR Pairwise Sequence Alignment. Pairwise Sequence Alignment. Convention: DNA Sequences 5. Sequence Alignment
Algorithms in Bioinformatics FOUR Sami Khuri Department of Computer Science San José State University Pairwise Sequence Alignment Homology Similarity Global string alignment Local string alignment Dot
More informationThe achievable limits of operational modal analysis. * Siu-Kui Au 1)
The achievable limits o operational modal analysis * Siu-Kui Au 1) 1) Center or Engineering Dynamics and Institute or Risk and Uncertainty, University o Liverpool, Liverpool L69 3GH, United Kingdom 1)
More informationDNPU: An Energy-Efficient Deep Neural Network Processor with On-Chip Stereo Matching
DNPU: An Energy-Efficient Deep Neural Network Processor with On-hip Stereo atching Dongjoo Shin, Jinmook Lee, Jinsu Lee, Juhyoung Lee, and Hoi-Jun Yoo Semiconductor System Laboratory School of EE, KAIST
More information2. Accelerated Computations
2. Accelerated Computations 2.1. Bent Function Enumeration by a Circular Pipeline Implemented on an FPGA Stuart W. Schneider Jon T. Butler 2.1.1. Background A naive approach to encoding a plaintext message
More informationFragment Processor. Textures
Cg: A system for programming graph ic s hh ard ww are in a C-l ik e ll angu age U n i v e r s i t y o f T e x a s a t A u s t i n S t e v e n G v N V I D I A K u A y N V I D I A a n d S t a n f o r d U
More informationGPU acceleration of Newton s method for large systems of polynomial equations in double double and quad double arithmetic
GPU acceleration of Newton s method for large systems of polynomial equations in double double and quad double arithmetic Jan Verschelde joint work with Xiangcheng Yu University of Illinois at Chicago
More informationImprovement of Sparse Computation Application in Power System Short Circuit Study
Volume 44, Number 1, 2003 3 Improvement o Sparse Computation Application in Power System Short Circuit Study A. MEGA *, M. BELKACEMI * and J.M. KAUFFMANN ** * Research Laboratory LEB, L2ES Department o
More informationLecture 3, Performance
Repeating some definitions: Lecture 3, Performance CPI MHz MIPS MOPS Clocks Per Instruction megahertz, millions of cycles per second Millions of Instructions Per Second = MHz / CPI Millions of Operations
More informationBiplots in Practice MICHAEL GREENACRE. Professor of Statistics at the Pompeu Fabra University. Chapter 6 Offprint
Biplots in Practice MICHAEL GREENACRE Proessor o Statistics at the Pompeu Fabra University Chapter 6 Oprint Principal Component Analysis Biplots First published: September 010 ISBN: 978-84-93846-8-6 Supporting
More informationLocal Alignment: Smith-Waterman algorithm
Local Alignment: Smith-Waterman algorithm Example: a shared common domain of two protein sequences; extended sections of genomic DNA sequence. Sensitive to detect similarity in highly diverged sequences.
More informationDesign and Optimal Configuration of Full-Duplex MAC Protocol for Cognitive Radio Networks Considering Self-Interference
Received November 8, 015, accepted December 10, 015, date o publication December 17, 015, date o current version December 8, 015. Digital Object Identiier 10.1109/ACCE.015.509449 Design and Optimal Coniguration
More informationAsymptote. 2 Problems 2 Methods
Asymptote Problems Methods Problems Assume we have the ollowing transer unction which has a zero at =, a pole at = and a pole at =. We are going to look at two problems: problem is where >> and problem
More informationOptimizing Loop Operation and Dataflow in FPGA Acceleration of Deep Convolutional Neural Networks
2017 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays Optimizing Loop Operation and Dataflow in FPGA Acceleration of Deep Convolutional Neural Networks Yufei Ma, Yu Cao, Sarma Vrudhula,
More informationCost/Performance Tradeoff of n-select Square Root Implementations
Australian Computer Science Communications, Vol.22, No.4, 2, pp.9 6, IEEE Comp. Society Press Cost/Performance Tradeoff of n-select Square Root Implementations Wanming Chu and Yamin Li Computer Architecture
More informationPairwise & Multiple sequence alignments
Pairwise & Multiple sequence alignments Urmila Kulkarni-Kale Bioinformatics Centre 411 007 urmila@bioinfo.ernet.in Basis for Sequence comparison Theory of evolution: gene sequences have evolved/derived
More informationSpecial types of Riemann sums
Roberto s Notes on Subject Chapter 4: Deinite integrals and the FTC Section 3 Special types o Riemann sums What you need to know already: What a Riemann sum is. What you can learn here: The key types o
More information9. Datapath Design. Jacob Abraham. Department of Electrical and Computer Engineering The University of Texas at Austin VLSI Design Fall 2017
9. Datapath Design Jacob Abraham Department of Electrical and Computer Engineering The University of Texas at Austin VLSI Design Fall 2017 October 2, 2017 ECE Department, University of Texas at Austin
More informationSequence Alignment (chapter 6)
Sequence lignment (chapter 6) he biological problem lobal alignment Local alignment Multiple alignment Introduction to bioinformatics, utumn 6 Background: comparative genomics Basic question in biology:
More informationOrigami: Folding Warps for Energy Efficient GPUs
Origami: Folding Warps for Energy Efficient GPUs Mohammad Abdel-Majeed*, Daniel Wong, Justin Huang and Murali Annavaram* * University of Southern alifornia University of alifornia, Riverside Stanford University
More informationReview D: Potential Energy and the Conservation of Mechanical Energy
MSSCHUSETTS INSTITUTE OF TECHNOLOGY Department o Physics 8. Spring 4 Review D: Potential Energy and the Conservation o Mechanical Energy D.1 Conservative and Non-conservative Force... D.1.1 Introduction...
More informationExample: When describing where a function is increasing, decreasing or constant we use the x- axis values.
Business Calculus Lecture Notes (also Calculus With Applications or Business Math II) Chapter 3 Applications o Derivatives 31 Increasing and Decreasing Functions Inormal Deinition: A unction is increasing
More informationDynamic Scheduling for Work Agglomeration on Heterogeneous Clusters
Dynamic Scheduling for Work Agglomeration on Heterogeneous Clusters Jonathan Lifflander, G. Carl Evans, Anshu Arya, Laxmikant Kale University of Illinois Urbana-Champaign May 25, 2012 Work is overdecomposed
More informationPractical Free-Start Collision Attacks on full SHA-1
Practical Free-Start Collision Attacks on full SHA-1 Inria and École polytechnique, France Nanyang Technological University, Singapore Joint work with Thomas Peyrin and Marc Stevens Séminaire Cryptologie
More informationStatistical Machine Learning Methods for Biomedical Informatics II. Hidden Markov Model for Biological Sequences
Statistical Machine Learning Methods for Biomedical Informatics II. Hidden Markov Model for Biological Sequences Jianlin Cheng, PhD William and Nancy Thompson Missouri Distinguished Professor Department
More informationEEO 401 Digital Signal Processing Prof. Mark Fowler
EEO 401 Digital Signal Processing Pro. Mark Fowler Note Set #10 Fourier Analysis or DT Signals eading Assignment: Sect. 4.2 & 4.4 o Proakis & Manolakis Much o Ch. 4 should be review so you are expected
More informationGenotype Imputation. Biostatistics 666
Genotype Imputation Biostatistics 666 Previously Hidden Markov Models for Relative Pairs Linkage analysis using affected sibling pairs Estimation of pairwise relationships Identity-by-Descent Relatives
More informationTools and Algorithms in Bioinformatics
Tools and Algorithms in Bioinformatics GCBA815, Fall 2015 Week-4 BLAST Algorithm Continued Multiple Sequence Alignment Babu Guda, Ph.D. Department of Genetics, Cell Biology & Anatomy Bioinformatics and
More informationAlgebraic Dynamic Programming. Dynamic Programming, Old Country Style
Algebraic Dynamic Programming Session 2 Dynamic Programming, Old Country Style Robert Giegerich (Lecture) Stefan Janssen (Exercises) Faculty of Technology Summer 2013 http://www.techfak.uni-bielefeld.de/ags/pi/lehre/adp
More informationPairwise sequence alignment and pair hidden Markov models
Pairwise sequence alignment and pair hidden Markov models Martin C. Frith April 13, 2012 ntroduction Pairwise alignment and pair hidden Markov models (phmms) are basic textbook fare [2]. However, there
More informationChapter 6 Reliability-based design and code developments
Chapter 6 Reliability-based design and code developments 6. General Reliability technology has become a powerul tool or the design engineer and is widely employed in practice. Structural reliability analysis
More informationIntroduction to Simulation - Lecture 2. Equation Formulation Methods. Jacob White. Thanks to Deepak Ramaswamy, Michal Rewienski, and Karen Veroy
Introduction to Simulation - Lecture Equation Formulation Methods Jacob White Thanks to Deepak Ramaswamy, Michal Rewienski, and Karen Veroy Outline Formulating Equations rom Schematics Struts and Joints
More informationFPGA Implementation of a Predictive Controller
FPGA Implementation of a Predictive Controller SIAM Conference on Optimization 2011, Darmstadt, Germany Minisymposium on embedded optimization Juan L. Jerez, George A. Constantinides and Eric C. Kerrigan
More informationPAM-1 Matrix 10,000. From: Ala Arg Asn Asp Cys Gln Glu To:
119-1 atrix 10,000 rom: la rg sn sp ys ln lu o: la 9867 2 9 10 3 8 17 rg 1 9913 1 0 1 10 0 sn 4 1 9822 36 0 4 6 sp 6 0 42 9859 0 6 53 ys 1 1 0 0 9973 0 0 ln 3 9 4 5 0 9876 27 lu 10 0 7 56 0 35 9865 120
More informationSequence analysis and comparison
The aim with sequence identification: Sequence analysis and comparison Marjolein Thunnissen Lund September 2012 Is there any known protein sequence that is homologous to mine? Are there any other species
More informationEECS730: Introduction to Bioinformatics
EECS730: Introduction to Bioinformatics Lecture 03: Edit distance and sequence alignment Slides adapted from Dr. Shaojie Zhang (University of Central Florida) KUMC visit How many of you would like to attend
More informationABHELSINKI UNIVERSITY OF TECHNOLOGY
On Repeated Squarings in Binary Fields Kimmo Järvinen Helsinki University of Technology August 14, 2009 K. Järvinen On Repeated Squarings in Binary Fields 1/1 Introduction Repeated squaring Repeated squaring:
More informationThu June 16 Lecture Notes: Lattice Exercises I
Thu June 6 ecture Notes: attice Exercises I T. Satogata: June USPAS Accelerator Physics Most o these notes ollow the treatment in the class text, Conte and MacKay, Chapter 6 on attice Exercises. The portions
More informationAn Ensemble Kalman Smoother for Nonlinear Dynamics
1852 MONTHLY WEATHER REVIEW VOLUME 128 An Ensemble Kalman Smoother or Nonlinear Dynamics GEIR EVENSEN Nansen Environmental and Remote Sensing Center, Bergen, Norway PETER JAN VAN LEEUWEN Institute or Marine
More informationAn Improved Expression for a Classical Type of Explicit Approximation of the Colebrook White Equation with Only One Internal Iteration
International Journal o Hydraulic Engineering 06, 5(): 9-3 DOI: 0.593/j.ijhe.06050.03 An Improved Expression or a Classical Type o Explicit Approximation o the Colebrook White Equation with Only One Internal
More informationMoreover, the circular logic
Moreover, the circular logic How do we know what is the right distance without a good alignment? And how do we construct a good alignment without knowing what substitutions were made previously? ATGCGT--GCAAGT
More informationCOMPARISON OF THERMAL CHARACTERISTICS BETWEEN THE PLATE-FIN AND PIN-FIN HEAT SINKS IN NATURAL CONVECTION
HEFAT014 10 th International Conerence on Heat Transer, Fluid Mechanics and Thermodynamics 14 6 July 014 Orlando, Florida COMPARISON OF THERMA CHARACTERISTICS BETWEEN THE PATE-FIN AND PIN-FIN HEAT SINKS
More informationCaching Policy Optimization for Video on Demand
Caching Policy Optimization or Video on Demand Hao Wang, Shengqian Han, and Chenyang Yang School o Electronics and Inormation Engineering, Beihang University, China, 100191 Email: {wang.hao, sqhan, cyyang}@buaa.edu.cn
More informationNovel Devices and Circuits for Computing
Novel Devices and Circuits for Computing UCSB 594BB Winter 2013 Lecture 4: Resistive switching: Logic Class Outline Material Implication logic Stochastic computing Reconfigurable logic Material Implication
More informationWhole Genome Alignments and Synteny Maps
Whole Genome Alignments and Synteny Maps IINTRODUCTION It was not until closely related organism genomes have been sequenced that people start to think about aligning genomes and chromosomes instead of
More informationA method to determine relative stroke detection efficiencies from multiplicity distributions
A ethod to deterine relative stroke detection eiciencies ro ultiplicity distributions Schulz W. and Cuins K. 2. Austrian Lightning Detection and Inoration Syste (ALDIS), Kahlenberger Str.2A, 90 Vienna,
More informationBiochemistry 324 Bioinformatics. Pairwise sequence alignment
Biochemistry 324 Bioinformatics Pairwise sequence alignment How do we compare genes/proteins? When we have sequenced a genome, we try and identify the function of unknown genes by finding a similar gene
More informationLeast-Squares Spectral Analysis Theory Summary
Least-Squares Spectral Analysis Theory Summary Reerence: Mtamakaya, J. D. (2012). Assessment o Atmospheric Pressure Loading on the International GNSS REPRO1 Solutions Periodic Signatures. Ph.D. dissertation,
More informationSara C. Madeira. Universidade da Beira Interior. (Thanks to Ana Teresa Freitas, IST for useful resources on this subject)
Bioinformática Sequence Alignment Pairwise Sequence Alignment Universidade da Beira Interior (Thanks to Ana Teresa Freitas, IST for useful resources on this subject) 1 16/3/29 & 23/3/29 27/4/29 Outline
More information1. Cyclone III Device Family Overview
1. Cyclone III Device Family Overview CIII51001-1.3 Lowest System-Cost FPGAs The Cyclone III device amily oered by Altera is a cost-optimized, memory-rich FPGA amily. Cyclone III FPGAs are built on Taiwan
More informationSparse LU Factorization on GPUs for Accelerating SPICE Simulation
Nano-scale Integrated Circuit and System (NICS) Laboratory Sparse LU Factorization on GPUs for Accelerating SPICE Simulation Xiaoming Chen PhD Candidate Department of Electronic Engineering Tsinghua University,
More informationBackground: comparative genomics. Sequence similarity. Homologs. Similarity vs homology (2) Similarity vs homology. Sequence Alignment (chapter 6)
Sequence lignment (chapter ) he biological problem lobal alignment Local alignment Multiple alignment Background: comparative genomics Basic question in biology: what properties are shared among organisms?
More informationUltra Fast Calculation of Temperature Profiles of VLSI ICs in Thermal Packages Considering Parameter Variations
Ultra Fast Calculation o Temperature Proiles o VLSI ICs in Thermal Packages Considering Parameter Variations Je-Hyoung Park, Virginia Martín Hériz, Ali Shakouri, and Sung-Mo Kang Dept. o Electrical Engineering,
More informationMatrix Computations: Direct Methods II. May 5, 2014 Lecture 11
Matrix Computations: Direct Methods II May 5, 2014 ecture Summary You have seen an example of how a typical matrix operation (an important one) can be reduced to using lower level BS routines that would
More informationBioinformatics (GLOBEX, Summer 2015) Pairwise sequence alignment
Bioinformatics (GLOBEX, Summer 2015) Pairwise sequence alignment Substitution score matrices, PAM, BLOSUM Needleman-Wunsch algorithm (Global) Smith-Waterman algorithm (Local) BLAST (local, heuristic) E-value
More information11 Parallel programming models
237 // Program Design 10.3 Assessing parallel programs 11 Parallel programming models Many different models for expressing parallelism in programming languages Actor model Erlang Scala Coordination languages
More informationTESTING TIMED FINITE STATE MACHINES WITH GUARANTEED FAULT COVERAGE
TESTING TIMED FINITE STATE MACHINES WITH GUARANTEED FAULT COVERAGE Khaled El-Fakih 1, Nina Yevtushenko 2 *, Hacene Fouchal 3 1 American University o Sharjah, PO Box 26666, UAE kelakih@aus.edu 2 Tomsk State
More informationLecture 23: Illusiveness of Parallel Performance. James C. Hoe Department of ECE Carnegie Mellon University
18 447 Lecture 23: Illusiveness of Parallel Performance James C. Hoe Department of ECE Carnegie Mellon University 18 447 S18 L23 S1, James C. Hoe, CMU/ECE/CALCM, 2018 Your goal today Housekeeping peel
More informationA Simple Architectural Enhancement for Fast and Flexible Elliptic Curve Cryptography over Binary Finite Fields GF(2 m )
A Simple Architectural Enhancement for Fast and Flexible Elliptic Curve Cryptography over Binary Finite Fields GF(2 m ) Stefan Tillich, Johann Großschädl Institute for Applied Information Processing and
More informationMicro-canonical ensemble model of particles obeying Bose-Einstein and Fermi-Dirac statistics
Indian Journal o Pure & Applied Physics Vol. 4, October 004, pp. 749-757 Micro-canonical ensemble model o particles obeying Bose-Einstein and Fermi-Dirac statistics Y K Ayodo, K M Khanna & T W Sakwa Department
More informationDriving Point Impedance Computation Applying Parallel Processing Techniques
7th WSEAS Int. Con. on MATHEMATICAL METHODS and COMPUTATIONAL TECHNIQUES IN ELECTRICAL ENGINEERING, Soia, 27-29/10/05 (pp229-234) Driving Point Impedance Computation Applying Parallel Processing Techniques
More informationPairwise Alignment. Guan-Shieng Huang. Dept. of CSIE, NCNU. Pairwise Alignment p.1/55
Pairwise Alignment Guan-Shieng Huang shieng@ncnu.edu.tw Dept. of CSIE, NCNU Pairwise Alignment p.1/55 Approach 1. Problem definition 2. Computational method (algorithms) 3. Complexity and performance Pairwise
More informationOPTIMAL PLACEMENT AND UTILIZATION OF PHASOR MEASUREMENTS FOR STATE ESTIMATION
OPTIMAL PLACEMENT AND UTILIZATION OF PHASOR MEASUREMENTS FOR STATE ESTIMATION Xu Bei, Yeo Jun Yoon and Ali Abur Teas A&M University College Station, Teas, U.S.A. abur@ee.tamu.edu Abstract This paper presents
More informationThe effect of strong purifying selection on genetic diversity
Genetics: Early Online, published on May 9, 08 as 0.534/genetics.8.30058 The eect o strong puriying selection on genetic diversity Ivana Cvijović,4, Benjamin H. Good,4, and Michael M. Desai,3,4 Department
More informationProtein Bioinformatics. Rickard Sandberg Dept. of Cell and Molecular Biology Karolinska Institutet sandberg.cmb.ki.
Protein Bioinformatics Rickard Sandberg Dept. of Cell and Molecular Biology Karolinska Institutet rickard.sandberg@ki.se sandberg.cmb.ki.se Outline Protein features motifs patterns profiles signals 2 Protein
More information3.5 Analysis of Members under Flexure (Part IV)
3.5 Analysis o Members under Flexure (Part IV) This section covers the ollowing topics. Analysis o a Flanged Section 3.5.1 Analysis o a Flanged Section Introduction A beam can have langes or lexural eiciency.
More informationReduced-Area Constant-Coefficient and Multiple-Constant Multipliers for Xilinx FPGAs with 6-Input LUTs
Article Reduced-Area Constant-Coefficient and Multiple-Constant Multipliers for Xilinx FPGAs with 6-Input LUTs E. George Walters III Department of Electrical and Computer Engineering, Penn State Erie,
More informationCS 361 Meeting 28 11/14/18
CS 361 Meeting 28 11/14/18 Announcements 1. Homework 9 due Friday Computation Histories 1. Some very interesting proos o undecidability rely on the technique o constructing a language that describes the
More informationStatistical Machine Learning Methods for Bioinformatics II. Hidden Markov Model for Biological Sequences
Statistical Machine Learning Methods for Bioinformatics II. Hidden Markov Model for Biological Sequences Jianlin Cheng, PhD Department of Computer Science University of Missouri 2008 Free for Academic
More informationAn Algorithm for a Two-Disk Fault-Tolerant Array with (Prime 1) Disks
An Algorithm for a Two-Disk Fault-Tolerant Array with (Prime 1) Disks Sanjeeb Nanda and Narsingh Deo School of Computer Science University of Central Florida Orlando, Florida 32816-2362 sanjeeb@earthlink.net,
More informationSP-CNN: A Scalable and Programmable CNN-based Accelerator. Dilan Manatunga Dr. Hyesoon Kim Dr. Saibal Mukhopadhyay
SP-CNN: A Scalable and Programmable CNN-based Accelerator Dilan Manatunga Dr. Hyesoon Kim Dr. Saibal Mukhopadhyay Motivation Power is a first-order design constraint, especially for embedded devices. Certain
More informationOutcomes. Spiral 1 / Unit 3. The Problem SYNTHESIZING LOGIC FUNCTIONS
-3. -3.2 Outcomes Spiral / Unit 3 Minterm and Materms Canonical Sums and Products 2 and 3 Variable oolean lgebra Theorems emorgan's Theorem unction Snthesis use Canonical Sums/Products Mark Redekopp I
More informationMultivariate Gaussian Random Number Generator Targeting Specific Resource Utilization in an FPGA
Multivariate Gaussian Random Number Generator Targeting Specific Resource Utilization in an FPGA Chalermpol Saiprasert, Christos-Savvas Bouganis and George A. Constantinides Department of Electrical &
More informationA Faster Randomized Algorithm for Root Counting in Prime Power Rings
A Faster Randomized Algorithm or Root Counting in Prime Power Rings Leann Kopp and Natalie Randall July 17, 018 Abstract Let p be a prime and Z[x] a polynomial o degree d such that is not identically zero
More informationComptes rendus de l Academie bulgare des Sciences, Tome 59, 4, 2006, p POSITIVE DEFINITE RANDOM MATRICES. Evelina Veleva
Comtes rendus de l Academie bulgare des ciences Tome 59 4 6 353 36 POITIVE DEFINITE RANDOM MATRICE Evelina Veleva Abstract: The aer begins with necessary and suicient conditions or ositive deiniteness
More informationAnalysis and Design of Algorithms Dynamic Programming
Analysis and Design of Algorithms Dynamic Programming Lecture Notes by Dr. Wang, Rui Fall 2008 Department of Computer Science Ocean University of China November 6, 2009 Introduction 2 Introduction..................................................................
More informationAccelerating Quantum Chromodynamics Calculations with GPUs
Accelerating Quantum Chromodynamics Calculations with GPUs Guochun Shi, Steven Gottlieb, Aaron Torok, Volodymyr Kindratenko NCSA & Indiana University National Center for Supercomputing Applications University
More informationBeiHang Short Course, Part 7: HW Acceleration: It s about Performance, Energy and Power
BeiHang Short Course, Part 7: HW Acceleration: It s about Performance, Energy and Power James C. Hoe Department of ECE Carnegie Mellon niversity Eric S. Chung, et al., Single chip Heterogeneous Computing:
More informationAn Algorithmic Framework of Large-Scale Circuit Simulation Using Exponential Integrators
An Algorithmic Framework of Large-Scale Circuit Simulation Using Exponential Integrators Hao Zhuang 1, Wenjian Yu 2, Ilgweon Kang 1, Xinan Wang 1, and Chung-Kuan Cheng 1 1. University of California, San
More informationNon-newtonian Rabinowitsch Fluid Effects on the Lubrication Performances of Sine Film Thrust Bearings
International Journal o Mechanical Engineering and Applications 7; 5(): 6-67 http://www.sciencepublishinggroup.com/j/ijmea doi:.648/j.ijmea.75.4 ISSN: -X (Print); ISSN: -48 (Online) Non-newtonian Rabinowitsch
More informationThe Derivation of a Drag Coefficient Formula from Velocity-Voidage Correlations
1 The Derivation o a Drag Coeicient Formula rom Velocity-Voidage Correlations By M. Syamlal EG&G, T.S.W.V, Inc. P.O. Box 880 Morgantown, West Virginia 6507-0880 T.J. O Brien U.S. Department o Energy Morgantown
More informationEvolution. CT Amemiya et al. Nature 496, (2013) doi: /nature12027
Sequence Alignment Evolution CT Amemiya et al. Nature 496, 311-316 (2013) doi:10.1038/nature12027 Evolutionary Rates next generation OK OK OK X X Still OK? Sequence conservation implies function Alignment
More informationPERFORMANCE METRICS. Mahdi Nazm Bojnordi. CS/ECE 6810: Computer Architecture. Assistant Professor School of Computing University of Utah
PERFORMANCE METRICS Mahdi Nazm Bojnordi Assistant Professor School of Computing University of Utah CS/ECE 6810: Computer Architecture Overview Announcement Jan. 17 th : Homework 1 release (due on Jan.
More information