An FPGA Implementation of Reciprocal Sums for SPME

Similar documents
Non-bonded interactions

Advanced Molecular Molecular Dynamics

Non-bonded interactions

Why Proteins Fold? (Parts of this presentation are based on work of Ashok Kolaskar) CS490B: Introduction to Bioinformatics Mar.

Optimizing GROMACS for parallel performance

Seminar in Particles Simulations. Particle Mesh Ewald. Theodora Konstantinidou

Introduction to molecular dynamics

Parallelization of Molecular Dynamics (with focus on Gromacs) SeSE 2014 p.1/29

Efficient Parallelization of Molecular Dynamics Simulations on Hybrid CPU/GPU Supercoputers

Molecular Dynamics Simulations

Lecture 12: Solvation Models: Molecular Mechanics Modeling of Hydration Effects

A Fast N-Body Solver for the Poisson(-Boltzmann) Equation

Large-Scale Hydrodynamic Brownian Simulations on Multicore and Manycore Architectures

Molecular dynamics simulation. CS/CME/BioE/Biophys/BMI 279 Oct. 5 and 10, 2017 Ron Dror

The Fast Multipole Method in molecular dynamics

RWTH Aachen University

Building a Multi-FPGA Virtualized Restricted Boltzmann Machine Architecture Using Embedded MPI

Explore Computational Power of GPU in Electromagnetics and Micromagnetics

Why study protein dynamics?

FENZI: GPU-enabled Molecular Dynamics Simulations of Large Membrane Regions based on the CHARMM force field and PME

Long-range interactions: P 3 M, MMMxD, ELC, MEMD and ICC

CE 530 Molecular Simulation

Coarse-Grained Models!

SP-CNN: A Scalable and Programmable CNN-based Accelerator. Dilan Manatunga Dr. Hyesoon Kim Dr. Saibal Mukhopadhyay

Tutorial on the smooth particle-mesh Ewald algorithm

Janus: FPGA Based System for Scientific Computing Filippo Mantovani

A Molecular Dynamics Simulation of a Homogeneous Organic-Inorganic Hybrid Silica Membrane

Anton: A Specialized ASIC for Molecular Dynamics

Gromacs Workshop Spring CSC

MOLECULAR DYNAMICS SIMULATIONS OF BIOMOLECULES: Long-Range Electrostatic Effects

An introduction to Molecular Dynamics. EMBO, June 2016

Linear Scale-Space. Algorithms for Linear Scale-Space. Gaussian Image Derivatives. Rein van den Boomgaard University of Amsterdam.

Multiple time step Monte Carlo simulations: Application to charged systems with Ewald summation

Computer simulation methods (2) Dr. Vania Calandrini

Graphics Card Computing for Materials Modelling

Introduction to Benchmark Test for Multi-scale Computational Materials Software

arxiv: v2 [physics.comp-ph] 19 Oct 2008

A reciprocal space based method for treating long range interactions in ab initio and force-field-based calculations in clusters

Computational Molecular Biophysics. Computational Biophysics, GRS Jülich SS 2013

Molecular Dynamics. A very brief introduction

Astronomical Computer Simulations. Aaron Smith

Molecular Dynamics Simulations. Dr. Noelia Faginas Lago Dipartimento di Chimica,Biologia e Biotecnologie Università di Perugia

ab initio Electronic Structure Calculations

Bioengineering 215. An Introduction to Molecular Dynamics for Biomolecules

Molecular Mechanics, Dynamics & Docking

Exam 2. Average: 85.6 Median: 87.0 Maximum: Minimum: 55.0 Standard Deviation: Numerical Methods Fall 2011 Lecture 20

Cider Seminar, University of Toronto DESIGN AND PERFORMANCE ANALYSIS OF A HIGH SPEED AWGN COMMUNICATION CHANNEL EMULATOR

All-atom Molecular Mechanics. Trent E. Balius AMS 535 / CHE /27/2010

XXL-BIOMD. Large Scale Biomolecular Dynamics Simulations. onsdag, 2009 maj 13

Set the initial conditions r i. Update neighborlist. Get new forces F i

Computational Chemistry - MD Simulations

Simulations of bulk phases. Periodic boundaries. Cubic boxes

Accelerating Model Reduction of Large Linear Systems with Graphics Processors

Classical Electrostatics for Biomolecular Simulations

Force Field for Water Based on Neural Network

Charge equilibration

1/11/12. A Scalar Field. The Electric Field + + A Vector Field

Parameter Estimation for the Dirichlet-Multinomial Distribution

Multiscale Materials Modeling

PHY 114 A General Physics II 11 AM-12:15 PM TR Olin 101. Plan for Lecture 1:

Introduction to Molecular Dynamics

Andrew Morton University of Waterloo Canada

Scalable and Power-Efficient Data Mining Kernels

How to Avoid Butchering S-parameters

Electromagnetic Field Theory (EMT)

Neighbor Tables Long-Range Potentials

Computation Time Assessment of a Galerkin Finite Volume Method (GFVM) for Solving Time Solid Mechanics Problems under Dynamic Loads

Fondamenti di Chimica Farmaceutica. Computer Chemistry in Drug Research: Introduction

Example questions for Molecular modelling (Level 4) Dr. Adrian Mulholland

Reflections on S-parameter Quality DesignCon IBIS Summit, Santa Clara, February 3, 2011

Concept of Statistical Model Checking. Presented By: Diana EL RABIH

Using Molecular Dynamics to Compute Properties CHEM 430

Structural Bioinformatics (C3210) Molecular Mechanics

Material Surfaces, Grain Boundaries and Interfaces: Structure-Property Relationship Predictions

New Algorithms for Conventional and. Jing Kong ( 孔静 ) Q-Chem Inc. Pittsburgh, PA

Molecular Dynamics Simulations

Long range interactions in computer simulations. 1 (Part 3)

Lecture 27: Hardware Acceleration. James C. Hoe Department of ECE Carnegie Mellon University

Efficient multiple time step method for use with Ewald and particle mesh Ewald for large biomolecular systems

arxiv: v1 [physics.comp-ph] 22 Nov 2012

Beam dynamics calculation

CS 179: GPU Programming. Lecture 9 / Homework 3

Econometrics I. Lecture 10: Nonparametric Estimation with Kernels. Paul T. Scott NYU Stern. Fall 2018

Simulation of molecular systems by molecular dynamics

Development of Molecular Dynamics Simulation System for Large-Scale Supra-Biomolecules, PABIOS (PArallel BIOmolecular Simulator)

FDTD for 1D wave equation. Equation: 2 H Notations: o o. discretization. ( t) ( x) i i i i i

ACCELERATED LEARNING OF GAUSSIAN PROCESS MODELS

A smooth particle-mesh Ewald algorithm for Stokes suspension simulations: The sedimentation of fibers

Week 4. Outline Review electric Forces Review electric Potential

ARTICLES. Constant Temperature Constrained Molecular Dynamics: The Newton-Euler Inverse Mass Operator Method

Analysis of Software Artifacts

Reading: Chapter 28. 4πε r. For r > a. Gauss s Law

GPU acceleration of Newton s method for large systems of polynomial equations in double double and quad double arithmetic

Faster Kinetics: Accelerate Your Finite-Rate Combustion Simulation with GPUs

Supporting Information. for An atomistic modeling of F-actin mechanical responses and determination of. mechanical properties

PanHomc'r I'rui;* :".>r '.a'' W"»' I'fltolt. 'j'l :. r... Jnfii<on. Kslaiaaac. <.T i.. %.. 1 >

Forces, Energy and Equations of Motion

Notes for Lecture 10

Quantum computing with superconducting qubits Towards useful applications

Transcription:

An FPGA Implementation of Reciprocal Sums for SPME Sam Lee and Paul Chow Edward S. Rogers Sr. Department of Electrical and Computer Engineering University of Toronto

Objectives Accelerate part of Molecular Dynamics Simulation Smooth Particle Mesh Ewald Implementation FPGA based Try it and learn Investigation Acceleration bottleneck Precision requirement Parallelization strategy

Presentation Outline Molecular Dynamics SPME The Reciprocal Sum Compute Engine Speedup and Parallelization Precision Future work

Molecular Dynamics Simulation 4

Molecular Dynamics Combines empirical force calculations with Newton s equations of motion. Predict the time trajectory of small atomic systems. Computationally demanding.. Calculate interatomic forces.. Calculate the net force.. Integrate Newton s equations of motion. a r = F m ( t + δt ) = r () t + δt v () t + 0.5δt a () t v ( t + δt ) = v () t + 0. 5δt a () t + a ( t + δt ) F 5

Molecular Dynamics k Θ All Angles + + k ( l b l o All Bonds ( Θ Θ ) o ) U = A [ + cos( nτ + φ)] All Torsions + + All Pairs All Pairs q q r σ 4ε r 6 σ r + δ δ 6

MD Simulation Problem scientists are facing: SLOW! O(N ) complexity. 0 CPU Years 7

Solutions Parallelize to more compute engines Accelerate with FPGA Especially: The non-bonded calculations To be more specific, this paper addresses: Electrostatic interaction (Reciprocal space) Smooth Particle Mesh Ewald algorithm. 8

Previous Work Software SPME Implementations: Original PME Package written by Toukmaji. Used in NAMD. Hardware Implementations: No previous hardware implementation of reciprocal sums calculation. MD-Grape & MD-Engine uses Ewald Summation. Ewald Summation is O(N ); SPME is O(NLogN)! 9

Smooth Particle Mesh Ewald 0

Electrostatic Interaction Coulombic equation: qq = 4πε r v coulomb 0 Under the Periodic Boundary Condition, the summation to calculate Electrostatic energy is only Conditionally Convergent. U = ' n N N q q i= j= ij, n i r j

Periodic Boundary Condition A 4 5 B 4 5 C 4 5 D 4 5 E 4 5 F 4 5 G 4 5 H 4 5 I 4 5 To combat Surface Effect 4 5 Replication

Ewald Summation Used For PBC To calculate the Coulombic Interactions O(N ) Direct Sum + O(N ) Reciprocal Sum q r Direct Sum q Reciprocal Sum q r r

Smooth Particle Mesh Ewald Shift the workload to the Reciprocal Sum. Use Fast Fourier Transform. O(N) Real + O(NLogN) Reciprocal. RSCE calculates the Reciprocal Sums using the SPME algorithm. 4

5 SPME Reciprocal Contribution ),m,m m Q)( (θ ),m,m (m r Q r E F K m K m rec K m αi αi rec ~ 0 0 0 = = = = = ) (m b ) (m b ) b (m ),m,m B(m = 0 exp exp = + = n k i i n i i i i ) K πim k ( ) (k M ) K )m πi(n ( ) b (m exp m ) /β m π ( πv ),m,m C(m = 0 0 0 0 0 = ),,,c( m ) m, m, m )F(Q)(,m,m F(Q)(m ),m,m B(m m ) /β m π ( πv E m ~ 0 exp = FFT FFT Energy: Force: ),m,m m Q)( (θ ),m,m Q(m E K m K m rec K m ~ 0 0 0 = = = =

Charge Interpolation F D A C B E 6

Reciprocal Sum Compute Engine 7

RSCE Architecture 8

RSCE Verification Testbench 9

RSCE Validation Environment 0

Speedup Estimate RSCE vs. Software Implementation

RSCE Speedup RSCE @ 00MHz vs. P4 Intel @.4GHz. Speedup: x to 4x Why so insignificant? Reciprocal Sums calculations not easily parallelizable. QMM memory bandwidth limitation. Improvement: Using more QMM memories can improve the speedup. Slight design modifications are required.

Parallelization Strategy Multiple RSCE

RSCE Parallelization Strategy Assume a -D simulation system. Assume P=, K=8, N=6. Assume NumP = 4. An 8x8x8 mesh Four 4x4x4 Mini Meshes 4

RSCE Parallelization Strategy Mini-mesh composed -> D-IFFT D-IFFT = two passes of D-FFT (X and Y). X Direction FFT Y Direction FFT Ky Ky P P P P P P P4 D FFT Y direction P4 0 Kx D FFT X direction 0 Kx 5

Parallelization Strategy D-IFFT -> Energy Calculation -> D-FFT D-FFT -> Force Calculation Energy Calculation Force Calculation E Total = E P P= 0 D-FFT 6

MD Simulations RSCE + NAMD 7

RSCE Precision Precision goal: Relative error bound < 0-5. Two major calculation steps: B-Spline Calculation. D-FFT/IFFT Calculation. Due to the limited logic resource & limited precision FFT LogiCore. => Precision goal cannot be achieved. 8

RSCE Precision To achieve the relative error bound of < 0-5. Minimum calculation precision: FFT {4.0}, B-Spline {.7} 9

MD Simulation with RSCE RMS Energy Error Fluctuation: RMS Energy Fluctuatio n = E E E 0

FFT Precision Vs. Energy Fluctuation

Summary Implementation of FPGA-based Reciprocal Sums Compute Engine and its SystemC model. Integration of the RSCE into a widely used Molecular Dynamics program called NAMD for verification RSCE Speedup Estimate x to 4x Precision Requirement B-Spline: {.7} & FFT: {4:0} => 0-5 rel. error Parallelization Strategy

Future Work More in-depth precision analysis. Investigation on how to further speedup the SPME algorithm with FPGA.

Questions 4