Klaus Schulten Department of Physics and Theoretical and Computational Biophysics Group University of Illinois at Urbana-Champaign

Similar documents
NIH Center for Macromolecular Modeling and Bioinformatics Developer of VMD and NAMD. Beckman Institute

Introduction to Protein Structures - Molecular Graphics Tool

NIH Center for Macromolecular Modeling and Bioinformatics Developer of VMD and NAMD. Beckman Institute

Introduction to Protein Structures - Molecular Graphics Tool

VMD: Visual Molecular Dynamics. Our Microscope is Made of...

GPU Acceleration of Cutoff Pair Potentials for Molecular Modeling Applications

GPU-Accelerated Analysis and Visualization of Large Structures Solved by Molecular Dynamics Flexible Fitting

The Computational Microscope

Folding WT villin in silico Three folding simulations reach native state within 5-8 µs

Adaptive Heterogeneous Computing with OpenCL: Harnessing hundreds of GPUs and CPUs

Molecular Dynamics Flexible Fitting

Enhanced Sampling and Free Energy Applications in Biomolecular Modeling

Our Microscope is Made of...

Direct Self-Consistent Field Computations on GPU Clusters

Graphics Card Computing for Materials Modelling

Welcome to MCS 572. content and organization expectations of the course. definition and classification

Using a CUDA-Accelerated PGAS Model on a GPU Cluster for Bioinformatics

Computational Structural Biology and Molecular Simulation. Introduction to VMD Molecular Visualization and Analysis

MURCIA: Fast parallel solvent accessible surface area calculation on GPUs and application to drug discovery and molecular visualization

The living cell is a society of molecules: molecules assembling and cooperating bring about life!

Part II - Applications of MultiSeq Evolution of Translation: Dynamics of Recognition in RNA:Protein Complexes

Part III - Bioinformatics Study of Aminoacyl trna Synthetases. VMD Multiseq Tutorial Web tools. Perth, Australia 2004 Computational Biology Workshop

Parallelization of Molecular Dynamics (with focus on Gromacs) SeSE 2014 p.1/29

Probing Biomolecular Machines with Graphics Processors

GPU Computing Activities in KISTI

Structure of the HIV-1 Capsid Assembly by a hybrid approach

Introduction to numerical computations on the GPU

Dynamic Scheduling for Work Agglomeration on Heterogeneous Clusters

Accelerating Quantum Chromodynamics Calculations with GPUs

Efficient Molecular Dynamics on Heterogeneous Architectures in GROMACS

DNA/RNA structure and packing

A Quantum Chemistry Domain-Specific Language for Heterogeneous Clusters

CRYPTOGRAPHIC COMPUTING

Introduction to Evolutionary Concepts

Real-time signal detection for pulsars and radio transients using GPUs

Perm State University Research-Education Center Parallel and Distributed Computing

A microsecond a day keeps the doctor away: Efficient GPU Molecular Dynamics with GROMACS

Integrating Globus into a Science Gateway for Cryo-EM

Our Microscope is Made of...

S0214 : GPU Based Stacking Sequence Generation For Composite Skins Using GA


Random Sampling for Short Lattice Vectors on Graphics Cards

NSF Summer School UIUC 2003 Molecular Machines of the Living Cell : Photosynthetic. Unit from Light Absorption to ATP Synthesis Melih Sener

Accelerating Protein Coordinate Conversion using GPUs

Accelerating linear algebra computations with hybrid GPU-multicore systems.

What can sequences tell us?

Don t forget to bring your MD tutorial. Potential Energy (hyper)surface

Adaptive Heterogeneous Computing with OpenCL: Harnessing hundreds of GPUs and CPUs

Announcements & Lecture Points

arxiv: v1 [hep-lat] 7 Oct 2010

Mechanical Proteins. Stretching imunoglobulin and fibronectin. domains of the muscle protein titin. Adhesion Proteins of the Immune System

COMPARISON OF CPU AND GPU IMPLEMENTATIONS OF THE LATTICE BOLTZMANN METHOD

Mechanisms for exploiting heterogeneous computing: Harnessing hundreds of GPUs and CPUs

Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWAS

Measurement of Dynamic trna Movement and Visualizing Ribosome Biogenesis Applications of Time resolved Electron Cryo microscopy

arxiv: v1 [physics.data-an] 19 Feb 2017

Evolution of Translation: Dynamics of Recognition in RNA:Protein Complexes

Lecture 25: Protein Synthesis Key learning goals: Be able to explain the main stuctural features of ribosomes, and know (roughly) how many DNA and

Protein Dynamics. The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron.

Science Unit Learning Summary

Multiphase Flow Simulations in Inclined Tubes with Lattice Boltzmann Method on GPU

Statistical Clustering of Vesicle Patterns Practical Aspects of the Analysis of Large Datasets with R

Chapter 4 A Tour of the Cell. The human body is made up of trillions of cells many of which are specialized - Muscle cells

What I Did Last Summer

Slide 1. Slide 2. Slide 3. Chapter 4 A Tour of the Cell. State Standards. Introduction to Cells. Standard 1.c. Standard 1.e.

What can be learnt from MD

no.1 Raya Ayman Anas Abu-Humaidan

Lattice Boltzmann simulations on heterogeneous CPU-GPU clusters

Outline. Viruses, Bacteria, and Archaea. Viruses Structure Classification Reproduction Prokaryotes Structure Reproduction Nutrition Bacteria Archaea

APPLICATION OF CUDA TECHNOLOGY FOR CALCULATION OF GROUND STATES OF FEW-BODY NUCLEI BY FEYNMAN'S CONTINUAL INTEGRALS METHOD

What Organelle Makes Proteins According To The Instructions Given By Dna

Molecular dynamics simulation. CS/CME/BioE/Biophys/BMI 279 Oct. 5 and 10, 2017 Ron Dror

CELL STRUCTURE & FUNCTION

Cellular Processes in Bacterial Cells

GRAPHENE FLAGSHIP. Funded by the European Union

Hands-on Course in Computational Structural Biology and Molecular Simulation BIOP590C/MCB590C. Course Details

Eukaryotic Cells. Figure 1: A mitochondrion

Statistics of the Universe: Exa-calculations and Cosmology's Data Deluge

Accelerating Molecular Modeling Applications with Graphics Processors

Molecular dynamics simulations and drug discovery

Explore Computational Power of GPU in Electromagnetics and Micromagnetics

How is molecular dynamics being used in life sciences? Davide Branduardi

POLITECNICO DI MILANO DATA PARALLEL OPTIMIZATIONS ON GPU ARCHITECTURES FOR MOLECULAR DYNAMIC SIMULATIONS

GPU acceleration of Newton s method for large systems of polynomial equations in double double and quad double arithmetic

Section 19 1 Bacteria (pages )

Xiaoxia Li. Group of HPC & Cheminformatics Institute of Process Engineering Chinese Academy of Sciences, Beijing

Cell Theory. The cell is the basic unit of structure and function for all living things, but no one knew they existed before the 17 th century!

FENZI: GPU-enabled Molecular Dynamics Simulations of Large Membrane Regions based on the CHARMM force field and PME

A CUDA Solver for Helmholtz Equation

Bacterial Genetics & Operons

4/18/2011. Titus Beu University Babes-Bolyai Department of Theoretical and Computational Physics Cluj-Napoca, Romania

TER 26. Preview for 2/6/02 Dr. Kopeny. Bacteria and Archaea: The Prokaryotic Domains. Nitrogen cycle

Regulació electrostàtica de canals microfluídics i porus biològics. Jordi Faraudo Institut de Ciència de Materials de Barcelona

Population annealing study of the frustrated Ising antiferromagnet on the stacked triangular lattice

Simulating ribosome biogenesis in replicating whole cells

Some history. Now, we know that Robert Hooke was not looking at living cells, but the remains of dead cell walls.

Practical Combustion Kinetics with CUDA

Case Study: Protein Folding using Homotopy Methods

XXL-BIOMD. Large Scale Biomolecular Dynamics Simulations. onsdag, 2009 maj 13

S XMP LIBRARY INTERNALS. Niall Emmart University of Massachusetts. Follow on to S6151 XMP: An NVIDIA CUDA Accelerated Big Integer Library

Transcription:

Klaus Schulten Department of Physics and Theoretical and Computational Biophysics Group University of Illinois at Urbana-Champaign GTC, San Jose Convention Center, CA Sept. 20 23, 2010

GPU and the Computational Microscope Accuracy Speed-up Unprecedented Scale Investigation of drug (Tamiflu) resistance of the swine flu virus demanded fast response!

Computational Microscope Views at Atomic Resolution... RER The Computational Microscope Rs E M SER N C GA RER L...how living cells maintain health and battle disease

Our Microscope is Made of... Chemistry Physics NAMD Software Virus ns/day 100 10 40,000 registered users 1 128 256 512 1024 2048 4096 8192 16384 32768 cores Math (repeat one billion times = microsecond)..and Supercomputers

Our Microscope is a Tool to Think Graphics, Geometry Physics Carl Woese Lipoprotein particle HDL Ribosomes in whole cell Genetics VMD Analysis Engine T. Martinez, Stanford U. Atomic coordinates 140,000 Registered Users Volumetric data, electronsa

Science 1: Poliovirus Infection Virus Genome Inside Capsid Virus attachment to cell Rigid Capsid membrane indentation Cell Membrane Surface Receptor How does this work? Answer requires 100 million atom long-time simulation T Virus Genome Inside Cell Loose Capsid

Science 1: Virus Capsid Mechanics Atomic Force Microscope Hepatitis B Virus 500 400 Experiment Simulation Force (pn) 300 200 100 0-40 -20 0 20 40 60-20 80 180 280 380 480 Indentation (Å)

GPU Solution 1: Time-Averaged Electrostatics Thousands of trajectory frames 1.5 hour job reduced to 3 min GPU Speedup: 25.5x Per-node power consumption on NCSA GPU cluster: CPUs-only: 448 Watt-hours CPUs+GPUs: 43 Watt-hours Power efficiency gain: 10x

Science 2: How Nature Harvests Sun Light 95% of the energy in the biosphere comes from this energy source LIGHT 20 200 ADP ATP Purple Photosynthetic Bacterium 6 Chromatophore (70 nm) 1

Science 2: How Nature Harvests Sun Light Light Electrostatics ADP ATP ~10M atoms Electrostatic field calculated with multilevel summation method 1 CPU core: 1 hr 10 min 3 GPUs (G80): ~90 seconds

GPU Solution 2: Multilevel Summation Method for Electrostatics on the GPU grid levels atom level Localized grid operations map well to GPUs grid-grid interactions (3D convolution) atom-grid interactions GPU Speedups 26x on 1 GPU 46x on 3 GPUs 10 Million Atoms 1 CPU core: 1 hr 10 min 3 GPUs (G80): ~90 seconds Multilevel summation method has linear time complexity well suited for GPUs; more flexible than other methods

Science 3: How Proteins are Made from Genetic Blueprint Ribosome Decodes genetic information from mrna Important target of many antibiotics Static structures of crystal forms led to 2009 Nobel Prize But one needs structures of ribosomes in action! ribosome mrna new protein membrane protein-conducting channel

Science 3: How Proteins Are Made from Genetic Blueprint Low-resolution Data High-resolution Structure Close-up of Nascent Protein

GPU Solution 3: Molecular Dynamics Simulations 1.6 ~2.8 1.2 s/step 0.8 0.4 2 GPUs 4 GPUs 8 GPUs 16 GPUs Faster 0 8 16 32 64 CPU cores Molecular dynamics simulation of protein insertion process NCSA Lincoln Cluster performance (8 Intel cores and 2 NVIDIA Tesla GPUs per node, 1 million atoms) GPUs reduced time for simulation from two months to two weeks!

Science 4: Nanopore Sensors Genetics: Genes control our bodies and experiences! Epigenetics: Our bodies and experiences control the genes! Epigenetics made possible through DNA methylation Detect methylation with nanopores methylation 100,000 10,000 methylated DNA easy to move methylated DNA small change big effect un-methylated DNA Related pathologies: obesity, depression, cancer 88 bp copies 1,000 100 10 1 hard to move un-methylated DNA 1.5 2 2.5 3 3.5 4 voltage (V)

Science 4: Nanopore Sensors Create a Better Nanopore with Polymeric Materials New materials, new problems: Nanoprecipitation 8 Radial distribution functions identify nanoprecipitation 6 nanoprecipitation of ions g(r) 4 Precipitate 2 Liquid 0 2 4 6 8 10 r / Angstrom

GPU Solution 4: Computing Radial Distribution Functions 4.7 million atoms Intel X5550, 4-cores @ 2.66GHz 4-core Intel X5550 CPU: 15 hours 4 NVIDIA C2050 GPUs: 10 minutes Fermi GPUs ~3x faster than GT200 GPUs: larger on-chip shared memory g(r) 8 6 4 Precipitate billions of atom pairs/sec 100 10 1 6x NVIDIA Tesla C1060 (GT200) 4x NVIDIA Tesla C2050 (Fermi) Faster 2 Liquid 0 2 4 6 8 10 r / Angstrom 0.1 10,000 100,000 1,000,000 5,000,000 Atoms

Science 5: Quantum Chemistry Visualization Chemistry is the result of atoms sharing electrons Electrons occupy clouds in the space around atoms Calculations for visualizing these clouds are costly: tens to hundreds of seconds on CPUs non-interactive GPUs enable the dynamics of electronic structures to be animated interactively for the first time Taxol: Cancer Drug VMD enables interactive display of QM simulations, e.g. Terachem, GAMESS

Science 5: Quantum Chemistry Visualization Rendering of electron clouds achieved on GPUs as quickly as you see this movie! CPUs: One working day! day minute Simulation: Terachem Interactive Visualization: VMD Courtesy T. Martinez, Stanford

GPU Solution 5: Computing C60 Molecular Orbitals 3-D orbital lattice: millions of points Device CPUs, GPUs Runtime (s) Speedup Intel X5550-SSE 1 30.64 1.0 Intel X5550-SSE 8 4.13 7.4 GeForce GTX 480 1 0.255 120 GeForce GTX 480 4 0.081 378 Lattice slices computed on multiple GPUs GPU threads each compute one point 2-D CUDA grid on one GPU CUDA thread blocks

Science 6: Protein Folding Protein misfolding responsible for diseases: Alzheimer s Parkinson s Huntington Mad cow Type II diabetes... villin headpiece 3 months on 329 CPUs Observe folding process in unprecedented detail

Science 6: Protein Folding Some simulations still fail to fold proteins due to inaccurate modeling of interatomic forces! Protein folding demands more accurate model which leads to more expensive computation WW domain 3 months on 329 CPUs

GPU Solution 6: Computing More Accurate Simulation Models Atomic polarizability increases computation by 2x but, the additional computations are perfectly suited to the GPU! For now, NAMD calculates atomic polarizability on CPUs only...soon we will also use GPUs Atomic polarizability of water, highly accurately simulated through additional particles (shown in green) NAMD CPU performance scaling polarizable water non-polarizable water Seconds per step 1 0.1 0.01 100 1000 CPU cores

LacY Genetic activity of E. coli bacteria mrna mrna LacY Zan Luthey-Schulten and Elijah Roberts GTX280 Since our technique is a native GPU algorithm, no optimized CPU version exists by which to measure its performance..

2010 Workshop on GPU Computing for Molecular Modeling Spread the benefits of GPU computing to solve new problems in molecular modeling Intensive 2-day workshop after 1-week GPU workshop at NCSA Participants present their work and exchange ideas and GPU solutions Workshop Participants

Three of our GPU Heroes Our GPU Biomedical Science Computing Goals: More accurate simulations Speed-up: simulations now take minutes instead of weeks Make previously unreachable scales accessible Acknowledgements Theoretical and Computational Biophysics Group, University of Illinois at Urbana- Champaign Wen-mei Hwu and the IMPACT group at University of Illinois at Urbana-Champaign L. Kale and the Center for Parallel Computing at University of Illinois at Urbana-Champaign NVIDIA CUDA Center of Excellence, University of Illinois at Urbana-Champaign Ben Levine, Axel Kohlmeyer at Temple University NCSA Innovative Systems Lab The CUDA team at NVIDIA John Stone Jim Phillips David Hardy Zan Luthey-Schulten and Elija Roberts (E. coli whole cell simulations) NIH support: P41-RR05969

Thank You