Introduction to Three Dimensional Structure Determination of Macromolecules by Cryo-Electron Microscopy

Similar documents
3D Structure Determination using Cryo-Electron Microscopy Computational Challenges

Some Optimization and Statistical Learning Problems in Structural Biology

Non-unique Games over Compact Groups and Orientation Estimation in Cryo-EM

PCA from noisy linearly reduced measurements

Three-dimensional structure determination of molecules without crystallization

Class Averaging in Cryo-Electron Microscopy

Beyond Scalar Affinities for Network Analysis or Vector Diffusion Maps and the Connection Laplacian

Multireference Alignment, Cryo-EM, and XFEL

Orientation Assignment in Cryo-EM

Global Positioning from Local Distances

Covariance Matrix Estimation for the Cryo-EM Heterogeneity Problem

Three-Dimensional Electron Microscopy of Macromolecular Assemblies

arxiv: v1 [physics.comp-ph] 12 Mar 2018

History of 3D Electron Microscopy and Helical Reconstruction

Electron microscopy in molecular cell biology II

Opportunity and Challenge: the new era of cryo-em. Ning Gao School of Life Sciences Tsinghua University

NIH Public Access Author Manuscript Ann Math. Author manuscript; available in PMC 2012 November 23.

F. Piazza Center for Molecular Biophysics and University of Orléans, France. Selected topic in Physical Biology. Lecture 1

ParM filament images were extracted and from the electron micrographs and

Fast and Robust Phase Retrieval

Applied and Computational Harmonic Analysis

Image Assessment San Diego, November 2005

Structure, Dynamics and Function of Membrane Proteins using 4D Electron Microscopy

Determining Protein Structure BIBC 100

Computing Steerable Principal Components of a Large Set of Images and Their Rotations Colin Ponce and Amit Singer

Fast Angular Synchronization for Phase Retrieval via Incomplete Information

Data-dependent representations: Laplacian Eigenmaps

Tools for Cryo-EM Map Fitting. Paul Emsley MRC Laboratory of Molecular Biology

REPRESENTATION THEORETIC PATTERNS IN THREE DIMENSIONAL CRYO-ELECTRON MICROSCOPY I - THE INTRINSIC RECONSTITUTION ALGORITHM

arxiv: v1 [cs.cv] 12 Jul 2016

Copyright Mark Brandt, Ph.D A third method, cryogenic electron microscopy has seen increasing use over the past few years.

Representation theoretic patterns in three dimensional Cryo-Electron Microscopy I: The intrinsic reconstitution algorithm

sin" =1.22 # D "l =1.22 f# D I: In introduction to molecular electron microscopy - Imaging macromolecular assemblies

Mitochondria Mitochondria were first seen by kollicker in 1850 in muscles and called them sarcosomes. Flemming (1882) described these organelles as

Experimental Techniques in Protein Structure Determination

A New Approach to EM of Helical Polymers Yields New Insights

Part 1 X-ray Crystallography

Graph Partitioning Using Random Walks

What can electron microscopy tell us about chaperoned protein folding? Helen R Saibil

CS273: Algorithms for Structure Handout # 13 and Motion in Biology Stanford University Tuesday, 11 May 2003

Basics of protein structure

Integrating Globus into a Science Gateway for Cryo-EM

Nonlinear Dimensionality Reduction. Jose A. Costa

Lecture CIMST Winter School. 1. What can you see by TEM?

The Projected Power Method: An Efficient Algorithm for Joint Alignment from Pairwise Differences

Advanced Machine Learning & Perception

Processing Heterogeneity

Protein Structure Analysis and Verification. Course S Basics for Biosystems of the Cell exercise work. Maija Nevala, BIO, 67485U 16.1.

NMR of large protein systems: Solid state and dynamic nuclear polarization. Sascha Lange, Leibniz-Institut für Molekulare Pharmakologie (FMP)

Algorithms in Bioinformatics

REPRESENTATION THEORETIC PATTERNS IN THREE DIMENSIONAL CRYO-ELECTRON MICROSCOPY III - PRESENCE OF POINT SYMMETRIES

Algorithms for Image Restoration and. 3D Reconstruction from Cryo-EM. Images

Introduction to" Protein Structure

Unsupervised Learning Techniques Class 07, 1 March 2006 Andrea Caponnetto

Molecular electron microscopy

NMR Spectroscopy of Polymers

Application examples of single particle 3D reconstruction. Ning Gao Tsinghua University

Protein Crystallography

I690/B680 Structural Bioinformatics Spring Protein Structure Determination by NMR Spectroscopy

Introduction to Cryo Microscopy. Nikolaus Grigorieff

Biological Small Angle X-ray Scattering (SAXS) Dec 2, 2013

ITERATIVE METHODS IN LARGE FIELD ELECTRON MICROSCOPE TOMOGRAPHY. Xiaohua Wan

Intrinsic Structure Study on Whale Vocalizations

Improving cryo-em maps: focused 2D & 3D classifications and focused refinements

Electron cryo- microscopy at Yale

Protein Structure Prediction II Lecturer: Serafim Batzoglou Scribe: Samy Hamdouche

Spectroscopy of Nanostructures. Angle-resolved Photoemission (ARPES, UPS)

Module 2: Reflecting on One s Problems

Self-Tuning Semantic Image Segmentation

THE HIDDEN CONVEXITY OF SPECTRAL CLUSTERING

New algoritms for electron diffraction of 3D protein crystals. JP Abrahams, D Georgieva, L Jiang, I Sikhuralidze, NS Pannu

Introduction to Machine Learning. PCA and Spectral Clustering. Introduction to Machine Learning, Slides: Eran Halperin

Automated Assignment of Backbone NMR Data using Artificial Intelligence

Iterative Laplacian Score for Feature Selection

arxiv: v1 [math.rt] 16 Oct 2009

Supplementary Information: Long range allosteric regulation of the human 26S proteasome by 20S proteasome-targeting cancer drugs

BINF6201/8201. Molecular phylogenetic methods

Contents. xiii. Preface v

X-ray protein crystallography is currently the primary methodology

Is Manifold Learning for Toy Data only?

REPRESENTATION THEORETIC PATTERNS IN THREE DIMENSIONAL CRYO-ELECTRON MICROSCOPY III - PRESENCE OF POINT SYMMETRIES

CRYSTALLOGRAPHY AND STORYTELLING WITH DATA. President, Association of Women in Science, Bethesda Chapter STEM Consultant

Tutorial 1 Geometry, Topology, and Biology Patrice Koehl and Joel Hass

Diffusion Wavelets and Applications

arxiv: v3 [cs.cv] 6 Apr 2016

Review. CS/CME/BioE/Biophys/BMI 279 Nov. 30 and Dec. 5, 2017 Ron Dror

Chapter 12: Intracellular sorting

Learning Eigenfunctions: Links with Spectral Clustering and Kernel PCA

Spectral Graph Theory and its Applications. Daniel A. Spielman Dept. of Computer Science Program in Applied Mathematics Yale Unviersity

Physics of Cellular materials: Filaments

Applications of scattering theory! From the structure of the proton! to protein structure!

SHELXC/D/E. Andrea Thorn

BMB/Bi/Ch 173 Winter 2018

Macromolecular X-ray Crystallography

Computational Biology: Basics & Interesting Problems

Statistical Issues in Searches: Photon Science Response. Rebecca Willett, Duke University

Proteins are not rigid structures: Protein dynamics, conformational variability, and thermodynamic stability

Cells and the Stuff They re Made of. Indiana University P575 1

Theory and Applications of Residual Dipolar Couplings in Biomolecular NMR

Sparsity and Morphological Diversity in Source Separation. Jérôme Bobin IRFU/SEDI-Service d Astrophysique CEA Saclay - France

Transcription:

Introduction to Three Dimensional Structure Determination of Macromolecules by Cryo-Electron Microscopy Amit Singer Princeton University, Department of Mathematics and PACM July 23, 2014 Amit Singer (Princeton University) July 2014 1 / 21

Single Particle Reconstruction using cryo-em Schematic drawing of the imaging process: The cryo-em problem: Amit Singer (Princeton University) July 2014 2 / 21

New detector technology: Exciting times for cryo-em BIOCHEMISTRY The Resolution Revolution Werner Kühlbrandt P recise knowledge of the structure of macromolecules in the cell is essential for understanding how they function. Structures of large macromolecules can now be obtained at near-atomic resolution by averaging thousands of electron microscope images recorded before radiation damage accumulates. This is what Amunts et al. have done in their research article on page 1485 of this issue ( 1), reporting the structure of the large subunit of the mitochondrial ribosome at 3.2 Å resolution by electron cryo-microscopy (cryo-em). Together with other recent high-resolution cryo-em structures ( 2 4) (see the figure), this achievement heralds the beginning of a new era in molecular biology, where structures at near-atomic resolution are no longer the prerogative of x-ray crystallography or nuclear magnetic resonance (NMR) spectroscopy. Ribosomes are ancient, massive protein- RNA complexes that translate the linear genetic code into three-dimensional proteins. Mitochondria semi-autonomous organelles www.sciencemag.org SCIENCE VOL 343 28 MARCH 2014 1443 A B C Advances in detector technology and image processing are yielding high-resolution electron cryo-microscopy structures of biomolecules. Near-atomic resolution with cryo-em. (A) The large subunit of the yeast mitochondrial ribosome at 3.2 Å reported by Amunts et al. In the detailed view below, the base pairs of an RNA double helix and a magnesium ion (blue) are clearly resolved. (B) TRPV1 ion channel at 3.4 Å ( 2), with a detailed view of residues lining the ion pore on the four-fold axis of the tetrameric channel. (C) F 420-reducing [NiFe] hydrogenase at 3.36 Å ( 3). The detail shows an α helix in the FrhA subunit with resolved side chains. The maps are not drawn to scale. Amit Singer (Princeton University) July 2014 3 / 21

The Resolution Revolution X-ray crystallography is one of the greatest innovations of the 20th century W. Kühlbrandt, Science 343, 1443, March 28 2014: Does the resolution revolution in cryo-em mean that the era of x-ray protein crystallography is coming to an end? Definitely not. For the foreseeable future, small proteins in cryo-em, anything below 100 kd counts as small and resolutions of 2Å or better will remain the domain of x-rays. But for large, fragile, or flexible structures (such as membrane protein complexes) that are difficult to prepare yet hold the key to central biomedical questions, the new technology is a major breakthrough. In the future, it may no longer be necessary to crystallize large, well-defined complexes such as ribosomes. Instead, their structures can be determined elegantly and quickly by cryo-em. These are exciting times. Amit Singer (Princeton University) July 2014 4 / 21

Big Movie Data, Publicly Available http://www.ebi.ac.uk/pdbe/emdb/empiar/ Amit Singer (Princeton University) July 2014 5 / 21

ASPIRE: Algorithms for Single Particle Reconstruction Open source toolbox, publicly available: http://spr.math.princeton.edu/ Amit Singer (Princeton University) July 2014 6 / 21

E. coli 50S ribosomal subunit 27,000 particle images provided by Dr. Fred Sigworth, Yale Medical School 3D reconstruction by S, Lanhui Wang (now data scientist at TripAdvisor), and Jane Zhao (now postdoc at Courant Institute, NYU) Amit Singer (Princeton University) July 2014 7 / 21

Data-Driven Discovery: The cryo-em experience Amit Singer (Princeton University) July 2014 8 / 21

Image Formation Model and Inverse Problem Projection I i Molecule φ R i = Ri 1 Ri 2 Ri 3 T T T SO(3) Electron source Projection images I i (x,y) = φ(xr1 i +yri 2 +zri 3 )dz + noise. φ : R 3 R is the electric potential of the molecule. Cryo-EM problem: Find φ and R 1,...,R n given I 1,...,I n. Amit Singer (Princeton University) July 2014 9 / 21

Toy Example Amit Singer (Princeton University) July 2014 10 / 21

Typical Reconstruction Procedure: Iterative Refinement Alternating minimization or expectation-maximization, starting from an initial guess φ 0 for the 3-D structure I i = P(R i φ)+ǫ i, i = 1,...,n. R i φ(r) = φ(r 1 i r) is the left group action P is integration in the z-direction. Converges to a local optimum, not necessarily the global one. Model bias is a well-known pitfall Einstein from noise (Henderson, PNAS 2013) Is reference free orientation assignment and reconstruction possible? Amit Singer (Princeton University) July 2014 11 / 21

Orientation Estimation: Fourier projection-slice theorem (x ij,y ij ) R i c ij c ij = (x ij,y ij,0) T Projection I i Î i (x ji,y ji ) 3D Fourier space R i c ij = R j c ji Projection I j Î j 3D Fourier space Amit Singer (Princeton University) July 2014 12 / 21

Angular Reconstitution (Vainshtein and Goncharov 1986, Van Heel 1987) Amit Singer (Princeton University) July 2014 13 / 21

Experiments with simulated noisy projections Each projection is 129x129 pixels. SNR = Var(Signal) Var(Noise), (a) Clean (b) SNR=2 0 (c) SNR=2 1 (d) SNR=2 2 (e) SNR=2 3 (f) SNR=2 4 (g) SNR=2 5 (h) SNR=2 6 (i) SNR=2 7 (j) SNR=2 8 Amit Singer (Princeton University) July 2014 14 / 21

Fraction of correctly identified common lines and the SNR Define common line as being correctly identified if both radial lines deviate by no more than 10 from true directions. log 2 (SNR) p 20 0.997 0 0.980-1 0.956-2 0.890-3 0.764-4 0.575-5 0.345-6 0.157-7 0.064-8 0.028-9 0.019 Amit Singer (Princeton University) July 2014 15 / 21

Outline for this week 1 Orientation assignment 2 Heterogeneity 3 Class averaging and symmetry detection Amit Singer (Princeton University) July 2014 16 / 21

Orientation Estimation (x ij,y ij) R ic ij c ij = (x ij,y ij,0) T Projection I i Projection I i Î i 3D Fourier space Molecule φ R i SO(3) (x ji,y ji) R ic ij = R jc ji Electron source Projection I j Î j 3D Fourier space n = 3: Vainshtein and Goncharov 1986, van Heel 1987 n > 3: S, Shkolnisky SIAM Imaging 2011 Minimizing a quadratic cost: n i,j=1 R ic ij R j c ji 2 Quadratic constraints: R T i R i = I 3 3 Semidefinite Programming (SDP) Relaxation, Spectral Relaxation n > 3: Maximum likelihood using SDP: Bandeira, Charikar, Chen, S (in preparation) Amit Singer (Princeton University) July 2014 17 / 21

The heterogeneity problem What if the molecule has more than one possible structure? (Image source: H. Liao and J. Frank, Classification by bootstrapping in single particle methods, Proceedings of the 2010 IEEE international conference on biomedical imaging, 2010.) Covariance matrix estimation of the 3-D structures from their 2-D projections Katsevich, Katsevich, S (submitted) Amit Singer (Princeton University) July 2014 18 / 21

Class averaging for image denoising Rotation invariant representation (steerable PCA, bispectrum) Vector diffusion maps S, Wu (Comm. Pure Appl. Math 2012) Zhao, S (J Struct. Bio. 2014) Generalization of Laplacian Eigenmaps (Belkin, Niyogi 2003) and Diffusion Maps (Coifman, Lafon 2006) Graph Connection Laplacian Experimental images (70S) courtesy of Dr. Joachim Frank (Columbia) Class averages by vector diffusion maps (averaging with 20 nearest neighbors) Amit Singer (Princeton University) July 2014 19 / 21

Algorithmic Pipeline Particle Picking: manual, automatic or experimental image segmentation. Class Averaging: classify images with similar viewing directions, register and average to improve their signal-to-noise ratio (SNR). Orientation Estimation: common lines Three-dimensional Reconstruction: a 3D volume is generated by a tomographic inversion algorithm. Iterative Refinement Extensions Non-trivial point-group symmetries Heterogeneity (structural variability) Amit Singer (Princeton University) July 2014 20 / 21

References - Orientation Assignment A. Singer, Y. Shkolnisky, Three-Dimensional Structure Determination from Common Lines in Cryo-EM by Eigenvectors and Semidefinite Programming, SIAM Journal on Imaging Sciences, 4 (2), pp. 543 572 (2011). R. Hadani, A. Singer, Representation theoretic patterns in three dimensional Cryo-Electron Microscopy I The intrinsic reconstitution algorithm, Annals of Mathematics, 174 (2), pp. 1219 1241 (2011). L. Wang, A. Singer, Z. Wen, Orientation Determination of Cryo-EM images using Least Unsquared Deviations, SIAM Journal on Imaging Sciences, 6(4), pp. 2450 2483 (2013). A. S. Bandeira, M. Charikar, A. Singer, A. Zhu, Multireference Alignment using Semidefinite Programming, 5th Innovations in Theoretical Computer Science (ITCS 2014). Amit Singer (Princeton University) July 2014 21 / 21