Reac%on coordinates and LSDMap

Size: px
Start display at page:

Download "Reac%on coordinates and LSDMap"

Transcription

1 Reac%on coordinates and LSDMap

2 The challenges in molecular biophysics: a broad range of interconnected length and %me scales Organism Cell E.Coli ~10 20 atoms ~10 10 atoms ~1-10 µm Thermodynamics Macroscale Thermodynamics Mesoscale System ~ atoms ~ nm Mesoscale Mul%scale Biomolecule (Macromolecule) ~ atoms ~ 1-10 nm Mesoscale Mul%scale Molecule ~10 1 atoms ~ 1-10 Ǻ Quantum Chemistry Atom ~1 atom ~ 1 Ǻ Quantum Physics

3 What we study Dynamics Function 10 9 Cells Number of atoms Systems Macromolecules Molecules Atoms Length scale (m)

4 Way do we care about dynamics? func&on requires dynamics! from

5 Main goals and challenges Trajectories in equilibrium distribution Observables to compare with the experiment Mechanism from the dynamics New predictions dd 1. Force Field a set of parameters and equations describing the interactions between atoms ? 2. Sampling Can the simulations cover the rare events that we are interested in? 3. Data analysis How do we understand the mechanism in the data?

6 Protein Representa%ons Cartesian coordinate representa%on x, y, z coordinate for each single atom Internal coordinate representa%on: b - bond length α - angle between two consecu%ve bonds θ - angle between three consecu%ve bonds Idealized geometry model dihedral angles - only DOFs 2 backbone dihedrals 4 sidechain dihedrals

7

8 All- Atom Molecular Mechanics Force- Field A biomolecule is considered a collec%on of masses (atoms) connected by springs (bonds). The associated effec%ve energy (including electronic effects) is parameterized in a classical force- field. Generally, a force- field is in the form:

9

10

11 What we are interested in.

12 What is relevant and what is noise? in between bulk water one water molecule quantum chemistry gives molecular orbitals Large water clusters Wet/Dry interfaces Interac%on with solutes what are the relevant variables? what is the intrinsic dimensionality? thermodynamics describes the system What is the minimal set of variables needed to describe realis%cally the dynamics of a macromolecule? C.Clemen%, Curr. Opin. Struct. Biol. 2008, vol.18(1), 10-15

13 The problem of reac%on coordinates?

14 The problem of reac%on coordinates Reac%on Coordinates iden%fy the minimum free energy path along which the reac%on take place, and allow to locate reactants, products, and transi%on states Example: H + D 2 O Figures from B.R.Strazisar et al., Science 290, 958(2000)

15 How to choose good reac%on coordinates Physical- relevant variables Q, Rg, z automated ways: gene%c neural network, maximum likelihood a priori knowledge of the system Iso- commitor based on some other reac%on coordinate Dimensionality reduc%ons? linear dimensionality reduc%on (PCA, MDS) usually around equilibrium nonlinear dimensionality reduc%on Isomap, Diffusion Map, Sketch- map require correla%on to the physical- relevant variables

16 Reaction coordinates Gauge the progress of a reaction Cluster the (meta)stable states Preserve the barrier height Free Energy reaction coordinate Physical (intuitive) collective variables C5 Iso-commitor (Pfold) αp αr αl Dimensionality reduction?

17 The problem of reac%on coordinates p- fold ( commitment probability, isocommidor ): probability for a path star%ng in a par%cular point on the landscape to visit the folded state before the unfolded state 50% 50% Folded Unfolded

18 Empirical reac%on coordinates: how good are they? Folding trajectory of CI2 (Go- like model) at T f C.Clemen%, P.A. Jennings, J.N. Onuchic J.Mol.Biol. 311, (2001)

19 Is p- fold the ul%mate reac%on coordinate? S.S. Cho, Y. Levy, P.G. Wolynes, P versus Q: Structural reac:on coordinates capture protein folding on smooth landscapes PNAS 103:3, (2006)

20 Mathema%cally this is a problem of non- linear dimensionality reduc%on Example: a set of points on a torus in 3d defines a 2d embedded surface

21 Reduc%on coordinates for macromolecular mo%ons The problem consists in finding the best low dimensional descrip%on of a collec%on of macromolecular conforma%ons? Main mo%on 2 Main mo%on 1 We need to introduce a measure of similarity between configura%ons

22 Similar problems arise in different research fields (computer science, engineering, applied math, sta%s%cs, biology, ) Examples: classifica%on of documents, image recogni%on Courtesy of M.Maggioni

23 Linear dimensionality reduc%on: Principle component analysis (PCA) Pearson, Philos. Mag, 2, 559, 1901

24 Idea: Use geodesics to define the surface A geodesic is the shortest path between two points If we know the geodesics between any couple of points then we know everything about the geometry of the system Isomap algorithm - Tenenbaum, de Silva, & Langford, (2000) Science 290,

25 Basic ideas of ISOMAP Define a low- dimensional hyper- surface preserving as best as possible geodesic distances between all pairs of data points in the sample Geodesic distance and Euclidean distance

26 Use geodesics to define the manifold Network of nearest neighbors can serve to approximate geodesics between any pairs of points P. Das, M. Moll, H. Stama:, L.E. Kavraki, & C.Clemen: Proc. Natl. Acad. Sci. USA 103, (2006)

27 Applica%on of nonlinear dimensionality reduc%on to SH3 folding dynamics ScIMAP Na%ve structure of SH3 protein P. Das, M. Moll, H. Stama:, L.E. Kavraki, & C.Clemen: Proc. Natl. Acad. Sci. USA 103, (2006)

28 Applica%on of nonlinear dimensionality reduc%on to SH3 folding dynamics Free energy as a func%on of the first collec%ve coordinate P. Das, M. Moll, H. Stama:, L.E. Kavraki, & C.Clemen: Proc. Natl. Acad. Sci. USA 103, (2006)

29 Applica%on of nonlinear dimensionality reduc%on to SH3 folding dynamics U unfolded state N na%ve state TS transi%on state P. Das, M. Moll, H. Stama:, L.E. Kavraki, & C.Clemen: Proc. Natl. Acad. Sci. USA 103, (2006)

30 The limits of Isomap Example: Polymer reversal inside a narrow pore Huang and Makarov, J. Chem. Phys (2008) Empirical reac%on coordinate: distance between the first bead and the last bead projected on the z direc%on, z = z N - z 1 Is z a good reac%on coordinate to describe the polymer reversal dynamics?

31 How can we es%mate the reversal rate? 1. Direct method ( experiment ) Measuring the wai%ng %me between reversal events: p(t) dt α e - kt dt 2. Transi%on state theory If we have an accurate free energy profile, we can es%mate the rate: Huang and Makarov, J. Chem. Phys (2008)

32 Transi%on state theory Trajectories from simula%ons Weighted Histogram Analysis Method (WHAM) to get free energy profile as a func%on of the reac%on coordinate Transi%on State Theory (TST) to get the reversal rate Huang and Makarov, J. Chem. Phys (2008)

33 Recrossing of the transi%on events TST overes%mates the rate constant k Transmission factor Langevin equa%on Kramers theory

34 Comparison TST with κ Kramer both overes%mate the rate constant k if reac%on coordinate is not well chosen Huang and Makarov, J. Chem. Phys (2008) Is z a good reac%on coordinate to describe the transloca%on of polymer inside the pore? No.

35 Rate constant as obtained by using the 1 st ISOMAP coordinate S%ll huge gap between ISOMAP and sta%s%cal results The geodesic distance is not the best way to describe the dynamics of polymer reversal inside the pore.

36 Sketch- map RMSD distribu%on: Short range matches gaussian noise and long range matches uniformly- distributed points for alanine- 12. Define a set of coordinates best preserve the distances in the medium range. Minimize the sum of differences between the sigmoid func%ons (F and f) of distances in high and low dimensional space. Cerio^, Tribello & Parrinello, Proc. Nat. Acad. Sci. (USA), 108, 13023, 2011

37 Similarity measure Mul%dimensional scaling: Euclidean distance (black) Isomap: Geodesic distance (purple) Diffusion map: Diffusion distance

38 Diffusion Map If the data {x} are obtained from the sampling of a diffusion process with a poten%al energy func%on E(x), the associated probability distribu%on p(x,t) is expected to sa%sfy the Fokker- Planck equa%on: A natural distance measure can be defined on the data It measures how easily x 0 and x 1 transform into each other RR Coifman, S Lafon, A Lee, M Maggioni, B Nadler, FJ Warner, and SW Zucker, Proc. of Natl. Acad. Sci. USA, 102, , 2005

39 Diffusion Map The Fokker- Planck equa%on has a discrete eigenvalue spectrum 0 = λ 0 < λ 1 < λ 2 < λ 3. Boltzmann distribu%on (equilibrium) > 0 eigenfunc%ons If there is a separa%on of %mescales: λ k << λ k+1 Diffusion distance = 0 GOOD REACTION COORDINATES

40 Diffusion Map A discrete approxima%on of these eigenvalues and eigenvectors can be obtained by considering the kernel: eigenvalues and eigenfunc%ons of M are the discrete approxima%on of R. R. Coifman, S. Lafon, A.B. Lee, M. Maggioni, B. Nadler, F. Warner, S.W. Zucker Geometric diffusions as a tool for harmonic analysis and structure defini:on of data: Diffusion maps Proc.Natl.Acad.Sci.USA 102(21) , 2005

41 Idea of local scale The spread of Gaussian distribu%on is different for different points, which tells the level of the flatness of the manifold. Within the local scale, the manifold should be approximately flat. Only noises in green circle. Curvature in blue circle. Jung, Licle and Maggioni, Proc. AAAI, (2009) Rohrdanz, Zheng, Maggioni and Clemen:. J Chem. Phys., 134(12), (2011)

42 Determina%on of the local scale Find the smallest local scale above the noise in which PCA captures the dynamics reasonably well. PCA on increasing scales Local Principle Component Analysis Local intrinsic dimensionality the k- th neighbor Noise Jung, Licle and Maggioni, Proc. AAAI, (2009) Rohrdanz, Zheng, Maggioni and Clemen:. J Chem. Phys., 134(12), (2011)

43 Locally scaled diffusion map (LSDMap) RMSD To extract Point- specific local scale from a discrete data set (i.e. molecular dynamics data) Rohrdanz, Zheng, Maggioni and Clemen:. J Chem. Phys., 134(12), (2011)

44 Diffusion eigenspectrum Sta%onary solu%on Pore radius large gap %mescale separa%on Zheng, Rohrdanz, Maggioni and Clemen:, J Chem. Phys., (2011)

45 Free energy landscape

46 Local PCA spectra Minimum Barrier

47 Local scales Different regions of the configura%on space have different local scales.

48 Local heterogenei%es Barrier: Large spectra gap, small intrinsic dimension, large local scale Minimum: Small spectra gap, large intrinsic dimension, small local scale

49 We test the goodness of the first diffusion coordinate as reac%on coordinate by es%ma%ng the rates From Kramers theory of escape rates we have: Free Energy Escape rate D(x) = diffusion coefficient, it s NOT a constant Reac%on Coordinate, x

50 Reversal rate Zheng, Rohrdanz, Maggioni and Clemen:, J Chem. Phys., (2011)

51 Z vs. 1 st DC Zheng, Rohrdanz, and Clementi, J Chem. Phys., 134, (2011) Z 1 st DC = 0 surface is different from z=0 surface. 1 st DC

52 Correla%on to the contact probabili%es z 1 st DC

53 LSDMap code h4p://sourceforge.net/projects/lsdmap in Fortran90 and MPI

54 Clemen& s group Dr. Mary Rohrdanz Dr. Jordane Preto Lorenzo Boninsegna Wenwei Zheng Fernando Yrazu Alex Kluber Amarda Shehu (now: GMU) Payel Das (now: IBM) Silvina Matysiak (now: U. Maryland) Brad Lambeth (now: Shell) Collaborators: Prof. Mauro Maggioni Miles Crosskey (Duke Math) $$ NSF CHE CHE $$ Welch Founda%on C- 1570

Non-linear Dimensionality Reduction

Non-linear Dimensionality Reduction Non-linear Dimensionality Reduction CE-725: Statistical Pattern Recognition Sharif University of Technology Spring 2013 Soleymani Outline Introduction Laplacian Eigenmaps Locally Linear Embedding (LLE)

More information

CSE 291. Assignment Spectral clustering versus k-means. Out: Wed May 23 Due: Wed Jun 13

CSE 291. Assignment Spectral clustering versus k-means. Out: Wed May 23 Due: Wed Jun 13 CSE 291. Assignment 3 Out: Wed May 23 Due: Wed Jun 13 3.1 Spectral clustering versus k-means Download the rings data set for this problem from the course web site. The data is stored in MATLAB format as

More information

Founda'ons of Large- Scale Mul'media Informa'on Management and Retrieval. Lecture #4 Similarity. Edward Chang

Founda'ons of Large- Scale Mul'media Informa'on Management and Retrieval. Lecture #4 Similarity. Edward Chang Founda'ons of Large- Scale Mul'media Informa'on Management and Retrieval Lecture #4 Similarity Edward Y. Chang Edward Chang Foundations of LSMM 1 Edward Chang Foundations of LSMM 2 Similar? Edward Chang

More information

Stable MMPI-2 Scoring: Introduction to Kernel. Extension Techniques

Stable MMPI-2 Scoring: Introduction to Kernel. Extension Techniques 1 Stable MMPI-2 Scoring: Introduction to Kernel Extension Techniques Liberty,E., Almagor,M., Zucker,S., Keller,Y., and Coifman,R.R. Abstract The current study introduces a new technique called Geometric

More information

Effective stochastic dynamics on a protein folding energy landscape

Effective stochastic dynamics on a protein folding energy landscape THE JOURNAL OF CHEMICAL PHYSICS 125, 054910 2006 Effective stochastic dynamics on a protein folding energy landscape Sichun Yang, a José N. Onuchic, b and Herbert Levine c Center for Theoretical Biological

More information

Low-Dimensional Free Energy Landscapes of Protein Folding Reactions by Nonlinear Dimensionality Reduction

Low-Dimensional Free Energy Landscapes of Protein Folding Reactions by Nonlinear Dimensionality Reduction Appeared in PNAS 103(26):9885-9890, 2006 Low-Dimensional Free Energy Landscapes of Protein Folding Reactions by Nonlinear Dimensionality Reduction Payel Das 1, Mark Moll 2, Hernan Stamati 2, Lydia E. Kavraki

More information

Using the computer to select the right variables

Using the computer to select the right variables Using the computer to select the right variables Rationale: Straight Line Distance Curved Transition Distance Actual transition difficulty represented by curved path Lake Carnegie, Princeton, NJ Using

More information

Computers in Chemistry as Educa3on and Research Tools. Bong June Sung Department of Chemistry Sogang University

Computers in Chemistry as Educa3on and Research Tools. Bong June Sung Department of Chemistry Sogang University Computers in Chemistry as Educa3on and Research Tools Bong June Sung Department of Chemistry Sogang University Computa3onal Chemistry Computa3onal chemistry is a branch of chemistry that employs computer

More information

Intrinsic Structure Study on Whale Vocalizations

Intrinsic Structure Study on Whale Vocalizations 1 2015 DCLDE Conference Intrinsic Structure Study on Whale Vocalizations Yin Xian 1, Xiaobai Sun 2, Yuan Zhang 3, Wenjing Liao 3 Doug Nowacek 1,4, Loren Nolte 1, Robert Calderbank 1,2,3 1 Department of

More information

Molecular Dynamics. Molecules in motion

Molecular Dynamics. Molecules in motion Molecular Dynamics Molecules in motion 1 Molecules in mo1on Molecules are not sta1c, but move all the 1me Source: h9p://en.wikipedia.org/wiki/kine1c_theory 2 Gasses, liquids and solids Gasses, liquids

More information

Contribution from: Springer Verlag Berlin Heidelberg 2005 ISBN

Contribution from: Springer Verlag Berlin Heidelberg 2005 ISBN Contribution from: Mathematical Physics Studies Vol. 7 Perspectives in Analysis Essays in Honor of Lennart Carleson s 75th Birthday Michael Benedicks, Peter W. Jones, Stanislav Smirnov (Eds.) Springer

More information

Physics 2203, Fall 2011 Modern Physics

Physics 2203, Fall 2011 Modern Physics Physics 2203, Fall 2011 Modern Physics. Friday, Nov. 2 nd, 2012. Energy levels in Nitrogen molecule Sta@s@cal Physics: Quantum sta@s@cs: Ch. 15 in our book. Notes from Ch. 10 in Serway Announcements Second

More information

Fields, wave and electromagne3c pulses. fields, waves <- > par0cles pulse <- > bunch (finite in 0me),

Fields, wave and electromagne3c pulses. fields, waves <- > par0cles pulse <- > bunch (finite in 0me), Fields, wave and electromagne3c pulses fields, waves par0cles pulse bunch (finite in 0me), 1 Op3cs ray or geometric op0cs: ABCD matrix, wave op0cs (used e.m. field to describe the op0cal field):

More information

Latent Dirichlet Alloca/on

Latent Dirichlet Alloca/on Latent Dirichlet Alloca/on Blei, Ng and Jordan ( 2002 ) Presented by Deepak Santhanam What is Latent Dirichlet Alloca/on? Genera/ve Model for collec/ons of discrete data Data generated by parameters which

More information

Computer Vision. Pa0ern Recogni4on Concepts Part I. Luis F. Teixeira MAP- i 2012/13

Computer Vision. Pa0ern Recogni4on Concepts Part I. Luis F. Teixeira MAP- i 2012/13 Computer Vision Pa0ern Recogni4on Concepts Part I Luis F. Teixeira MAP- i 2012/13 What is it? Pa0ern Recogni4on Many defini4ons in the literature The assignment of a physical object or event to one of

More information

Unsupervised dimensionality reduction

Unsupervised dimensionality reduction Unsupervised dimensionality reduction Guillaume Obozinski Ecole des Ponts - ParisTech SOCN course 2014 Guillaume Obozinski Unsupervised dimensionality reduction 1/30 Outline 1 PCA 2 Kernel PCA 3 Multidimensional

More information

Sampling the free energy surfaces of collective variables

Sampling the free energy surfaces of collective variables Sampling the free energy surfaces of collective variables Jérôme Hénin Enhanced Sampling and Free-Energy Calculations Urbana, 12 September 2018 Please interrupt! struct bioinform phys chem theoretical

More information

Lecture 12 The Level Set Approach for Turbulent Premixed Combus=on

Lecture 12 The Level Set Approach for Turbulent Premixed Combus=on Lecture 12 The Level Set Approach for Turbulent Premixed Combus=on 12.- 1 A model for premixed turbulent combus7on, based on the non- reac7ng scalar G rather than on progress variable, has been developed

More information

Statistical Pattern Recognition

Statistical Pattern Recognition Statistical Pattern Recognition Feature Extraction Hamid R. Rabiee Jafar Muhammadi, Alireza Ghasemi, Payam Siyari Spring 2014 http://ce.sharif.edu/courses/92-93/2/ce725-2/ Agenda Dimensionality Reduction

More information

A Principled Comparative Analysis of Dimensionality Reduction Techniques on Protein Structure Decoy Data

A Principled Comparative Analysis of Dimensionality Reduction Techniques on Protein Structure Decoy Data A Principled Comparative Analysis of Dimensionality Reduction Techniques on Protein Structure Decoy Data Rohan Pandit 1 Amarda Shehu 2,3,4, 1 Thomas Jefferson High School, Alexandria, VA 2 Dept. of Computer

More information

Networks. Can (John) Bruce Keck Founda7on Biotechnology Lab Bioinforma7cs Resource

Networks. Can (John) Bruce Keck Founda7on Biotechnology Lab Bioinforma7cs Resource Networks Can (John) Bruce Keck Founda7on Biotechnology Lab Bioinforma7cs Resource Networks in biology Protein-Protein Interaction Network of Yeast Transcriptional regulatory network of E.coli Experimental

More information

Dimension Reduction Techniques. Presented by Jie (Jerry) Yu

Dimension Reduction Techniques. Presented by Jie (Jerry) Yu Dimension Reduction Techniques Presented by Jie (Jerry) Yu Outline Problem Modeling Review of PCA and MDS Isomap Local Linear Embedding (LLE) Charting Background Advances in data collection and storage

More information

ISSN: (Online) Volume 3, Issue 5, May 2015 International Journal of Advance Research in Computer Science and Management Studies

ISSN: (Online) Volume 3, Issue 5, May 2015 International Journal of Advance Research in Computer Science and Management Studies ISSN: 2321-7782 (Online) Volume 3, Issue 5, May 2015 International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online at:

More information

Manifold Learning and it s application

Manifold Learning and it s application Manifold Learning and it s application Nandan Dubey SE367 Outline 1 Introduction Manifold Examples image as vector Importance Dimension Reduction Techniques 2 Linear Methods PCA Example MDS Perception

More information

Connection of Local Linear Embedding, ISOMAP, and Kernel Principal Component Analysis

Connection of Local Linear Embedding, ISOMAP, and Kernel Principal Component Analysis Connection of Local Linear Embedding, ISOMAP, and Kernel Principal Component Analysis Alvina Goh Vision Reading Group 13 October 2005 Connection of Local Linear Embedding, ISOMAP, and Kernel Principal

More information

Challenges and Approaches to Nanoscale Materials Characteriza5on at Atomic Resolu5on

Challenges and Approaches to Nanoscale Materials Characteriza5on at Atomic Resolu5on Challenges and Approaches to Nanoscale Materials Characteriza5on at Atomic Resolu5on Ralph G. Nuzzo G. L. Clark Professor of Chemistry and Professor of Materials Science and Engineering University of Illinois

More information

arxiv:cond-mat/ v1 [cond-mat.soft] 19 Mar 2001

arxiv:cond-mat/ v1 [cond-mat.soft] 19 Mar 2001 Modeling two-state cooperativity in protein folding Ke Fan, Jun Wang, and Wei Wang arxiv:cond-mat/0103385v1 [cond-mat.soft] 19 Mar 2001 National Laboratory of Solid State Microstructure and Department

More information

Estimating the Intrinsic Dimension of High-Dimensional Data Sets

Estimating the Intrinsic Dimension of High-Dimensional Data Sets Estimating the Intrinsic Dimension of High-Dimensional Data Sets Anna V. Little Department of Mathematics, Jacksonville University Collaborators: M. Maggioni (advisor, Duke University), L. Rosasco, Y.

More information

7. Quantum Monte Carlo (QMC)

7. Quantum Monte Carlo (QMC) Molecular Simulations with Chemical and Biological Applications (Part I) 7. Quantum Monte Carlo (QMC) Dr. Mar(n Steinhauser 1 HS 2014 Molecular Simula(ons with Chemical and Biological Applica(ons 1 Introduc5on

More information

Application of the Markov State Model to Molecular Dynamics of Biological Molecules. Speaker: Xun Sang-Ni Supervisor: Prof. Wu Dr.

Application of the Markov State Model to Molecular Dynamics of Biological Molecules. Speaker: Xun Sang-Ni Supervisor: Prof. Wu Dr. Application of the Markov State Model to Molecular Dynamics of Biological Molecules Speaker: Xun Sang-Ni Supervisor: Prof. Wu Dr. Jiang Introduction Conformational changes of proteins : essential part

More information

Energy Diagrams 2/20/16. Energy Diagrams are a plot of the reac=on steps, or Reac=on Coordinate (X- axis) versus the Energy (Kcal or KJ) A Review

Energy Diagrams 2/20/16. Energy Diagrams are a plot of the reac=on steps, or Reac=on Coordinate (X- axis) versus the Energy (Kcal or KJ) A Review Energy Diagrams Energy Diagrams are a plot of the reac=on steps, or eac=on Coordinate (X- axis) versus the Energy (Kcal or KJ) A eview In a spontaneous reac=on, the product(s) are more stable than the

More information

Temperature dependence of reactions with multiple pathways

Temperature dependence of reactions with multiple pathways PCCP Temperature dependence of reactions with multiple pathways Muhammad H. Zaman, ac Tobin R. Sosnick bc and R. Stephen Berry* ad a Department of Chemistry, The University of Chicago, Chicago, IL 60637,

More information

Global (ISOMAP) versus Local (LLE) Methods in Nonlinear Dimensionality Reduction

Global (ISOMAP) versus Local (LLE) Methods in Nonlinear Dimensionality Reduction Global (ISOMAP) versus Local (LLE) Methods in Nonlinear Dimensionality Reduction A presentation by Evan Ettinger on a Paper by Vin de Silva and Joshua B. Tenenbaum May 12, 2005 Outline Introduction The

More information

LECTURE NOTE #11 PROF. ALAN YUILLE

LECTURE NOTE #11 PROF. ALAN YUILLE LECTURE NOTE #11 PROF. ALAN YUILLE 1. NonLinear Dimension Reduction Spectral Methods. The basic idea is to assume that the data lies on a manifold/surface in D-dimensional space, see figure (1) Perform

More information

A mul&scale autocorrela&on func&on for anisotropy studies

A mul&scale autocorrela&on func&on for anisotropy studies A mul&scale autocorrela&on func&on for anisotropy studies Mario Scuderi 1, M. De Domenico, H Lyberis and A. Insolia 1 Department of Physics and Astronomy & INFN Catania University ITALY DAA2011 Erice,

More information

Nonlinear Dimensionality Reduction

Nonlinear Dimensionality Reduction Outline Hong Chang Institute of Computing Technology, Chinese Academy of Sciences Machine Learning Methods (Fall 2012) Outline Outline I 1 Kernel PCA 2 Isomap 3 Locally Linear Embedding 4 Laplacian Eigenmap

More information

All-atom Molecular Mechanics. Trent E. Balius AMS 535 / CHE /27/2010

All-atom Molecular Mechanics. Trent E. Balius AMS 535 / CHE /27/2010 All-atom Molecular Mechanics Trent E. Balius AMS 535 / CHE 535 09/27/2010 Outline Molecular models Molecular mechanics Force Fields Potential energy function functional form parameters and parameterization

More information

March 13, Paper: R.R. Coifman, S. Lafon, Diffusion maps ([Coifman06]) Seminar: Learning with Graphs, Prof. Hein, Saarland University

March 13, Paper: R.R. Coifman, S. Lafon, Diffusion maps ([Coifman06]) Seminar: Learning with Graphs, Prof. Hein, Saarland University Kernels March 13, 2008 Paper: R.R. Coifman, S. Lafon, maps ([Coifman06]) Seminar: Learning with Graphs, Prof. Hein, Saarland University Kernels Figure: Example Application from [LafonWWW] meaningful geometric

More information

Density Functional Theory: from theory to Applications

Density Functional Theory: from theory to Applications Density Functional Theory: from theory to Applications Uni Mainz May 27, 2012 Large barrier-activated processes time-dependent bias potential extended Lagrangian formalism Basic idea: during the MD dynamics

More information

Statistical Learning. Dong Liu. Dept. EEIS, USTC

Statistical Learning. Dong Liu. Dept. EEIS, USTC Statistical Learning Dong Liu Dept. EEIS, USTC Chapter 6. Unsupervised and Semi-Supervised Learning 1. Unsupervised learning 2. k-means 3. Gaussian mixture model 4. Other approaches to clustering 5. Principle

More information

Protein Folding Prof. Eugene Shakhnovich

Protein Folding Prof. Eugene Shakhnovich Protein Folding Eugene Shakhnovich Department of Chemistry and Chemical Biology Harvard University 1 Proteins are folded on various scales As of now we know hundreds of thousands of sequences (Swissprot)

More information

Machine learning collective variable discovery for materials design and engineering

Machine learning collective variable discovery for materials design and engineering Machine learning collective variable discovery for materials design and engineering x 1 CV 1 ˆ x 1 y 1 ˆ y 1 z 1 CV 2 ˆ z 1 input hidden bottleneck hidden output CICS Seminar, University of Notre Dame

More information

BIOINF 4371 Drug Design 1 Oliver Kohlbacher & Jens Krüger

BIOINF 4371 Drug Design 1 Oliver Kohlbacher & Jens Krüger BIOINF 4371 Drug Design 1 Oliver Kohlbacher & Jens Krüger Winter 2013/2014 11. Docking Part IV: Receptor Flexibility Overview Receptor flexibility Types of flexibility Implica5ons for docking Examples

More information

Theore&cal Study of Adsorp&on in SIFSIX- 3- Zn Type Porous Materials

Theore&cal Study of Adsorp&on in SIFSIX- 3- Zn Type Porous Materials Theore&cal Study of Adsorp&on in SIFSIX- 3- Zn Type Porous Materials Ahmad Ziaee* 1,2, Drahomir Chovan 1,2, Michael Zaworotko 2,3 and Syed A.M. Tofail 1,2 1 Department of Physics and Energy 2 Materials

More information

Application of Computer in Chemistry SSC Prof. Mohamed Noor Hasan Dr. Hasmerya Maarof Department of Chemistry

Application of Computer in Chemistry SSC Prof. Mohamed Noor Hasan Dr. Hasmerya Maarof Department of Chemistry Application of Computer in Chemistry SSC 3533 Prof. Mohamed Noor Hasan Dr. Hasmerya Maarof Department of Chemistry Outline Fields of applica:on Examples Types of computer Programming languages 2 Introduction

More information

Unsupervised Learning Techniques Class 07, 1 March 2006 Andrea Caponnetto

Unsupervised Learning Techniques Class 07, 1 March 2006 Andrea Caponnetto Unsupervised Learning Techniques 9.520 Class 07, 1 March 2006 Andrea Caponnetto About this class Goal To introduce some methods for unsupervised learning: Gaussian Mixtures, K-Means, ISOMAP, HLLE, Laplacian

More information

From graph to manifold Laplacian: The convergence rate

From graph to manifold Laplacian: The convergence rate Appl. Comput. Harmon. Anal. 2 (2006) 28 34 www.elsevier.com/locate/acha Letter to the Editor From graph to manifold Laplacian: The convergence rate A. Singer Department of athematics, Yale University,

More information

Regression on Manifolds Using Kernel Dimension Reduction

Regression on Manifolds Using Kernel Dimension Reduction Jens Nilsson JENSN@MATHS.LTH.SE Centre for Mathematical Sciences, Lund University, Box 118, SE-221 00 Lund, Sweden Fei Sha FEISHA@CS.BERKELEY.EDU Computer Science Division, University of California, Berkeley,

More information

Nonequilibrium Response in a Model for Sensory Adapta8on

Nonequilibrium Response in a Model for Sensory Adapta8on The 7 th KIAS Conference on Sta8s8cal Physics, 4-7 July 2016 Nonequilibrium Sta8s8cal Physics of Complex Systems Nonequilibrium Response in a Model for Sensory Adapta8on Shouwen Wang and Lei- Han Tang

More information

Visualizing folding of proteins (1 3) and RNA (2) in terms of

Visualizing folding of proteins (1 3) and RNA (2) in terms of Can energy landscape roughness of proteins and RNA be measured by using mechanical unfolding experiments? Changbong Hyeon and D. Thirumalai Chemical Physics Program, Institute for Physical Science and

More information

Graphs, Geometry and Semi-supervised Learning

Graphs, Geometry and Semi-supervised Learning Graphs, Geometry and Semi-supervised Learning Mikhail Belkin The Ohio State University, Dept of Computer Science and Engineering and Dept of Statistics Collaborators: Partha Niyogi, Vikas Sindhwani In

More information

RG Flows, Entanglement & Holography Workshop. Michigan Center for Theore0cal Physics September 17 21, 2012

RG Flows, Entanglement & Holography Workshop. Michigan Center for Theore0cal Physics September 17 21, 2012 RG Flows, Entanglement & Holography Workshop Michigan Center for Theore0cal Physics September 17 21, 2012 Intersec0ons in Theore0cal Physics: Par$cle Physics Sta$s$cal Mechanics Renormalization Group Flows

More information

Lecture 17: Face Recogni2on

Lecture 17: Face Recogni2on Lecture 17: Face Recogni2on Dr. Juan Carlos Niebles Stanford AI Lab Professor Fei-Fei Li Stanford Vision Lab Lecture 17-1! What we will learn today Introduc2on to face recogni2on Principal Component Analysis

More information

Slides modified from: PATTERN RECOGNITION AND MACHINE LEARNING CHRISTOPHER M. BISHOP

Slides modified from: PATTERN RECOGNITION AND MACHINE LEARNING CHRISTOPHER M. BISHOP Slides modified from: PATTERN RECOGNITION AND MACHINE LEARNING CHRISTOPHER M. BISHOP Predic?ve Distribu?on (1) Predict t for new values of x by integra?ng over w: where The Evidence Approxima?on (1) The

More information

A characteriza+on/reliability oriented simula+on framework modeling charge transport and degrada+on in dielectric stacks

A characteriza+on/reliability oriented simula+on framework modeling charge transport and degrada+on in dielectric stacks A characteriza+on/reliability oriented simula+on framework modeling charge transport and degrada+on in dielectric stacks Luca Larcher University of Modena and Reggio Emilia MDLab Italy Outline Simula=on

More information

Focus was on solving matrix inversion problems Now we look at other properties of matrices Useful when A represents a transformations.

Focus was on solving matrix inversion problems Now we look at other properties of matrices Useful when A represents a transformations. Previously Focus was on solving matrix inversion problems Now we look at other properties of matrices Useful when A represents a transformations y = Ax Or A simply represents data Notion of eigenvectors,

More information

Lecture 3: Compressive Classification

Lecture 3: Compressive Classification Lecture 3: Compressive Classification Richard Baraniuk Rice University dsp.rice.edu/cs Compressive Sampling Signal Sparsity wideband signal samples large Gabor (TF) coefficients Fourier matrix Compressive

More information

Nonlinear Dimensionality Reduction

Nonlinear Dimensionality Reduction Nonlinear Dimensionality Reduction Piyush Rai CS5350/6350: Machine Learning October 25, 2011 Recap: Linear Dimensionality Reduction Linear Dimensionality Reduction: Based on a linear projection of the

More information

The kinetics of protein folding is often remarkably simple. For

The kinetics of protein folding is often remarkably simple. For Fast protein folding kinetics Jack Schonbrun* and Ken A. Dill *Graduate Group in Biophysics and Department of Pharmaceutical Chemistry, University of California, San Francisco, CA 94118 Communicated by

More information

Nonlinear Methods. Data often lies on or near a nonlinear low-dimensional curve aka manifold.

Nonlinear Methods. Data often lies on or near a nonlinear low-dimensional curve aka manifold. Nonlinear Methods Data often lies on or near a nonlinear low-dimensional curve aka manifold. 27 Laplacian Eigenmaps Linear methods Lower-dimensional linear projection that preserves distances between all

More information

Nuggets on coarse- graining and mul,scale computa,onal schemes Maria Fyta Ins,tut für Computerphysik, Universität Stu<gart Stu<gart, Germany

Nuggets on coarse- graining and mul,scale computa,onal schemes Maria Fyta Ins,tut für Computerphysik, Universität Stu<gart Stu<gart, Germany Nuggets on coarse- graining and mul,scale computa,onal schemes Maria Fyta Ins,tut für Computerphysik, Universität Stu

More information

EECS 275 Matrix Computation

EECS 275 Matrix Computation EECS 275 Matrix Computation Ming-Hsuan Yang Electrical Engineering and Computer Science University of California at Merced Merced, CA 95344 http://faculty.ucmerced.edu/mhyang Lecture 23 1 / 27 Overview

More information

macroscopic view (phenomenological) microscopic view (atomistic) computing reaction rate rate of reactions experiments thermodynamics

macroscopic view (phenomenological) microscopic view (atomistic) computing reaction rate rate of reactions experiments thermodynamics Rate Theory (overview) macroscopic view (phenomenological) rate of reactions experiments thermodynamics Van t Hoff & Arrhenius equation microscopic view (atomistic) statistical mechanics transition state

More information

Learning Eigenfunctions: Links with Spectral Clustering and Kernel PCA

Learning Eigenfunctions: Links with Spectral Clustering and Kernel PCA Learning Eigenfunctions: Links with Spectral Clustering and Kernel PCA Yoshua Bengio Pascal Vincent Jean-François Paiement University of Montreal April 2, Snowbird Learning 2003 Learning Modal Structures

More information

Diffusion/Inference geometries of data features, situational awareness and visualization. Ronald R Coifman Mathematics Yale University

Diffusion/Inference geometries of data features, situational awareness and visualization. Ronald R Coifman Mathematics Yale University Diffusion/Inference geometries of data features, situational awareness and visualization Ronald R Coifman Mathematics Yale University Digital data is generally converted to point clouds in high dimensional

More information

Apprentissage non supervisée

Apprentissage non supervisée Apprentissage non supervisée Cours 3 Higher dimensions Jairo Cugliari Master ECD 2015-2016 From low to high dimension Density estimation Histograms and KDE Calibration can be done automacally But! Let

More information

Beyond Scalar Affinities for Network Analysis or Vector Diffusion Maps and the Connection Laplacian

Beyond Scalar Affinities for Network Analysis or Vector Diffusion Maps and the Connection Laplacian Beyond Scalar Affinities for Network Analysis or Vector Diffusion Maps and the Connection Laplacian Amit Singer Princeton University Department of Mathematics and Program in Applied and Computational Mathematics

More information

Electronic Structure Calcula/ons: Density Func/onal Theory. and. Predic/ng Cataly/c Ac/vity

Electronic Structure Calcula/ons: Density Func/onal Theory. and. Predic/ng Cataly/c Ac/vity Electronic Structure Calcula/ons: Density Func/onal Theory and Predic/ng Cataly/c Ac/vity Review Examples of experiments that do not agree with classical physics: Photoelectric effect Photons and the quan/za/on

More information

Black holes in Einstein s gravity and beyond

Black holes in Einstein s gravity and beyond Black holes in Einstein s gravity and beyond Andrei Starinets Rudolf Peierls Centre for Theore=cal Physics University of Oxford 20 September 2014 Outline Gravity and the metric Einstein s equa=ons Symmetry

More information

Unsupervised Learning: K- Means & PCA

Unsupervised Learning: K- Means & PCA Unsupervised Learning: K- Means & PCA Unsupervised Learning Supervised learning used labeled data pairs (x, y) to learn a func>on f : X Y But, what if we don t have labels? No labels = unsupervised learning

More information

On single nucleon wave func0ons. Igal Talmi The Weizman Ins0tute of Science Rehovot, Israel

On single nucleon wave func0ons. Igal Talmi The Weizman Ins0tute of Science Rehovot, Israel On single nucleon wave func0ons Igal Talmi The Weizman Ins0tute of Science Rehovot, Israel The main success of the nuclear shell model Nuclei with magic numbers of protons and neutrons exhibit extra stability.

More information

Future foam: Non trivial topology from bubble collisions in eternal infla9on

Future foam: Non trivial topology from bubble collisions in eternal infla9on Future foam: Non trivial topology from bubble collisions in eternal infla9on Yasuhiro Sekino (Okayama Ins9tute for Quantum Physics) Based on R. Bousso, B. Freivogel, Y. Sekino, S. Shenker, L. Susskind,

More information

Class Notes. Examining Repeated Measures Data on Individuals

Class Notes. Examining Repeated Measures Data on Individuals Ronald Heck Week 12: Class Notes 1 Class Notes Examining Repeated Measures Data on Individuals Generalized linear mixed models (GLMM) also provide a means of incorporang longitudinal designs with categorical

More information

Protein Folding by Robotics

Protein Folding by Robotics Protein Folding by Robotics 1 TBI Winterseminar 2006 February 21, 2006 Protein Folding by Robotics 1 TBI Winterseminar 2006 February 21, 2006 Protein Folding by Robotics Probabilistic Roadmap Planning

More information

CSCI 360 Introduc/on to Ar/ficial Intelligence Week 2: Problem Solving and Op/miza/on

CSCI 360 Introduc/on to Ar/ficial Intelligence Week 2: Problem Solving and Op/miza/on CSCI 360 Introduc/on to Ar/ficial Intelligence Week 2: Problem Solving and Op/miza/on Professor Wei-Min Shen Week 13.1 and 13.2 1 Status Check Extra credits? Announcement Evalua/on process will start soon

More information

Robust Laplacian Eigenmaps Using Global Information

Robust Laplacian Eigenmaps Using Global Information Manifold Learning and its Applications: Papers from the AAAI Fall Symposium (FS-9-) Robust Laplacian Eigenmaps Using Global Information Shounak Roychowdhury ECE University of Texas at Austin, Austin, TX

More information

Recovering Kinetics from a Simplified Protein Folding Model Using Replica Exchange Simulations: A Kinetic Network and Effective Stochastic Dynamics

Recovering Kinetics from a Simplified Protein Folding Model Using Replica Exchange Simulations: A Kinetic Network and Effective Stochastic Dynamics 11702 J. Phys. Chem. B 2009, 113, 11702 11709 Recovering Kinetics from a Simplified Protein Folding Model Using Replica Exchange Simulations: A Kinetic Network and Effective Stochastic Dynamics Weihua

More information

Graph Metrics and Dimension Reduction

Graph Metrics and Dimension Reduction Graph Metrics and Dimension Reduction Minh Tang 1 Michael Trosset 2 1 Applied Mathematics and Statistics The Johns Hopkins University 2 Department of Statistics Indiana University, Bloomington November

More information

Building your toolbelt

Building your toolbelt Building your toolbelt Using math to make meaning in the physical world. Dimensional analysis Func;onal dependence / scaling Special cases / limi;ng cases Reading the physics in the representa;on (graphs)

More information

Distance Metric Learning in Data Mining (Part II) Fei Wang and Jimeng Sun IBM TJ Watson Research Center

Distance Metric Learning in Data Mining (Part II) Fei Wang and Jimeng Sun IBM TJ Watson Research Center Distance Metric Learning in Data Mining (Part II) Fei Wang and Jimeng Sun IBM TJ Watson Research Center 1 Outline Part I - Applications Motivation and Introduction Patient similarity application Part II

More information

The Kramers problem and first passage times.

The Kramers problem and first passage times. Chapter 8 The Kramers problem and first passage times. The Kramers problem is to find the rate at which a Brownian particle escapes from a potential well over a potential barrier. One method of attack

More information

Face Recognition Using Laplacianfaces He et al. (IEEE Trans PAMI, 2005) presented by Hassan A. Kingravi

Face Recognition Using Laplacianfaces He et al. (IEEE Trans PAMI, 2005) presented by Hassan A. Kingravi Face Recognition Using Laplacianfaces He et al. (IEEE Trans PAMI, 2005) presented by Hassan A. Kingravi Overview Introduction Linear Methods for Dimensionality Reduction Nonlinear Methods and Manifold

More information

ECE Unit 4. Realizable system used to approximate the ideal system is shown below: Figure 4.47 (b) Digital Processing of Analog Signals

ECE Unit 4. Realizable system used to approximate the ideal system is shown below: Figure 4.47 (b) Digital Processing of Analog Signals ECE 8440 - Unit 4 Digital Processing of Analog Signals- - Non- Ideal Case (See sec8on 4.8) Before considering the non- ideal case, recall the ideal case: 1 Assump8ons involved in ideal case: - no aliasing

More information

Energy Barriers and Rates - Transition State Theory for Physicists

Energy Barriers and Rates - Transition State Theory for Physicists Energy Barriers and Rates - Transition State Theory for Physicists Daniel C. Elton October 12, 2013 Useful relations 1 cal = 4.184 J 1 kcal mole 1 = 0.0434 ev per particle 1 kj mole 1 = 0.0104 ev per particle

More information

Black Hole Entropy and Thermodynamics of Space=me

Black Hole Entropy and Thermodynamics of Space=me Black Hole Entropy and Thermodynamics of Space=me Ted Jacobson University of Maryland 1 Sadi Carnot How does the maximum efficiency of steam engines depend on the working fluid? Carnot: given the temperatures

More information

Nonlinear Manifold Learning Summary

Nonlinear Manifold Learning Summary Nonlinear Manifold Learning 6.454 Summary Alexander Ihler ihler@mit.edu October 6, 2003 Abstract Manifold learning is the process of estimating a low-dimensional structure which underlies a collection

More information

CSCI 360 Introduc/on to Ar/ficial Intelligence Week 2: Problem Solving and Op/miza/on. Instructor: Wei-Min Shen

CSCI 360 Introduc/on to Ar/ficial Intelligence Week 2: Problem Solving and Op/miza/on. Instructor: Wei-Min Shen CSCI 360 Introduc/on to Ar/ficial Intelligence Week 2: Problem Solving and Op/miza/on Instructor: Wei-Min Shen Today s Lecture Search Techniques (review & con/nue) Op/miza/on Techniques Home Work 1: descrip/on

More information

CSCI1950 Z Computa3onal Methods for Biology Lecture 24. Ben Raphael April 29, hgp://cs.brown.edu/courses/csci1950 z/ Network Mo3fs

CSCI1950 Z Computa3onal Methods for Biology Lecture 24. Ben Raphael April 29, hgp://cs.brown.edu/courses/csci1950 z/ Network Mo3fs CSCI1950 Z Computa3onal Methods for Biology Lecture 24 Ben Raphael April 29, 2009 hgp://cs.brown.edu/courses/csci1950 z/ Network Mo3fs Subnetworks with more occurrences than expected by chance. How to

More information

arxiv:cond-mat/ v1 [cond-mat.soft] 5 May 1998

arxiv:cond-mat/ v1 [cond-mat.soft] 5 May 1998 Linking Rates of Folding in Lattice Models of Proteins with Underlying Thermodynamic Characteristics arxiv:cond-mat/9805061v1 [cond-mat.soft] 5 May 1998 D.K.Klimov and D.Thirumalai Institute for Physical

More information

Lecture 17: Face Recogni2on

Lecture 17: Face Recogni2on Lecture 17: Face Recogni2on Dr. Juan Carlos Niebles Stanford AI Lab Professor Fei-Fei Li Stanford Vision Lab Lecture 17-1! What we will learn today Introduc2on to face recogni2on Principal Component Analysis

More information

Ensemble refinement of protein crystal structures in PHENIX. Tom Burnley Piet Gros

Ensemble refinement of protein crystal structures in PHENIX. Tom Burnley Piet Gros Ensemble refinement of protein crystal structures in PHENIX Tom Burnley Piet Gros Incomplete modelling of disorder contributes to R factor gap Only ~5% of residues in the PDB are modelled with more than

More information

Dimension Reduction and Low-dimensional Embedding

Dimension Reduction and Low-dimensional Embedding Dimension Reduction and Low-dimensional Embedding Ying Wu Electrical Engineering and Computer Science Northwestern University Evanston, IL 60208 http://www.eecs.northwestern.edu/~yingwu 1/26 Dimension

More information

Effec%ve Field Theory of Dissipa%ve Fluids

Effec%ve Field Theory of Dissipa%ve Fluids Effec%ve Field Theory of Dissipa%ve Fluids Hong Liu Paolo Glorioso arxiv: 1511.03646 and to appear Michael Crossley Conserved quan%%es and hydrodynamics Consider long wavelength disturbances of a system

More information

Multiscale bi-harmonic Analysis of Digital Data Bases and Earth moving distances.

Multiscale bi-harmonic Analysis of Digital Data Bases and Earth moving distances. Multiscale bi-harmonic Analysis of Digital Data Bases and Earth moving distances. R. Coifman, Department of Mathematics, program of Applied Mathematics Yale University Joint work with M. Gavish and W.

More information

Sta$s$cal sequence recogni$on

Sta$s$cal sequence recogni$on Sta$s$cal sequence recogni$on Determinis$c sequence recogni$on Last $me, temporal integra$on of local distances via DP Integrates local matches over $me Normalizes $me varia$ons For cts speech, segments

More information

Major Concepts Langevin Equation

Major Concepts Langevin Equation Major Concepts Langevin Equation Model for a tagged subsystem in a solvent Harmonic bath with temperature, T Friction & Correlated forces (FDR) Markovian/Ohmic vs. Memory Chemical Kinetics Master equation

More information

Velocity damping and fragmenta2on in non- central intermediate energy heavy- ion collisions

Velocity damping and fragmenta2on in non- central intermediate energy heavy- ion collisions Degree of equilibra2on depends on contact 2me and gradient of poten2al driving equilibra2on Velocity damping and fragmenta2on in non- central intermediate energy heavy- ion collisions 112,124 Sn + 112,124

More information

Modelling of Equipment, Processes, and Systems

Modelling of Equipment, Processes, and Systems 1 Modelling of Equipment, Processes, and Systems 2 Modelling Tools Simple Programs Spreadsheet tools like Excel Mathema7cal Tools MatLab, Mathcad, Maple, and Mathema7ca Special Purpose Codes Macroflow,

More information

Many proteins spontaneously refold into native form in vitro with high fidelity and high speed.

Many proteins spontaneously refold into native form in vitro with high fidelity and high speed. Macromolecular Processes 20. Protein Folding Composed of 50 500 amino acids linked in 1D sequence by the polypeptide backbone The amino acid physical and chemical properties of the 20 amino acids dictate

More information

Diffusion Wavelets for multiscale analysis on manifolds and graphs: constructions and applications

Diffusion Wavelets for multiscale analysis on manifolds and graphs: constructions and applications Diffusion Wavelets for multiscale analysis on manifolds and graphs: constructions and applications Mauro Maggioni EPFL, Lausanne Dec. 19th, 5 EPFL Multiscale Analysis and Diffusion Wavelets - Mauro Maggioni,

More information