Interatomic potentials with error bars. Gábor Csányi Engineering Laboratory

Similar documents
Machine learning the Born-Oppenheimer potential energy surface: from molecules to materials. Gábor Csányi Engineering Laboratory

Accuracy and transferability of GAP models for tungsten

Potentials, periodicity

From Atoms to Materials: Predictive Theory and Simulations

2. Surface geometric and electronic structure: a primer

Interatomic Potentials. The electronic-structure problem

Reactive potentials and applications

Bond-order potential for molybdenum: Application to dislocation behavior

Set the initial conditions r i. Update neighborlist. r i. Get new forces F i. Solve the equations of motion numerically over time step t : r i t n

Modèle de liaisons fortes au 4ème moment pour traiter l ordre-désordre dans les alliages

COMP 551 Applied Machine Learning Lecture 20: Gaussian processes

Material Surfaces, Grain Boundaries and Interfaces: Structure-Property Relationship Predictions

GWAS V: Gaussian processes

Force Fields in Molecular Mechanics

Classical potentials for metals

Lecture contents. A few concepts from Quantum Mechanics. Tight-binding model Solid state physics review

Gaussian Approximation Potentials: a brief tutorial introduction

Comparisons of DFT-MD, TB- MD and classical MD calculations of radiation damage and plasmawallinteractions

A neural network representation of the potential energy surface in Si- and Si-Li systems

Cheng Soon Ong & Christian Walder. Canberra February June 2018

Gaussian Processes. Le Song. Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012

Construction of standardized neural network-based interatomic models for structure prediction acceleration

The Kernel Trick, Gram Matrices, and Feature Extraction. CS6787 Lecture 4 Fall 2017

Learning Gaussian Process Models from Uncertain Data

Gaussian Processes (10/16/13)

Tokamak profile database construction incorporating Gaussian process regression

10-701/ Recitation : Kernels

Announcements. Proposals graded

Rustam Z. Khaliullin University of Zürich

Neutron inverse kinetics via Gaussian Processes

COMP 551 Applied Machine Learning Lecture 21: Bayesian optimisation

References in the Supporting Information:

Environment-dependent interatomic potential for bulk silicon

Mark Gales October y (x) x 1. x 2 y (x) Inputs. Outputs. x d. y (x) Second Output layer layer. layer.

Machine Learning. Kernels. Fall (Kernels, Kernelized Perceptron and SVM) Professor Liang Huang. (Chap. 12 of CIML)

Bayesian reconstruction of free energy profiles from umbrella samples

Everything starts with atomic structure and bonding

CSC2541 Lecture 2 Bayesian Occam s Razor and Gaussian Processes

An introduction to Molecular Dynamics. EMBO, June 2016

Lecture 9. Time series prediction

Document Version Publisher s PDF, also known as Version of Record (includes final page, issue and volume numbers)

ENERGY LANDSCAPE OF POINT DEFECTS IN BCC METALS

Bayesian Error Estimation in Density Functional Theory

Support Vector Machines

plane in a cubic unit cell. Clearly label the axes. (b) Draw two directions in the ( 112)

N = N A Pb A Pb. = ln N Q v kt. = kt ln v N

Non-Bayesian Classifiers Part II: Linear Discriminants and Support Vector Machines

Machine Learning. Bayesian Regression & Classification. Marc Toussaint U Stuttgart

Adaptive Multi-Modal Sensing of General Concealed Targets

what can deep learning learn from linear regression? Benjamin Recht University of California, Berkeley

Electronic Structure Theory for Periodic Systems: The Concepts. Christian Ratsch

Distribution-Free Distribution Regression

Indentation of Silicon: Phase Transition? Indentation of Silicon: Phase Transition?

Introduction to DFTB. Marcus Elstner. July 28, 2006

ATOMISTIC MODELING OF DIFFUSION IN ALUMINUM

A NEW EMBEDDED-ATOM METHOD INTERATOMIC POTENTIAL FOR TUNGSTEN-HYDROGEN SYSTEM

Neural Network Potentials

Computer Vision Group Prof. Daniel Cremers. 2. Regression (cont.)

Reliability Monitoring Using Log Gaussian Process Regression

Modeling Materials. Continuum, Atomistic and Multiscale Techniques. gg CAMBRIDGE ^0 TADMOR ELLAD B. HHHHM. University of Minnesota, USA

Machine learning comes from Bayesian decision theory in statistics. There we want to minimize the expected value of the loss function.

From Physical Modelling to Big Data Analytics: Examples and Challenges


Overview. Probabilistic Interpretation of Linear Regression Maximum Likelihood Estimation Bayesian Estimation MAP Estimation

Bayesian Inference: Principles and Practice 3. Sparse Bayesian Models and the Relevance Vector Machine

On the adaptive finite element analysis of the Kohn-Sham equations

Gaussian processes for inference in stochastic differential equations

Introduction to Gaussian Processes

Cheng Soon Ong & Christian Walder. Canberra February June 2018

Reactive Empirical Force Fields

Assessment of interatomic potentials for molecular dynamics simulations of GaAs deposition

COMS 4721: Machine Learning for Data Science Lecture 10, 2/21/2017

Geometry Optimisation

Prediction of double gene knockout measurements

VIRTUAL LATTICE TECHNIQUE AND THE INTERATOMIC POTENTIALS OF ZINC-BLEND-TYPE BINARY COMPOUNDS

MD simulation of methane in nanochannels

Statistical Data Mining and Machine Learning Hilary Term 2016

A Combined Ab Initio and Bond-Order Potentials Study of Cohesion in Iridium

The electronic structure of materials 1

Modeling thermal conductivity: a Green-Kubo approach

Representing High-Dimensional Potential-Energy Surfaces by Artificial Neural Networks

The Application of Extreme Learning Machine based on Gaussian Kernel in Image Classification

CHAPTER 4 IMPERFECTIONS IN SOLIDS PROBLEM SOLUTIONS

Gaussian with mean ( µ ) and standard deviation ( σ)

DEPARTMENT OF COMPUTER SCIENCE Autumn Semester MACHINE LEARNING AND ADAPTIVE INTELLIGENCE

Dept of Mechanical Engineering MIT Nanoengineering group

IAP 2006: From nano to macro: Introduction to atomistic modeling techniques and application in a case study of modeling fracture of copper (1.

CS598 Machine Learning in Computational Biology (Lecture 5: Matrix - part 2) Professor Jian Peng Teaching Assistant: Rongda Zhu

Probabilistic Reasoning in Deep Learning

Computer Vision Group Prof. Daniel Cremers. 10a. Markov Chain Monte Carlo

Nearest Neighbor. Machine Learning CSE546 Kevin Jamieson University of Washington. October 26, Kevin Jamieson 2

An EAM potential for the dynamical simulation of Ni-Al alloys

1 Bulk Simulations in a HCP lattice

Engineering Part IIB: Module 4F10 Statistical Pattern Processing Lecture 6: Multi-Layer Perceptrons I

Statistical Methods for Data Mining

MSE 102, Fall 2014 Midterm #2. Write your name here [10 points]:

Introduction. Chapter 1

Relevance of jamming to the mechanical properties of solids Sidney Nagel University of Chicago Capri; September 12, 2014

Multivariate statistical methods and data mining in particle physics

Gaussian Processes. 1 What problems can be solved by Gaussian Processes?

Transcription:

Interatomic potentials with error bars Gábor Csányi Engineering Laboratory

What makes a potential Ingredients Desirable properties Representation of atomic neighbourhood smoothness, faithfulness, continuity Interpolation of functions flexible but smooth functional form, few sensible parameters Database of configurations predictive power non-domain specific

What makes a potential Ingredients Desirable properties {Functional form Representation of atomic neighbourhood Interpolation of functions smoothness, faithfulness, continuity flexible but smooth functional form, few sensible parameters Database of configurations predictive power non-domain specific

Function fitting with basis functions Fit a function f(x) based on observations y {y i } at {x i } f(x) = NX i k(x i,x) e.g. k(x, x 0 )= we 2 x x0 2 /2 2 i=1

Function fitting with basis functions Fit a function f(x) based on observations y {y i } at {x i } f(x) = y j = NX i k(x i,x) e.g. k(x, x 0 )= we 2 x x0 2 /2 2 NX i k(x i,x j ) i=1 i=1

Function fitting with basis functions Fit a function f(x) based on observations y {y i } at {x i } f(x) = y j = NX i k(x i,x) e.g. k(x, x 0 )= we 2 x x0 2 /2 2 i=1 NX regularised fit: i k(x i,x j )+ 2 ij arbitrary σ,σw,σν i=1

Function fitting with basis functions Fit a function f(x) based on observations y {y i } at {x i } f(x) = y j = NX i=1 NX i=1 y =(K + = C 1 y i k(x i,x) i k(x i,x j )+ 2 ij 2 I) e.g. k(x, x 0 )= 2 we x x0 2 /2 2 regularised fit: arbitrary σ,σw,σν [K] ij k(x i,x j ) C K + 2 I k k(x i,x)

Function fitting with basis functions Fit a function f(x) based on observations y {y i } at {x i } f(x) = y j = NX i=1 NX i=1 y =(K + = C 1 y i k(x i,x) f(x) =k T C 1 y i k(x i,x j )+ 2 ij 2 I) e.g. k(x, x 0 )= 2 we x x0 2 /2 2 regularised fit: arbitrary σ,σw,σν [K] ij k(x i,x j ) C K + 2 I k k(x i,x)

Function fitting with basis functions Fit a function f(x) based on observations y {y i } at {x i } f(x) = y j = NX i=1 NX i=1 y =(K + = C 1 y i k(x i,x) i k(x i,x j )+ 2 ij 2 I) e.g. k(x, x 0 )= 2 we x x0 2 /2 2 regularised fit: arbitrary σ,σw,σν [K] ij k(x i,x j ) C K + 2 I k k(x i,x) f(x) =k T C 1 y

Machine learning framework: Kernel regression "(q (i) )= NX k k K q (i), q (k) Linear regression: K DP (q (i), q (k) )=q (i) q (k) Neural networks "(q (i) )= X j K NN q (i), q (k) = q (i) q (k) 2 + const. q (i) j NX k k q (k) j = q (i) Gaussian kernel K SE q (i), q (k) =exp X j (q (i) q (k) ) 2 2 2 j

1D example 1 f(x) GP splines 0.5 0 f(x) -0.5-1 1.5 2 2.5 3 3.5 4 r Gaussians basis functions are wide! f(x) =k T C 1 y

Samples from Gaussian posterior P (y N+1 y) / Normal(k T C 1 y,...) 2 0-2 -4-2 0 2 4

Samples from Gaussian posterior P (y N+1 y) / Normal(k T C 1 y,...) 2 0-2 -4-2 0 2 4

Samples from Gaussian posterior P (y N+1 y) / Normal(k T C 1 y,...) 2 0-2 -4-2 0 2 4

Samples from Gaussian posterior P (y N+1 y) / Normal(k T C 1 y,...) 2 0-2 -4-2 0 2 4

Samples from Gaussian posterior P (y N+1 y) / Normal(k T C 1 y,...) 2 0-2 -4-2 0 2 4

Generalised input data Often direct measurements of yi not available Consider any linear operator Compute P (y N+1 L(y) L(y) L can be built using Σ, @/@x etc Basis functions adapt to input data, e.g. for derivative data @k(x, x 0 )/@x Can deal with gradient data, sums, etc.

Traditional ideas for functional forms Pair potentials: Lennard-Jones, RDF-derived, etc. Three-body terms: Stillinger-Weber, MEAM, etc. " i = 1 2 X j V 2 ( r ij ) Embedded Atom (no angular dependence) Bond Order Potential (BOP) Tight-binding-derived attractive term with pair-potential repulsion ReaxFF: kitchen-sink + hundreds of parameters " i = P j ( r ij )

Traditional ideas for functional forms Pair potentials: Lennard-Jones, RDF-derived, etc. Three-body terms: Stillinger-Weber, MEAM, etc. " i = 1 2 X j V 2 ( r ij ) Embedded Atom (no angular dependence) Bond Order Potential (BOP) Tight-binding-derived attractive term with pair-potential repulsion ReaxFF: kitchen-sink + hundreds of parameters " i = P j ( r ij ) Representation is implicit

Traditional ideas for functional forms Pair potentials: Lennard-Jones, RDF-derived, etc. Three-body terms: Stillinger-Weber, MEAM, etc. " i = 1 2 X j V 2 ( r ij ) Embedded Atom (no angular dependence) Bond Order Potential (BOP) Tight-binding-derived attractive term with pair-potential repulsion ReaxFF: kitchen-sink + hundreds of parameters " i = P j ( r ij ) Representation is implicit These are NOT THE CORRECT functions. Limited accuracy, not systematic

Traditional ideas for functional forms Pair potentials: Lennard-Jones, RDF-derived, etc. Three-body terms: Stillinger-Weber, MEAM, etc. " i = 1 2 X j V 2 ( r ij ) Embedded Atom (no angular dependence) Bond Order Potential (BOP) Tight-binding-derived attractive term with pair-potential repulsion ReaxFF: kitchen-sink + hundreds of parameters " i = P j ( r ij ) Representation is implicit These are NOT THE CORRECT functions. Limited accuracy, not systematic given by GOAL: potentials based on quantum mechanics

Fundamental limitation of analytic functional form Energy [ev / atom] 7 6 5 4 3 Tungsten, Embedded Finnis-Sinclair Atom Model bcc fcc sc DFT bcc fcc sc Carbon (diamond) Brenner DFT 2 1 0-1 12 14 16 18 20 22 Volume [A 3 / atom] Elastic constants: Brenner 1061 133 736 717

Construct smooth similarity kernel directly cutoff: compact support

Construct smooth similarity kernel directly Overlap integral Z S( i, i 0)= i (r) i 0(r)dr, cutoff: compact support

Construct smooth similarity kernel directly Overlap integral Z S( i, i 0)= i (r) i 0(r)dr, Integrate over all 3D rotations: Z k( i, i 0)= S( i, ˆR i 0) 2 d ˆR = Z d ˆR cutoff: compact support Z i (r) i 0( ˆRr)dr 2

Construct smooth similarity kernel directly Overlap integral Z S( i, i 0)= i (r) i 0(r)dr, Integrate over all 3D rotations: Z k( i, i 0)= S( i, ˆR i 0) 2 d ˆR = Z d ˆR cutoff: compact support Z i (r) i 0( ˆRr)dr 2 After LOTS of algebra: X SOAP kernel k( i, i 0)= n,n 0,l p (i) nn 0 l p(i0 ) nn 0 l nl p nn 0 l = c nl c n 0 l

Construct smooth similarity kernel directly Overlap integral Z S( i, i 0)= i (r) i 0(r)dr, Integrate over all 3D rotations: Z k( i, i 0)= S( i, ˆR i 0) 2 d ˆR = Z d ˆR cutoff: compact support Z i (r) i 0( ˆRr)dr 2 After LOTS of algebra: X SOAP kernel k( i, i 0)= n,n 0,l p (i) nn 0 l p(i0 ) nn 0 l nl p nn 0 l = c nl c n 0 l

How to generate databases? Target applications: large systems Capability of full quantum mechanics (QM): small systems QM MD on representative small systems: sheared primitive cell elasticity large unit cell phonons surface unit cells surface energy gamma surfaces screw dislocation vacancy in small cell vacancy vacancy @ gamma surface vacancy near dislocation Iterative refinement 1. QM MD Initial database 2. Model MD 3. QM Revised database What is the acceptable validation protocol? How far can the domain of validity be extended?

Existing potentials for tungsten Error 15% 50% BOP MEAM FS DFT reference 0 C11 C12 C44 Elastic const. Vacancy energy (100) (110) (111) (112) Surface energy

Building up databases for tungsten (W)

Existing potentials for tungsten Error 15% 50% BOP MEAM FS DFT reference 0 C11 C12 C44 Elastic const. Vacancy energy (100) (110) (111) (112) Surface energy

Existing potentials for tungsten Error DFT reference 15% 0 50% GAP BOP MEAM FS C11 C12 C44 Elastic const. Vacancy energy (100) (110) (111) (112) Surface energy

Peierls barrier for screw dislocation glide

Vacancy-dislocation binding energy (~100,000 atoms in 3D simulation box)

He-Vacancy interaction in tungsten - preliminary Collaboration with Duc Nguyen-Manh (CCFE) Add He-W interaction on top of W potential 200 training configurations of He@V, 44 test energy error force error

He-Vacancy interaction in tungsten 2-body SOAP-GAP Needs more He data, other defects Do the same with H@W, with Takuji Oda

Self-aware potentials: predicting the error Bulk silicon: Diamond and Beta-tin phases Predicted error correctly signals where GAP is unreliable Energy Error

Outstanding problems Accuracy on database accuracy in properties? Database contents region of validity? Alloys - permutational complexity? Chemical variability? Systematic treatment of long range effects Electronic temperature