Before I entered the Texas Academy of Mathematics and Science as a junior, I had very little

Similar documents
Approximating the basis set dependence of coupled cluster calculations: Evaluation of perturbation theory approximations for stable molecules

Exercise 1: Structure and dipole moment of a small molecule

Solution of the Electronic Schrödinger Equation. Using Basis Sets to Solve the Electronic Schrödinger Equation with Electron Correlation

Lec20 Fri 3mar17

Chemistry 334 Part 2: Computational Quantum Chemistry

G1-3 These methods are based on ab initio molecular orbital calculations. Electron correlation is calculated using MP2 or MP4 and QCI.

Uptake of OH radical to aqueous aerosol: a computational study

Accurate multireference configuration interaction calculations on the lowest 1 and 3 electronic states of C 2,CN, BN, and BO

Basis sets for electron correlation

QUANTUM CHEMISTRY PROJECT 3: ATOMIC AND MOLECULAR STRUCTURE

Session 1. Introduction to Computational Chemistry. Computational (chemistry education) and/or (Computational chemistry) education

Jack Simons, Henry Eyring Scientist and Professor Chemistry Department University of Utah

Lec20 Wed 1mar17 update 3mar 10am

Ab initio calculations for potential energy surfaces. D. Talbi GRAAL- Montpellier

MP2 Basis Set Limit Binding Energy Estimates of Hydrogen-bonded Complexes from Extrapolation-oriented Basis Sets

Introduction to computational chemistry Exercise I: Structure and electronic energy of a small molecule. Vesa Hänninen

Theoretical determination of the heat of formation of methylene

Towards gas-phase accuracy for condensed phase problems

On the selection of domains and orbital pairs in local correlation treatments

THE PERFORMANCE OF DENSITY FUNCTIONALS WITH RESPECT TO THE CORRELATION CONSISTENT BASIS SETS. Xuelin Wang, B.S., M.S

FROM DEVILS TO MATHEMATICS. It was not a usual Sunday morning. My family and I were gathered at the kitchen table,

FEATURE ARTICLE. Ab Initio Calculation of Nonbonded Interactions: Are We There Yet? A. K. Rappé* and E. R. Bernstein*

Homologation of Boronic Esters with Organolithium Compounds: A Computational Assessment of Mechanism

Jack Simons, Henry Eyring Scientist and Professor Chemistry Department University of Utah

Introduction to Computational Chemistry

This is a very succinct primer intended as supplementary material for an undergraduate course in physical chemistry.

Benchmark calculations with correlated molecular wave functions

Lecture 5: More about one- Final words about the Hartree-Fock theory. First step above it by the Møller-Plesset perturbation theory.

Chemistry 2000 Lecture 1: Introduction to the molecular orbital theory

Comment on: Estimating the Hartree Fock limit from finite basis set calculations [Jensen F (2005) Theor Chem Acc 113:267]

Academic Affairs Assessment of Student Learning Report for Academic Year

r R A 1 r R B + 1 ψ(r) = αψ A (r)+βψ B (r) (5) where we assume that ψ A and ψ B are ground states: ψ A (r) = π 1/2 e r R A ψ B (r) = π 1/2 e r R B.

Computational Chemistry I

5.111 Lecture Summary #7 Wednesday, September 17, 2014

Computational Chemistry. An Introduction to Molecular Dynamic Simulations

Size-extensive wave functions for QMC A linear-scaling GVB approach

Theoretical study of the OH-initiated atmospheric oxidation mechanism. of perfluoro methyl vinyl ether, CF 3 OCF=CF 2

Electron Correlation Methods

T. Helgaker, Department of Chemistry, University of Oslo, Norway. T. Ruden, University of Oslo, Norway. W. Klopper, University of Karlsruhe, Germany

CHEMISTRY Topic #1: Atomic Structure and Nuclear Chemistry Fall 2017 Dr. Susan Findlay See Exercises 4.1 to 4.3

4 Post-Hartree Fock Methods: MPn and Configuration Interaction

High-level Quantum Chemistry Methods and Benchmark Datasets for Molecules

Highly Accurate Ab Initio Computation of Thermochemical Data

Computational Material Science Part II. Ito Chao ( ) Institute of Chemistry Academia Sinica

154 J. Chem. Phys. 108 (1), 1 January /98/108(1)/154/23/$ American Institute of Physics

Quantum Monte Carlo methods

Performance of Hartree-Fock and Correlated Methods

Chemistry. Faculty Kent Davis, chair; Pablo Hilario, Denise Lee-Haye, Marie Pak, Robert Wilson Departmental Office: 355 Chan Shun Hall; (707)

Chemistry 4560/5560 Molecular Modeling Fall 2014

Valence electronic structure of isopropyl iodide investigated by electron momentum spectroscopy. --- Influence of intramolecular interactions

Supporting Information

Electronic structure theory: Fundamentals to frontiers. 1. Hartree-Fock theory

Open-Shell Calculations

OVERVIEW OF QUANTUM CHEMISTRY METHODS

Computational Methods. Chem 561

Stuart Carter Department of Chemistry, University of Reading, Reading RG6 2AD, United Kingdom

FEATURE ARTICLE. Exploration of Basis Set Issues for Calculation of Intermolecular Interactions

1. It can help us decide which of several Lewis dot structures is closest to representing the properties of the real compound.

Chemistry. Faculty. Major Requirements for the Major in Chemistry

AN INTRODUCTION TO QUANTUM CHEMISTRY. Mark S. Gordon Iowa State University

Fully Automated Implementation of the Incremental Scheme: Application. to CCSD Energies for Hydrocarbons and Transition Metal Compounds

Chem 467 Supplement to Lecture 19 Hydrogen Atom, Atomic Orbitals

Methods for Treating Electron Correlation CHEM 430

Extrapolation of Atomic Natural Orbitals of basis set to complete basis set limit

Quantum Statistics (2)

Practical Advice for Quantum Chemistry Computations. C. David Sherrill School of Chemistry and Biochemistry Georgia Institute of Technology

An Introduction to Basis Sets:

List of Figures Page Figure No. Figure Caption No. Figure 1.1.

The Intersection of Chemistry and Biology: An Interview with Professor W. E. Moerner

Crystal Structure Prediction using CRYSTALG program

Energy levels of group 10 transition metal atoms and ions

QUANTUM CHEMISTRY PROJECT 3: PARTS B AND C

Density Functional Theory

Atomic orbitals of finite range as basis sets. Javier Junquera

(2 pts) a. What is the time-dependent Schrödinger Equation for a one-dimensional particle in the potential, V (x)?

Electron Correlation - Methods beyond Hartree-Fock

Atomic Theory. Atomic Theory. Atomic Theory. The Bohr Model 2/29/16

MITOCW ocw lec8

Divergence in Møller Plesset theory: A simple explanation based on a two-state model

Systematic ab initio calculations on the energetics and stability of covalent O 4

Density functional theory predictions of anharmonicity and spectroscopic constants for diatomic molecules

George Muller: Chapter 1 Questions

MOLECULAR STRUCTURE. Molecular Structure - B. Molecular Structure - B. Molecular Structure - B. Molecular Structure - B. Molecular Structure - B

CHEM ORGANIC CHEMISTRY

The Potential Energy Surface

Chemistry Publications

Chemistry Courses -1

Thermal Degradation of Polymers

Teaching By Courses in Sub-disciplines

Fragmentation methods

Beyond Hartree-Fock: MP2 and Coupled-Cluster Methods for Large Systems

Steven L. Mielke and Bruce C. Garrett Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richmond, Washington 99352

Close agreement between the orientation dependence of hydrogen bonds observed in protein structures and quantum mechanical calculations

Local correlation in the virtual space in multireference singles and doubles configuration interaction

Lecture 06 Lecture 4: Part II - Uncertainty principle

heat of formation of carbon dioxide

CHEM 344 Molecular Modeling

Basis Set for Molecular Orbital Theory

Lecture 3A 09/30/11. How many of you have read the sections assigned for the first two days of class?

Chemistry Physical Chemistry II Course Syllabus Spring 2019

Transcription:

Vinay Ramasesh Getting Involved with Research Before I entered the Texas Academy of Mathematics and Science as a junior, I had very little experience with actual scientific research. As a freshman, I d messed around with fractals and music, finding a few patterns and proving some interesting results, but overall, my scientific endeavors consisted mainly of building motors and simple mag-lev devices, and doing math problems. Luckily, that all changed when I came to TAMS. One of the great things about this school was, and still is, the amount of research opportunities that it provides. As a residential high school located on the University of North Texas campus, TAMS makes it very easy for its students to conduct research under UNT faculty members. Not only does living on campus make going in to the lab very convenient, but the University professors are eager to have bright high school students working in their labs. The summer before I entered TAMS, I knew I wanted to do research while I was there, but I had no idea what field to pursue. My main fear was that I wouldn t know nearly enough to contribute to any scientific research. As a newly-graduated sophomore, the only science classes I had taken were biology and chemistry, and I wasn t too comfortable with biology. My strength, I felt, was math, but again, I didn t know enough to conduct pure math research. I began to wonder how any high school student could expect to make a meaningful contribution to any professor s research. So, I looked at previous TAMS students and their work, to get a sense of what research would be like. Many projects caught my eye, and most were conducted in the lab of Dr. Angela Wilson, Professor of Chemistry at UNT. Her work was in computational quantum chemistry, and it combined computational work with high-level math. Not only did her

research seem interesting, but the projects TAMS students had done in the past seemed both challenging and doable. I sent Dr. Wilson an e-mail, requesting more information about her work, and her reply was encouraging. She gave me several links to websites with information about computational chemistry, and told me to contact her when the school year started if I was still interested. I set to work reading about computational chemistry. A lot of the sites she pointed me to had information that went well above my head, but I was able to understand the gist. Ab initio computational chemistry attempts to predict thermodynamic properties of molecules from first principles. In the molecular realm, first principles translates to quantum mechanics, in the form of the Schrödinger equation: Eφ x = ħ2 2m 2 φ x + V(x)φ x This famous equation relates the wavefunction, φ x, of an electron to the potential energy, E, of the molecular system. The wavefunction loosely corresponds to the probability distribution of an electron within a molecule, and the potential energy value is used as a starting point for other, more practical, calculations, such as those involving molecular enthalpies of formation. Unfortunately, this equation is so complex that it cannot be solved analytically for systems with more than one electron! This poses a bit of a problem, since the universe isn t composed solely of hydrogen. That s where computational chemistry comes in. It uses numerical approximation methods, similar to Euler s method or the Runge-Kutta algorithm for solving differential equations numerically. These approximation schemes, known as computational methods, are used in conjunction with basis sets, or sets of functions which approximate the electronic wavefunction. Currently, a variety of different computational methods and basis sets are used, with each

enjoying varied success with different types of molecules. One general trend is evident, however: as the size of the molecule we want to calculate the properties of increases, the computational expense of performing the calculation in terms of computer processor time, memory resources, and hard disk space increase very steeply. For example, the simplest computational method which takes into account electronic interactions, Second-Order Møller- Plesset Perturbation Theory (MP2), scales as N 5, where N is the number of basis functions in the system. This steep scaling of computational cost with system size is a major reason that the insights of computational chemistry cannot be applied to systems of biological interest, such as proteins or nucleotides. When I met with Dr. Wilson in the fall, she told me that this problem was one her group was looking to work on. They had recently created a new computational method, the Correlation- Consistent Composite Approach (ccca) which employs several existing methods working in conjunction to accurately predict molecular energies. One of their current goals was to extend the scope of the method to work on larger molecules, but to do this they needed to overcome the scaling problem. As Dr. Wilson explained to me, researchers had recently developed computational methods known as local methods, which made certain approximations to decrease the expense of working with larger molecules. In the description of space employed by local methods, electrons interact mainly with other electrons that occupy spatially close orbitals; with conventional methods, all electrons interact with each other. This approximation reduces the number of integrals the computational program has to evaluate, and thus reduces the computational cost. However, this also reduces the accuracy of local methods. A study by Werner, et al., [1], showed that the calculations made using Local Second-Order Møller-Plesset Perturbation

Theory, when compared to the conventional MP2 calculations, were only 97-98% accurate. For most molecules, this corresponds to an error in energy of about 10-15 kcal/mol. This error bound is largely unacceptable for high-accuracy studies. So, a first step in harnessing local methods for use with high-level studies is to find a way to increase their accuracy without simultaneously increasing their expense. This was the driving idea behind my work in Dr. Wilson s lab, although I was more generally involved in identifying aspects of the localization process that required further effort. The Main Work 1 Background 1.1 Local Methods Conventional methods employ a description of space in which electrons occupy molecular orbitals delocalized over the entire molecule. These orbitals increase in size with the system, leading to the high scaling of computational effort with system size. To circumvent this problem and reduce computational expense, local electron correlation methods, which use a localized description of electronic interactions, have been developed. The implementation scheme for the local methods considered here, introduced by Pulay [2] and implemented by Pulay and Saebø, employs local orbital spaces (called domains) rather than delocalized molecular orbitals (MOs). This orbital localization restricts the number of possible substituted configurations in the electronic wavefunction, only considering those substitutions which contribute significantly to the molecular energy. 1.2 Complete One-electron Basis Set Extrapolations Errors in quantum calculations arise from two sources intrinsic error of the method (or level of theory), and error due to imperfect basis sets. To eliminate basis set errors, it is desirable

to calculate the value of a property with a complete basis set. However, since such calculations are impossible, an approximation to this value must be used. Dunning has introduced the correlation-consistent polarized valence (cc-pvnz with n=d, T, Q, 5 ) basis sets [3], which are now available for a large portion of the periodic table. These sets are constructed by adding shells of functions to a core set of orbital functions. As the zeta level (n) of the basis set increases, the value of the property being calculated converges towards the one-electron complete basis set () limit. By extrapolating to the limit, the error coupling of the oneparticle basis set expansion and the n-particle electron correlation is eliminated, leaving only the intrinsic error of the trial wave function. Current extrapolation schemes are not exact functions, but only approximations to an as-yet unknown formula. 2 Goal I sought to discover whether local second second-order Møller-Plesset perturbation theory [LMP2] energies systematically converged to a limit and to discover how the accuracy of LMP2 with respect to canonical MP2 varied across basis set levels, including at the limit. I explored the convergence to the limit of LMP2 single-point energies verses the convergence of standard MP2 single-point energies, applying both methods to smaller hydrocarbons as well as a test set of large organic and organometallic molecules. I also tested three extrapolation schemes to see which yielded the best results when used with local methods. Finally, I investigated changes in the accuracy of LMP2 limit energies versus system size. 3 Methodology In order to study convergence trends of local and conventional MP2 methods for a wide variety of molecules, I selected a set of 24 relatively large systems from the 600-member training

set of Cioslowski and co-workers [4] that contain 18 or more atoms. These molecules include cyclic, poly-cyclic and non-cyclic alkanes, conjugated molecules, cage structures, and organosulfur and organometallic compounds. To this I added a set of 7 smaller hydrocarbons, to give a total test set of 31 molecules. I optimized the molecular structures with the B3LYP computational method at the ccpvtz basis set. Then, I ran single-point energy calculations on these optimized structures at ccpvdz, cc-pvtz, and cc-pvqz with both conventional MP2 and LMP2. I used the energies obtained with cc-pvdz, cc-pvtz, and cc-pvqz to calculate the limit energy for each molecule and method. I chose three widely-used extrapolation schemes to test: the simple exponential fit (-F) proposed by Xantheas and Dunning [5] and Feller [6], which uses the formula E( n) E Bexp( Cn), where n is the zeta level of the basis set; the mixed exponential/gaussian functional (-P) devised by Peterson et al. [7], which uses the formula E n E B n C n ; 2 ( ) exp[ ( 1)] exp[ ( 1) ] and the two-point Schwartz (-S4) 1/( l max 4 ) scheme [8], E( l ) E max ( l max B 1 4 ) 2, where l max is the value of the highest angular momentum function in the basis set (equal to the zeta-level of the basis set for the atoms used in this study). 4 Results

(1) As I increased the basis set level in the calculation, the errors in the LMP2 calculations decreased. On average, the LMP2/cc-pVQZ error was 42.68% lower than the LMP2/cc-pVTZ error, which was 24.69% lower than the LMP2/cc-pVDZ error. Stated otherwise, LMP2 energies are closer to the canonical MP2 values and thus more accurate at higher basis set levels. To confirm the reduction in correlation error with increasing basis set size (for most molecules), I ran single-point calculations on the smaller hydrocarbons (through butane) with ccpv5z. As expected, errors in LMP2 correlation energy are less with cc-pv5z than they are with cc-pvqz. On average, LMP2/cc-pV5Z correlation errors are 52.82% of the LMP2/cc-pVQZ errors, with similar values occurring for the domain-extended variants of LMP2. The cc-pv5z errors were never greater than either the cc-pvqz or cc-pvtz errors. This leads directly to the conclusion regarding limits below. (2) At the limit, LMP2 energy errors were lower than those at each individual basisset level. LMP2/-P always recovered more than 99.2% of the canonical MP2/-P energy, a vast improvement over the 97.1% seen at the cc-pvdz basis-set level. This implies that extrapolating LMP2 values to the limit lowers the energy differences between LMP2 and MP2, a conclusion consistent with the reduction of errors with increasing basis-set levels. Thus, extrapolations can significantly increase the accuracy of this local method. In the future, researchers can use this technique to obtain accurate potential energies of molecules for calculating enthalpies of formation and stable structures. LMP2/ recovers an average of 99.59% of the conventional limit correlation energy. The worst case occurred for bullvalene, in which only 99.21% was recovered. This is a vast improvement over the average of 97.53% seen at the cc-pvdz level, with the worst case of

96.40% occurring at this level for bicyclo[2.2.2]octane. As table 2 shows, by extrapolating to the limit, LMP2 correlation errors are reduced from an average of 15.81 kcal/mol at the ccpvdz level to an average of 3.59 kcal/mol at the limit. It is interesting to see a mathematical reason behind the reduction in errors at the limit. Solving the -P equations reveals that the -P energy is a linear combination of the cc-pvdz, cc-pvtz, and cc-pvqz energies: E 11 7 4 10 5 3 e e E e e E e e E D T Q e 11 7 4 10 5 3 So, the LMP2/-P error is a linear combination of the cc-pvdz, cc-pvtz, and cc-pvqz errors: E 11 7 4 10 5 3 e E e E e e 11 D 7 4 10 T 5 3 E Q Notice that the coefficient of E T is negative, while those of and EQ are positive. Thus, ED the LMP2/cc-pVTZ error cancels, to some degree, the cc-pvdz and cc-pvqz errors. We can use the percentages given in the section 4.1 to express the average errors at the cc-pvdz and ccpvtz levels as multiples of the cc-pvqz error. So on average, E 2.317 11 7 4 10 5 3 e E 1.745 e E e e 11 Q 7 4 10 5 Q 3 E Q, which simplifies to E 0. 516, E Q showing why the -P energy is more accurate than the cc-pvqz energy, and therefore more accurate than the energies obtained with other basis sets. (3) Comparison of schemes: Now that we know that extrapolating LMP2 calculations to the limit is an inexpensive way to increase their accuracy, we need to know

Correlation Percent which scheme to use to calculate such limits. The -F scheme initially appeared to perform best with local methods; with it, the local energy errors were lowest. However, as I increased the sizes of domains used in the LMP2 calculation, the magnitude of the LMP2/-F energies grew larger than the MP2/-F values. This is physically incorrect, as local energies should always be lower in magnitude than the corresponding non-local values. This flaw, along with the observation that the MP2/-F values were inconsistent with the MP2/-P and MP2/-S4 energies, shows the weakness of the -F scheme. LMP2/-P errors were, on average, 0.35 kcal/mol less than LMP2/-S4 errors. This implies that the -P scheme will yield the most accurate results in studies with local methods, a fact useful in any study involving large molecules. (4) Size-consistency of LMP2: Figure 1 shows a plot of the percent of the MP2/-P energy recovered with LMP2/-P for alkanes versus molecular size. The decreasing trend, best modeled with a power function, implies that LMP2/-P values will be within 1% of the Figure 1. LMP2 Size-Consistency 99.80% 99.75% 99.70% 99.65% 99.60% 99.55% 99.50% 0 1 2 3 4 5 6 7 8 9 Number of Carbons non-local values for alkanes with up to 1045 carbons. Thus, low-cost LMP2/-P calculations will likely be accurate for systems well into the range of biomolecules. My main contribution was to make accurate quantum calculations on important molecules more feasible by determining that LMP2 energy values can be made more accurate by extrapolating to the limit and by establishing that the -P scheme should be used for such an extrapolation. My research may, therefore,

one day aid vital advances in studies that depend on knowing thermodynamic properties of large molecules. 5 References [1] Werner, H.-J.; Pflüger, K. Ann. Rep. in Comp. Chem. 2007, 2, 53. [2] Pulay, P. Chem Phys. Lett. 1983, 100, 154. [3] Dunning Jr., T. H. J. Chem. Phys. 1989, 90, 1007. [4] Cioslowski, J.; Schimeczek, M.; Liu, G.; Stoyanov, V. J. Chem. Phys. 2000, 113, 9377. [5] Xantheas, S. S.; Dunning Jr., T. H. J. Phys. Chem. 1993, 97, 18. [6] Feller, D. J. Chem. Phys. 1992, 96, 6104. [7] Peterson, K. A.; Woon, D. E.; Dunning Jr., T. H. J. Chem. Phys. 1994, 100, 7410. [8] Martin, J. M. L. Chem. Phys. Lett. 1996, 259, 669.