Block Iterative Eigensolvers for Sequences of Dense Correlated Eigenvalue Problems

Similar documents
Improving the performance of applied science numerical simulations: an application to Density Functional Theory

A knowledge-based approach to high-performance computing in ab initio simulations.

Mitglied der Helmholtz-Gemeinschaft. Linear algebra tasks in Materials Science: optimization and portability

Quantum Theory of Materials

Solid State Theory: Band Structure Methods

An Optimized and Scalable Eigensolver for Sequences of Eigenvalue Problems

EIGIFP: A MATLAB Program for Solving Large Symmetric Generalized Eigenvalue Problems

DELFT UNIVERSITY OF TECHNOLOGY

On the Ritz values of normal matrices

Preconditioned Eigenvalue Solvers for electronic structure calculations. Andrew V. Knyazev. Householder Symposium XVI May 26, 2005

Arnoldi Methods in SLEPc

Density Functional Theory

Electron Correlation

Solution of eigenvalue problems. Subspace iteration, The symmetric Lanczos algorithm. Harmonic Ritz values, Jacobi-Davidson s method

Solution of eigenvalue problems. Subspace iteration, The symmetric Lanczos algorithm. Harmonic Ritz values, Jacobi-Davidson s method

Eigenvalue Problems CHAPTER 1 : PRELIMINARIES

Efficient implementation of the overlap operator on multi-gpus

LARGE SPARSE EIGENVALUE PROBLEMS. General Tools for Solving Large Eigen-Problems

Electronic Structure Methodology 1

LARGE SPARSE EIGENVALUE PROBLEMS

ELSI: A Unified Software Interface for Kohn-Sham Electronic Structure Solvers

Applications of Matrix Functions Part III: Quantum Chemistry

On the Modification of an Eigenvalue Problem that Preserves an Eigenspace

OVERVIEW OF QUANTUM CHEMISTRY METHODS

ELECTRONIC STRUCTURE CALCULATIONS FOR THE SOLID STATE PHYSICS

Time-dependent density functional theory (TDDFT)

Density Functional Theory. Martin Lüders Daresbury Laboratory

AMS526: Numerical Analysis I (Numerical Linear Algebra) Lecture 23: GMRES and Other Krylov Subspace Methods; Preconditioning

Matrix Algorithms. Volume II: Eigensystems. G. W. Stewart H1HJ1L. University of Maryland College Park, Maryland

Divide and conquer algorithms for large eigenvalue problems Yousef Saad Department of Computer Science and Engineering University of Minnesota

Last Time. Social Network Graphs Betweenness. Graph Laplacian. Girvan-Newman Algorithm. Spectral Bisection

Introduction to density-functional theory. Emmanuel Fromager

Jacobi-Based Eigenvalue Solver on GPU. Lung-Sheng Chien, NVIDIA

Electronic structure calculations with GPAW. Jussi Enkovaara CSC IT Center for Science, Finland

ABSTRACT OF DISSERTATION. Ping Zhang

Time-dependent density functional theory (TDDFT)

Key concepts in Density Functional Theory (I) Silvana Botti

Finding Rightmost Eigenvalues of Large, Sparse, Nonsymmetric Parameterized Eigenvalue Problems

Computational Methods. Chem 561

Incomplete Cholesky preconditioners that exploit the low-rank property

Wavelets for density functional calculations: Four families and three. applications

Math 504 (Fall 2011) 1. (*) Consider the matrices

Introduction to Density Functional Theory with Applications to Graphene Branislav K. Nikolić

Time-dependent linear-response variational Monte Carlo.

Preconditioned inverse iteration and shift-invert Arnoldi method

Keeping σ fixed for several steps, iterating on µ and neglecting the remainder in the Lagrange interpolation one obtains. θ = λ j λ j 1 λ j σ, (2.

Lecture 8: Introduction to Density Functional Theory

A Non-Linear Eigensolver-Based Alternative to Traditional Self-Consistent Electronic Structure Calculation Methods

APPLIED NUMERICAL LINEAR ALGEBRA

ILAS 2017 July 25, 2017

Teoría del Funcional de la Densidad (Density Functional Theory)

Density Functional Theory for Electrons in Materials

VASP: running on HPC resources. University of Vienna, Faculty of Physics and Center for Computational Materials Science, Vienna, Austria

Density Functional Theory

CHEM6085: Density Functional Theory

Chapter 2 Approximation Methods Can be Used When Exact Solutions to the Schrödinger Equation Can Not be Found.

Applied Mathematics 205. Unit V: Eigenvalue Problems. Lecturer: Dr. David Knezevic

Iterative projection methods for sparse nonlinear eigenvalue problems

M.A. Botchev. September 5, 2014

DFT in practice. Sergey V. Levchenko. Fritz-Haber-Institut der MPG, Berlin, DE

Inexact and Nonlinear Extensions of the FEAST Eigenvalue Algorithm

Numerical Methods I Eigenvalue Problems

Density-Matrix-Based Algorithms for Solving Eingenvalue Problems

A HARMONIC RESTARTED ARNOLDI ALGORITHM FOR CALCULATING EIGENVALUES AND DETERMINING MULTIPLICITY

Electronic structure theory: Fundamentals to frontiers. 1. Hartree-Fock theory

Accelerating and Scaling Lanczos Diagonalization with GPGPU

Polynomial filtering for interior eigenvalue problems Yousef Saad Department of Computer Science and Engineering University of Minnesota

Iterative methods for symmetric eigenvalue problems

HARMONIC RAYLEIGH RITZ EXTRACTION FOR THE MULTIPARAMETER EIGENVALUE PROBLEM

On the Numerics of 3D Kohn-Sham System

Reduction of nonlinear eigenproblems with JD

v(r i r j ) = h(r i )+ 1 N

Cucheb: A GPU implementation of the filtered Lanczos procedure. Department of Computer Science and Engineering University of Minnesota, Twin Cities

Applications of Renormalization Group Methods in Nuclear Physics 2

4.8 Arnoldi Iteration, Krylov Subspaces and GMRES

The geometric mean algorithm

A very efficient two-density approach to atomistic simulations and a proof of principle for small atoms and molecules

Electrochemistry project, Chemistry Department, November Ab-initio Molecular Dynamics Simulation

Exchange Correlation Functional Investigation of RT-TDDFT on a Sodium Chloride. Dimer. Philip Straughn

On prescribing Ritz values and GMRES residual norms generated by Arnoldi processes

Parallel Eigensolver Performance on High Performance Computers

Eigenvalue Problems. Eigenvalue problems occur in many areas of science and engineering, such as structural analysis

Density Functional Theory

Scientific Computing: An Introductory Survey

Rational Krylov methods for linear and nonlinear eigenvalue problems

Introduction to Density Functional Theory

DELFT UNIVERSITY OF TECHNOLOGY

Large Scale Electronic Structure Calculations

CLIMBING THE LADDER OF DENSITY FUNCTIONAL APPROXIMATIONS JOHN P. PERDEW DEPARTMENT OF PHYSICS TEMPLE UNIVERSITY PHILADELPHIA, PA 19122

Quantum Mechanical Simulations

Is there life after the Lanczos method? What is LOBPCG?

Lecture # 11 The Power Method for Eigenvalues Part II. The power method find the largest (in magnitude) eigenvalue of. A R n n.

ECS130 Scientific Computing Handout E February 13, 2017

An Arnoldi Method for Nonlinear Symmetric Eigenvalue Problems

Solving Ill-Posed Cauchy Problems in Three Space Dimensions using Krylov Methods

Iterative Diagonalization in Augmented Plane Wave Based Methods in Electronic Structure Calculations

Introduction to DFT and Density Functionals. by Michel Côté Université de Montréal Département de physique

Moments, Model Reduction and Nonlinearity in Solving Linear Algebraic Problems

Dept of Mechanical Engineering MIT Nanoengineering group

Polynomial optimization and a Jacobi Davidson type method for commuting matrices

Transcription:

Mitglied der Helmholtz-Gemeinschaft Block Iterative Eigensolvers for Sequences of Dense Correlated Eigenvalue Problems Birkbeck University, London, June the 29th 2012 Edoardo Di Napoli

Motivation and Goals Reverse Simulation total energy band energy gap conductivity forces, etc. Simulations = = Mathematical model Algorithmic structure Goal Increasing the performance of large legacy codes by exploiting physical information extracted from the simulations that can be used to speed-up the algorithms used in such codes Birkbeck University, London, June the 29th 2012 Edoardo Di Napoli Folie 2

Outline Stating the problem: how sequences of generalized eigenproblems arise in all-electron computations Eigenvectors angle evolution ALGORITHM SIM Exploiting eigenvector collinearity: block iterative eigensolvers Birkbeck University, London, June the 29th 2012 Edoardo Di Napoli Folie 3

Outline Stating the problem: how sequences of generalized eigenproblems arise in all-electron computations Eigenvectors angle evolution ALGORITHM SIM Exploiting eigenvector collinearity: block iterative eigensolvers Birkbeck University, London, June the 29th 2012 Edoardo Di Napoli Folie 4

The Foundations Investigative framework Quantum Mechanics and its ingredients n H = 2m h2 2 n Z α i i=1 i=1 α x i a α + 1 Hamiltonian i<j x i x j Φ(x 1 ;s 1,x 2 ;s 2,...,x n ;s n ) Wavefunction ( ) n Φ : R 3 {± 1 2 } R high-dimensional anti-symmetric function describes the orbitals of atoms and molecules. In the Born-Oppenheimer approximation, it is the solution of the Electronic Schrödinger Equation H Φ(x 1 ;s 1,x 2 ;s 2,...,x n ;s n ) = E Φ(x 1 ;s 1,x 2 ;s 2,...,x n ;s n ) Birkbeck University, London, June the 29th 2012 Edoardo Di Napoli Folie 5

Density Functional Theory (DFT) 1 Φ(x 1 ;s 1,x 2 ;s 2,...,x n ;s n ) = Λ i,a φ a (x i ;s i ) 2 density of states n(r) = a φ a (r) 2 3 In the Schrödinger equation the exact Coulomb interaction is substituted with an effective potential V 0 (r) = V I (r) + V H (r) + V xc (r) Hohenberg-Kohn theorem one-to-one correspondence n(r) V 0 (r) = V 0 (r) = V 0 (r)[n]! a functional E[n] : E 0 = min n E[n] The high-dimensional Schrödinger equation translates into a set of coupled non-linear low-dimensional self-consistent Kohn-Sham (KS) equation ( ) a solve Ĥ KS φ a (r) = h2 2m 2 + V 0 (r) φ a (r) = ε a φ a (r) Birkbeck University, London, June the 29th 2012 Edoardo Di Napoli Folie 6

Kohn-Sham scheme and DFT Self-consistent cycle Typically this set of equations is solved using an iterative self-consistent cycle Initial guess n init (r) = Compute KS potential V 0 (r)[n] Solve KS equations Ĥ KS φ a (r) = ε a φ a (r) No OUTPUT Energy, forces,... Yes = Converged? n (l+1) n (l) < η Compute new density n(r) = a φ a (r) 2 In practice this iterative cycle is much more computationally challenging and requires some form of broadly defined discretization Birkbeck University, London, June the 29th 2012 Edoardo Di Napoli Folie 7

Generalized eigenvalue problems Ax = λbx A common way of discretizing the KS equations is to expand the wave functions φ a (r) on a basis set φ a (r) φ k,ν (r) = c G k,ν ψ G(k,r) G+k K max This expansion is then inserted in the KS equations ψ G (k,r) Ĥ KS c G k,ν ψ G (k,r) = λ kν ψ G (k,r) c G k,ν ψ G (k,r), G G and, by defining the matrix entries for the left and right hand side respectively as Hamiltonian A k and overlap matrices B k, [A k B k ] GG = α dr ψ G (k,r)[ Ĥ KS ˆ1 ] ψ G (k,r) one arrives at generalized eigenvalue equations parametrized by k P k : (A k ) GG c G kν = λ kν (B k ) GG c G kν. A kx i = λ i B k x i. G G Birkbeck University, London, June the 29th 2012 Edoardo Di Napoli Folie 8

Discretized Kohn-Sham scheme Self-consistent cycle Initial guess n start (r) = Compute KS potential V 0 (r)[n] Solve a set of eigenproblems P k1...p kn No OUTPUT Energy,... Yes = Converged? n (l+1) n (l) < η Compute new density n(r) = k,ν φ k,ν (r) 2 Observations: 1 A and B are respectively hermitian and hermitian positive definite 2 eigenproblems across k index have different size and we consider them independent from each other (for the moment) 3 eigenvectors of problems of same k are seemingly uncorrelated across iterations i 4 k= 1:10-100 ; i = 1:20-50 Birkbeck University, London, June the 29th 2012 Edoardo Di Napoli Folie 9

Outline Stating the problem: how sequences of generalized eigenproblems arise in all-electron computations Eigenvectors angle evolution ALGORITHM SIM Exploiting eigenvector collinearity: block iterative eigensolvers Birkbeck University, London, June the 29th 2012 Edoardo Di Napoli Folie 10

Sequences of eigs: an ALGORITHM SIM case Sequences of eigenproblems Consider the set of generalized eigenproblems P (1)...P (l) P (l+1)...p (N) (P) N { Could this sequence P (l)} of eigenproblems evolve following a convergence pattern in line with the convergence of n(r)? Numerical study: studied the evolutions of the angles b/w eigenvectors of successive iterations developed a method that establishes systematically a one-to-one correspondence b/w eigenvectors collected data on eigenvectors deviation angles 1 analyzed deviation angles at fixed λ for all ks 2 analyzed deviation angles at fixed k for all λs below Fermi Energy Birkbeck University, London, June the 29th 2012 Edoardo Di Napoli Folie 11

Angle evolution fixed k Example: a metallic compound at fixed k of subspace angle for eigenvectors of k point 1 and lowest 75 eigs 0Evolution 10 AuAg Angle b/w eigenvectors of adjacent iterations 10 2 10 4 10 6 10 8 10 10 2 6 10 14 18 22 Iterations (2 > 22) Birkbeck University, London, June the 29th 2012 Edoardo Di Napoli Folie 12

Correlation and its exploitation correlation between successive eigenvectors x (l 1) and x (l) Angles decrease monotonically with some oscillation Majority of angles are small after the first few iterations Note: Mathematical model correlation. Correlation systematic analysis of the simulation. ALGORITHM SIM The stage is favorable to an iterative eigensolver where the eigenvectors of P (l 1) are fed to the solve P (l) Next stages of the investigation: 1 Establish which eigensolvers can exploit the evolution (Implicit Restarted Arnoldi, Krylov-Schur, Subspace Iteration, Davidson-like, etc.) 2 Investigate if approximate eigenvectors can speed-up iterative solvers 3 Understand if iterative methods be competitive with direct methods for dense problems Birkbeck University, London, June the 29th 2012 Edoardo Di Napoli Folie 13

Outline Stating the problem: how sequences of generalized eigenproblems arise in all-electron computations Eigenvectors angle evolution ALGORITHM SIM Exploiting eigenvector collinearity: block iterative eigensolvers Birkbeck University, London, June the 29th 2012 Edoardo Di Napoli Folie 14

Block Iterative Eigensolvers Two essential properties the iterative algorithms have to comply with: 1 the ability to receive as input a sizable set of approximate eigenvectors; 2 the capacity to solve simultaneously for a substantial portion of eigenpairs. Block iterative methods constitutes the natural choice: they accept a variable set of multiple starting vectors; these methods have a faster convergence rate and avoid stalling when facing small clusters of eigenvalues; when augmented with polynomial accelerators their performance is further improved. ALGORITHMS: Block Krylov-Schur Block Chebyshev-Davidson Chebyshev Subspace Iteration LOBPCG Birkbeck University, London, June the 29th 2012 Edoardo Di Napoli Folie 15

Experimental tests setup Matrix sizes: 2,000 5,700 Num of fixed-point iterations: 15 30 Num of k-points: 6 27 B ill-conditioned B is in general almost singular. Examples: size(a) = 50 κ(a) 10 4 size(a) = 500 κ(a) 10 7 We used the standard form for the problem Ax = λbx A y = λy with A = L 1 AL T and y = L T x Numerical Study: Approx. vs Random solutions against Iteration Index Approx. vs Random solutions against Spectrum Fraction Numerical tests were performed using Matlab version R2011b (7.13.0.564) running on an Intel i7 CPU with 8 cores at 2.93 GHz. Four cores and 8 Gb of RAM were fully dedicated to computations Birkbeck University, London, June the 29th 2012 Edoardo Di Napoli Folie 16

100 Speed up vs. iterations for CaFe 2 As 2 (n=2612) 90 80 ChSI Speed up (%) 70 60 50 40 Lobpcg BChDav 3% BChDav 7% ChSI 3% ChSI 7% Lobpcg 3% Lobpcg 7% 30 20 10 BChDav 0 3 7 11 14 17 20 23 Iteration Index Figure: Comparison between the 3 most effective block iterative eigensolvers for CaFe 2 As 2 with respect to the outer-iteration index Birkbeck University, London, June the 29th 2012 Edoardo Di Napoli Folie 17

80 70 60 CPU time vs. iterations for CaFe 2 As 2 (n=2612) BChDav 7% ChSI 7% Lobpcg 7% BChDav 3% ChSI 3% Lobpcg 3% CPU Time (seconds) 50 40 30 20 10 0 3 7 11 14 17 20 23 Iteration index Figure: Comparison between the 3 most effective block iterative eigensolvers for CaFe 2 As 2 with respect to the outer-iteration index Birkbeck University, London, June the 29th 2012 Edoardo Di Napoli Folie 18

100 90 80 Speed up vs eigenspectrum fraction for Na 15 Cl 14 Li (n=3893) BChDav ChSI Lobpcg 70 Speed up (%) 60 50 40 30 20 10 0 1 2 3 4 5 6 7 Eigenspectrum fraction (%) Figure: Comparison between the 3 most effective block iterative eigensolvers for Na 15 Cl 14 Li with respect to eigenspectrum fraction Birkbeck University, London, June the 29th 2012 Edoardo Di Napoli Folie 19

400 CPU time vs. eigenspectrum fraction for Na 15 Cl 14 Li (n=3893) CPU Time (seconds) 350 300 250 200 150 BChDav approx ChSI approx Lobpcg approx BChDav random ChSI random Lobpcg random 100 50 0 1 2 3 4 5 6 7 Eigenspectrum fraction (%) Figure: Comparison between the 3 most effective block iterative eigensolvers for Na 15 Cl 14 Li with respect to eigenspectrum fraction Birkbeck University, London, June the 29th 2012 Edoardo Di Napoli Folie 20

In summary All methods allow the possibility of seeding the eigensolver with as many eigenvectors as needed 1 Chebyshev Subspace Iteration and LOBPCG methods: Experience between 55% and 85% speed-up The speed-up has little dependence on the eigesprectrum fraction or the iteration index Their efficiency highly depend on the efficiency of Matrix-Matrix multiplication sub-routines GPUs 2 Block Chebyshev-Davidson method: Experiences a speed-up up to 45 % The increased feeding capacity allows speed-ups even when a substantial fraction of the spectrum is sought after Evolution of the sequence implies an increase in speed-ups towards the end of the sequence Birkbeck University, London, June the 29th 2012 Edoardo Di Napoli Folie 21

Conclusions and future work Feeding eigenvectors of P (l 1) speed-ups the iterative solver for P (l) ; The algorithmic structure of FLAPW could be re-thought to take into consideration our results reverse simulation can substantially influence the computational paradigm of an application; Conduct a more low-level analysis of speed-up on variants of the investigated algorithms Planning the generalization of a block Chebyshev-filtered Subspace Iteration eigensolver with a general interface for the use of approximate solutions; Analysis on the structure of the entries of A and B across adjacent iteration seems to suggest an exploitation of low-rank updates for sequences of eigenproblems; Correlation may open the road to a reduction of eigenproblem complexity through a sequence of reductions to tridiagonal form. Birkbeck University, London, June the 29th 2012 Edoardo Di Napoli Folie 22

References 1 EDN Block Iterative Eigensolvers for Sequences of Correlated Eigenvalue Problems Submitted to SIAM Journal on Scientific Computing [arxiv:1206.3768] 2 EDN, P. Bientinesi, and S. Blügel, Correlation in sequences of generalized eigenproblems arising in Density Functional Theory, Comp. Phys. Comm. 183 (2012), pp. 1674-1682, [arxiv:1108.2594]. Birkbeck University, London, June the 29th 2012 Edoardo Di Napoli Folie 23