Computational Materials Science with the WIEN2k code

Similar documents
Materials Science with WIEN2k on vsc1 and vsc2

Iterative Diagonalization in Augmented Plane Wave Based Methods in Electronic Structure Calculations

CHAPTER 3 WIEN2k. Chapter 3 : WIEN2k 50

Chapter 3. The (L)APW+lo Method. 3.1 Choosing A Basis Set

Country

Country

ELSI: A Unified Software Interface for Kohn-Sham Electronic Structure Solvers

Density functional theory (DFT) and the concepts of the augmented-plane-wave plus local orbital (L)APW+lo method

COMPUTATIONAL TOOL. Fig. 4.1 Opening screen of w2web

Large scale applications:

The Linearized Augmented Planewave (LAPW) Method

Electronic structure calculations with GPAW. Jussi Enkovaara CSC IT Center for Science, Finland

CRYSTAL in parallel: replicated and distributed (MPP) data. Why parallel?

Density functional theory (DFT) and the concepts of the augmented-plane-wave plus local orbital (L)APW+lo method

FULL POTENTIAL LINEARIZED AUGMENTED PLANE WAVE (FP-LAPW) IN THE FRAMEWORK OF DENSITY FUNCTIONAL THEORY

Basic introduction of NWChem software

VASP: running on HPC resources. University of Vienna, Faculty of Physics and Center for Computational Materials Science, Vienna, Austria

exciting in a nutshell

Parallel Eigensolver Performance on High Performance Computers

ELECTRONIC STRUCTURE AND CHEMICAL BONDING IN LAVES PHASES Al 2 Ca, Be 2 Ag AND Be 2 Ti. D. Shapiro, D. Fuks, A. Kiv

Quantum Chemical Calculations by Parallel Computer from Commodity PC Components

A Package for calculating elastic tensors of cubic Phases by using second-order derivative with WIEN2k Package

A Green Function Method for Large Scale Electronic Structure Calculations. Rudolf Zeller

Parallelization of the Molecular Orbital Program MOS-F

Cluster Computing: Updraft. Charles Reid Scientific Computing Summer Workshop June 29, 2010

WRF performance tuning for the Intel Woodcrest Processor

Parallel Eigensolver Performance on High Performance Computers 1

Direct Self-Consistent Field Computations on GPU Clusters

Table of Contents. Table of Contents Spin-orbit splitting of semiconductor band structures

Solution Behavior and Structural Properties of Cu(I) Complexes Featuring m-terphenyl Isocyanides

Photoelectron Interference Pattern (PEIP): A Two-particle Bragg-reflection Demonstration

Computational Nanoscience: Do It Yourself!

Symmetry Assumptions, Kramers Kronig Transformation and Analytical Continuation in Ab Initio Calculations of Optical Conductivities

Basic introduction of NWChem software

Accelerating Linear Algebra on Heterogeneous Architectures of Multicore and GPUs using MAGMA and DPLASMA and StarPU Schedulers

Q-Chem 4.0: Expanding the Frontiers. Jing Kong Q-Chem Inc. Pittsburgh, PA

Electronic structure, atomic forces and structural relaxations by WIEN2k

Electronic structure of Ce 2 Rh 3 Al 9

Due: since the calculation takes longer than before, we ll make it due on 02/05/2016, Friday

Computationally Efficient Analysis of Large Array FTIR Data In Chemical Reaction Studies Using Distributed Computing Strategy

Comparison of various abinitio codes used in periodic calculations

Speed-up of ATK compared to

Band calculations: Theory and Applications

One Optimized I/O Configuration per HPC Application

All electron optimized effective potential method for solids

Electron transport through molecular junctions and FHI-aims

NOT FOR DISTRIBUTION REVIEW COPY. Na 2 IrO 3 as a molecular orbital crystal

for XPS surface analysis

Core loss spectra (EELS, XAS)

A Data Communication Reliability and Trustability Study for Cluster Computing

Pseudopotentials: design, testing, typical errors

Making electronic structure methods scale: Large systems and (massively) parallel computing

Theory, Interpretation and Applications of X-ray Spectra*

Core-Level spectroscopy. Experiments and first-principles calculations. Tomoyuki Yamamoto. Waseda University, Japan

Massively parallel electronic structure calculations with Python software. Jussi Enkovaara Software Engineering CSC the finnish IT center for science

Quantum Modeling of Solids: Basic Properties

Half-metallicity in Rhodium doped Chromium Phosphide: An ab-initio study

Some notes on efficient computing and setting up high performance computing environments

Bonding of hexagonal BN to transition metal surfaces: An ab initio density-functional theory study

ELECTRONIC AND MAGNETIC PROPERTIES OF BERKELIUM MONONITRIDE BKN: A FIRST- PRINCIPLES STUDY

Preconditioned Parallel Block Jacobi SVD Algorithm

Introduction of XPS Absolute binding energies of core states Applications to silicene

The Augmented Spherical Wave Method

Parallelization of the QC-lib Quantum Computer Simulator Library

Supporting Information

Modeling and visualization of molecular dynamic processes

Band Structure Calculations; Electronic and Optical Properties

NHM Tutorial Part I. Brief Usage of the NHM

This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under contract

FEAST eigenvalue algorithm and solver: review and perspectives

ab initio Electronic Structure Calculations

INITIAL INTEGRATION AND EVALUATION

Large Scale Electronic Structure Calculations

Using Web-Based Computations in Organic Chemistry

Performance Evaluation of Some Inverse Iteration Algorithms on PowerXCell T M 8i Processor

Supporting Online Material

Electronic, magnetic and spectroscopic properties of free Fe clusters

Introduction of XPS Absolute binding energies of core states Applications to silicone Outlook

Supporting Information

Self-compensating incorporation of Mn in Ga 1 x Mn x As

Schwarz-type methods and their application in geomechanics

Size- and site-dependence of XMCD spectra of iron clusters from ab-initio calculations

Exercises: In the following you find some suggestions for exercises, which teach you various tasks one may perform with WIEN2k.

Development of Transferable Materials Information and Knowledge Base for Computational Materials Science

Introduction to Hartree-Fock calculations in Spartan

The Munich SPRKKR-program package A spin polarised relativistic Korringa-Kohn-Rostoker code for calculating solid state properties

SPARSE SOLVERS POISSON EQUATION. Margreet Nool. November 9, 2015 FOR THE. CWI, Multiscale Dynamics

EXPERIMENTAL VERIFICATION OF CALCULATED LATTICE. RELAXATIONS AROUND IMPURITIES IN CdTe

Some surprising results of the Kohn-Sham Density Functional

Fundamentals and applications of Density Functional Theory Astrid Marthinsen PhD candidate, Department of Materials Science and Engineering

Cyclops Tensor Framework: reducing communication and eliminating load imbalance in massively parallel contractions

A Quantum Chemistry Domain-Specific Language for Heterogeneous Clusters

All-electron density functional theory on Intel MIC: Elk

Efficient algorithms for symmetric tensor contractions

Before we start: Important setup of your Computer

Wannier Function Based First Principles Method for Disordered Systems

Improving the solar cells efficiency of metal Sulfide thin-film nanostructure via first principle calculations

GPU acceleration of Newton s method for large systems of polynomial equations in double double and quad double arithmetic

Solving the Inverse Toeplitz Eigenproblem Using ScaLAPACK and MPI *

SESSION 2. (September 26, 2000) B. Lake Spin-gap and magnetic coherence in a high-temperature superconductor

Transcription:

Computational Materials Science with the WIEN2k code P. Blaha Institute of Materials Chemistry TU Wien pblaha@theochem.tuwien.ac.at http://www.wien2k.at

Computational Materials Science describe materials by quantummechanical simulations(ab initio) simulate: infinite ( perfect ) bulk solids impurities, vacancies in solids surfaces nanostructures atomic and electronic structure stability, phase transitions, mechanical properties, magnetism, chemical bonding,. spectroscopies (IR, Raman, XPS, XAS, XES, EELS,Mössbauer, NMR, STM)

Method: DFT calculations numerical solution of a Schrödinger-like equation (Kohn-Sham) k k k V ( r) i ei i 2 expansion into augmented plane waves basisfunctions f Kn : variational principle 1 2 k = C f K n k n k < E n n 50-100 APWs / atom < H > < E >= < > C k n > =0 generalized eigenvalue problem H C=E S C Setup and diagonalization of (real or complex) matrices of size 10.000 to 50.000 (up to 50 Gb memory, only 10% of e i )

Loops: loop 1: different structures (atomic positions) loop 2: scf-cycle (solve [-½ 2 +V] =E new V iterate) loop 3: k-loop (solve H =E for different k-points) loop 4: setup + diagonalization of H C = E S C largest effort, highest optimization, best parallelization, scaling of time and memory F90, mpi, Scalapack, blas 1 4 2 5 3 6 processors loop over APWs 7 8 9 in parallel (via scripts, slow network, common NFS) 10000-50000 sequential (efficient multi-secant BROYDEN-mixing; L.Marks PRB 78, 075114) coarse grain parallel (different jobs) or sequential (forces new positions)

WIEN2k software package An Augmented Plane Wave Plus Local Orbital Program for Calculating Crystal Properties Peter Blaha Karlheinz Schwarz Georg Madsen Dieter Kvasnicka Joachim Luitz November 2001 Vienna, AUSTRIA Vienna University of Technology http://www.wien2k.at developed over more than 25 years 1400 registered groups 2000 mailinglist users Europe: A, B, CH, CZ, D, DK, ES, F, FIN, GR, H, I, IL, IRE, N, NL, PL, RO, S, SK, SL, SI, UK, ETH Zürich, MPI Stuttgart, FHI Berlin, DESY, TH Aachen, ESRF, Prague, IJS Ljubjlana, Paris, Chalmers, Cambridge, Oxford America: ARG, BZ, CDN, MX, USA (MIT, NIST, Berkeley, Princeton, Harvard, Argonne NL, Los Alamos NL, Oak Ridge NL, Penn State, Purdue, Georgia Tech, Lehigh, John Hopkins, Chicago, Stony Brook, SUNY, UC St.Barbara, UCLA) far east: AUS, China, India, JPN, Korea, Pakistan, Singapore,Taiwan (Beijing, Tokyo, Osaka, Kyoto, Sendai, Tsukuba, Hong Kong) 55 industries (Canon, Eastman, Exxon, Fuji, Hitachi, IBM, Idemitsu Petrochem., Kansai, Komatsu, Konica-Minolta, A.D.Little, Mitsubishi, Mitsui Mining, Motorola, NEC, Nippon Steel, Norsk Hydro, Osram, Panasonic, Samsung, Seiko Epson, Siemens, Sony, Sumitomo,TDK,Toyota).

w2web GUI (graphical user interface) Structure generator spacegroup selection import cif file step by step initialization symmetry detection automatic input generation SCF calculations Magnetism (spin-polarization) Spin-orbit coupling Forces (automatic geometry optimization) Guided Tasks Energy band structure DOS Electron density X-ray spectra Optics command line possible!

current hardware: SFB Aurora IBM Cluster 72 x Dual Xeon 3.6 GHz, 1MB L2 Infiniband Switch (installed 2005) SUN Cluster 72 x Quad-Opterons 2.4 GHz Infiniband Switch (installed 2006) WIEN2k on IBM-BlueGene (2000 processors)

Timing and parallel performance: Task Time (s) r, V eff 151 H, S setup 1176 full diagonalization 2454 iterative diag. 461 3x3 super-cell of a h-bn/ni(111) surface model with 99 atoms/cell (N bas = 16900, 1 scf-cycle on 4 cores) iterative block-davidson diagonalization with improved preconditioner [formally H -1 instead of diag(h-es) -1 ; but only factorization of H required] hand-coded mpi (setup) scalapack (diagonalization) (limitations due to stacked Infiniband switch)

h-bn / Rh(111) nanomesh STM Experiment: M.Corso et al., Science 303, 217(2004) partial double layer model Theory: R.Laskowski et al., PRL 98, 106802 (2007): Corrugated single layer model 12x12 Rh, 13x13 BN, 1108 atoms/cell, HC=ESC (50000x50000, 50 GB memory) 64 cpus, 2h/scf-cycle (was 20h!!) 3 month of computing time N corrugation Theoretical model was confirmed later! H. Dill, et. al.: Science 319, S. 1824(2008)

Thank you for your attention!