arxiv: v1 [hep-lat] 23 Dec 2010

Similar documents
arxiv: v2 [hep-lat] 4 Feb 2012

PoS(LATTICE 2008)114. Investigation of the η -η c -mixing with improved stochastic estimators

arxiv: v1 [hep-lat] 2 Nov 2015

Omega baryon electromagnetic form factors from lattice QCD

arxiv: v2 [hep-lat] 23 Dec 2008

NUCLEON AND PION-NUCLEON FORM FACTORS FROM LATTICE QCD

Nucleon generalized form factors with twisted mass fermions

arxiv: v1 [hep-lat] 4 Nov 2014

Baryon spectroscopy with spatially improved quark sources

Improving many flavor QCD simulations using multiple GPUs

Low-lying positive-parity excited states of the nucleon

Pseudoscalar Flavor-Singlet Physics with Staggered Fermions

Two-loop evaluation of large Wilson loops with overlap fermions: the b-quark mass shift, and the quark-antiquark potential

arxiv: v1 [hep-lat] 7 Oct 2007

Lattice simulation of 2+1 flavors of overlap light quarks

Nucleon structure from 2+1-flavor dynamical DWF ensembles

The Wave Function of the Roper Resonance

Quark tensor and axial charges within the Schwinger-Dyson formalism

Thermal transition temperature from twisted mass QCD

Department of Physical Sciences, University of Helsinki and Helsinki Institute of Physics, Finland

arxiv: v1 [hep-lat] 26 Dec 2009

Pseudo-Critical Temperature and Thermal Equation of State from N f = 2 Twisted Mass Lattice QCD

Ernst-Michael Ilgenfritz, Michael Müller-Preussker and Andre Sternbeck

University of Athens, Institute of Accelerating Systems and Applications, Athens, Greece

arxiv: v1 [hep-lat] 19 Jan 2016

Nucleon Spectroscopy with Multi-Particle Operators

Pseudoscalar Flavor-Singlet Physics with Staggered Fermions p.1/23

Critical end point of Nf=3 QCD at finite temperature and density

Nucleon form factors and moments of parton distributions in twisted mass lattice QCD

arxiv: v1 [hep-lat] 6 Nov 2012

PoS(LAT2005)205. B s meson excited states from the lattice. UKQCD Collaboration

PoS(EPS-HEP2011)179. Lattice Flavour Physics

Baryon correlators containing different diquarks from lattice simulations

The 1405 MeV Lambda Resonance in Full-QCD

Glueball relevant study on isoscalars from N f = 2 lattice QCD

PoS(LATTICE 2015)263. The leading hadronic contribution to γ-z mixing. Vera Gülpers 1, Harvey Meyer 1,2, Georg von Hippel 1, Hartmut Wittig 1,2

Lattice QCD Calculation of Nucleon Tensor Charge

Quark and Glue Momenta and Angular Momenta in the Proton a Lattice Calculation

Hadron Structure from Lattice QCD

Hadron structure from lattice QCD

arxiv: v1 [hep-lat] 30 Oct 2018

arxiv: v1 [hep-lat] 18 Aug 2017

Spectroscopy of charmed baryons from lattice QCD

PoS(LATTICE 2015)261. Scalar and vector form factors of D πlν and D Klν decays with N f = Twisted fermions

PoS(LATTICE 2013)500. Charmonium, D s and D s from overlap fermion on domain wall fermion configurations

Nucleon form factors and moments of GPDs in twisted mass lattice QCD

Hyperons and charmed baryons axial charges from lattice QCD. Christos Kallidonis

arxiv:hep-lat/ v1 5 Oct 2006

Transverse momentum distributions inside the nucleon from lattice QCD

Form factors on the lattice

The kaon B-parameter from unquenched mixed action lattice QCD

arxiv: v1 [hep-lat] 19 Dec 2012

Thermodynamics using p4-improved staggered fermion action on QCDOC

Cascades on the Lattice

arxiv: v1 [hep-lat] 17 Oct 2009

Transverse Momentum Distributions of Partons in the Nucleon

Michael CREUTZ Physics Department 510A, Brookhaven National Laboratory, Upton, NY 11973, USA

RG scaling at chiral phase transition in two-flavor QCD

Wave functions of the Nucleon

PoS(Lattice 2010)120. Strange and charmed baryons using N f = 2 twisted mass QCD. Mauro Papinutto,Jaume Carbonell

First results from dynamical chirally improved fermions

Nucleon structure near the physical pion mass

to N transition and form factors in Lattice QCD

arxiv:hep-lat/ v1 5 Oct 2006

Constraints for the QCD phase diagram from imaginary chemical potential

PoS(LAT2006)094. The decay constants f B + and f D + from three-flavor lattice QCD

arxiv: v1 [hep-lat] 27 Sep 2011

Excited States of the Nucleon in Lattice QCD

η and η mesons from N f = flavour lattice QCD

PoS(LATTICE 2013)393. K and D oscillations in the Standard Model and its. extensions from N f = Twisted Mass LQCD

Nucleon Deformation from Lattice QCD Antonios Tsapalis

(Towards) Baryon Resonances from Lattice QCD

Localization properties of the topological charge density and the low lying eigenmodes of overlap fermions

arxiv: v1 [hep-lat] 3 Nov 2009

lattice QCD and the hadron spectrum Jozef Dudek ODU/JLab

arxiv: v1 [hep-lat] 19 Jul 2009

arxiv:hep-lat/ v2 13 Oct 1998

Faddeev equations: a view of baryon properties

Nuclear Force from Lattice QCD

PoS(LATTICE2014)169. The Nγ transition form factors on the lattice. Akaki Rusetsky. Andria Agadjanov. Véronique Bernard

PoS(LATTICE 2013)403. Using all-to-all propagators for K ππ decays. Daiqian Zhang. Columbia University

Thermodynamics of strongly-coupled lattice QCD in the chiral limit

arxiv: v1 [hep-lat] 31 Oct 2015

arxiv: v1 [hep-lat] 22 Oct 2013

arxiv: v1 [hep-lat] 10 Jul 2012

lattice QCD and the hadron spectrum Jozef Dudek ODU/JLab

The Polyakov Loop and the Eigenvalues of the Dirac Operator

Hopping Parameter Expansion for Heavy-Light Systems

Transverse momentum-dependent parton distributions from lattice QCD. Michael Engelhardt New Mexico State University

SUPA, School of Physics and Astronomy, University of Glasgow, Glasgow, G12 8QQ, UK

PoS(LAT2006)208. Diseases with rooted staggered quarks. Michael Creutz Brookhaven National Laboratory, Upton, NY 11973, USA

arxiv: v1 [hep-lat] 23 Nov 2018

Universality check of the overlap fermions in the Schrödinger functional

Lattice Studies of Baryon Resonances

spectroscopy overview Jozef Dudek Old Dominion University & Jefferson Lab thanks for inviting a whinging pom

Accelerating Quantum Chromodynamics Calculations with GPUs

arxiv: v1 [hep-lat] 25 Oct 2018

Double poles in Lattice QCD with mixed actions

University of Groningen

Hadron Deformation and Form Factors in Lattice QCD

Transcription:

arxiv:2.568v [hep-lat] 23 Dec 2 C. Alexandrou Department of Physics, University of Cyprus, P.O. Box 2537, 678 Nicosia, Cyprus and Computation-based Science and Technology Research Center, Cyprus Institute, P.O. Box 27456, 645 Nicosia, Cyprus E-mail: alexand@cyi.ac.cy D. Christaras Department of Physics, University of Cyprus, P.O. Box 2537, 678 Nicosia, Cyprus E-mail: christaras.dimitrios@ucy.ac.cy Computation-based Science and Technology Research Center, Cyprus Institute, P.O. Box 27456, 645 Nicosia, Cyprus E-mail: a.ocais@cyi.ac.cy A. Strelchenko Computation-based Science and Technology Research Center, Cyprus Institute, P.O. Box 27456, 645 Nicosia, Cyprus E-mail: a.strelchenko@cyi.ac.cy We present an implementation of the disconnected diagram contributions to quantities such as the flavor-singlet pseudoscalar meson mass which are accelerated by GPGPU technology utilizing the NVIDIA CUDA platform. To enable the exact evaluation of the disconnected loops we use a 6 3 32 lattice and N f = 2 Wilson fermions simulated by the SESAM Collaboration. The disconnected loops are also computed using stochastic methods with several noise reduction techniques. In particular, we analyze various dilution schemes as well as the recently proposed truncated solver method. We find consistency among the different methods used for the determination of the η mass, albeit that the gauge noise for the ensemble studied is large. We also find that the effect of dilution does not go beyond that of optimal statistical noise in many cases. It has been observed, however, that spin dilution does have a significant effect for some quantities studied. The XXVIII International Symposium on Lattice Field Theory, Lattice2 June 4-9, 2 Villasimius, Italy Speaker. c Copyright owned by the author(s) under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike Licence. http://pos.sissa.it/

. Introduction An accurate estimate of disconnected contributions to flavor singlet quantities remains one of the most computationally demanding problems in hadronic physics. The most commonly adopted approach is to apply stochastic methods in order to estimate the quark propagator. A number of methods to reduce the stochastic noise inherent in such an approach has been developed and their respective merits investigated in detail in Ref. []. Such methods typically require large numbers of Dirac matrix inversions and hardware accelerators, such as graphics processors (GPUs), can dramatically accelerate these inversions [2]. The main goal of the present study is two-fold: Firstly, we compute the disconnected contribution to the flavor-singlet pseudo-scalar meson, η, mass which is also related to the U A () anomaly in QCD. Here, this is used as a case-study for the purposes of evaluating the efficacy of the implementation. Secondly, we examine the efficiency of various stochastic noise reduction techniques. More precisely, at this stage, we consider two techniques of variance reduction: partitioning (or dilution) [3] and the truncated solver method [4]. We performed an exact evaluation of the disconnected loops for N f = 2 Wilson fermions on a lattice of size 6 3 32 using GPUs. The calculation is then repeated using stochastic methods. The exact calculation gives us an accurate benchmark by which to compare all stochastic variance reduction methods and explicitly exposes the gauge noise underlying each quantity to be measured. 2. Lattice ensemble and simulation parameters For this exploratory study we use N f = 2 Wilson fermions at β = 5.6 and hopping parameter κ =.57, which corresponds to pion mass of m π = 884 MeV on a lattice of size 6 3 32 [5]. The lattice spacing is a =.8 fm as determined from the nucleon mass at the physical point [6]. For constructing the meson propagators we utilized both local and smeared quark fields. In the latter case, we apply gauge-covariant Gaussian smearing using a range of smearing parameters. The stochastic estimate of the disconnected quark loops is performed using complex Z 2 noise for the source vectors in combination with several partitioning (dilution) schemes and the truncated solver method []. Specifically, we consider various combinations of space, spin and color dilution schemes. Colour dilution leads to a multiplicative factor of 3 for the number of inversions. In spin space, a full dilution leads to a factor of 4 for the inversions. In this case an even-odd partitioning of the space can alternatively be employed leading to an increase of a factor of 2 in the number of inversions. For spatial dilutions, in addition to an even-odd dilution, we have also applied a cubic dilution, where separate sources are placed on each vertex of an elementary 3-d cube and repeated throughout the lattice, leading to an increase of a factor of 8 in the number of inversions. Time dilution is applied in all cases and translational invariance exploited so that this does not increase the number of required inversions. The truncated solver method [] effectively partitions the problem into a low precision and high precision space. A large number of low precision inversions are carried out to achieve an approximation to the propagator with low stochastic error (but only accurate to low precision). A high precision stochastic correction is then applied using a small ensemble with the corresponding inversions carried out to high precision. We use a stochastic ensemble of 5 noise vectors for the 2

low precision space with an ensemble of 5 noise vectors for the high-precision correction. The inversion tolerance for the low precision was chosen to be 6 such that one can restrict oneself to a single precision conjugate gradient inversion (which is very efficient on GPU accelerators), while in the case of full precision the tolerance was set to. The ensemble sizes were chosen to be quite large in order to avoid any quantity-specific tuning of the ensemble sizes. Finally, as was already mentioned, the exact evaluation of the all-to-all propagator is also carried out. This is clearly the most computational intensive part, and was only possible due to the use of graphics accelerators employing the QUDA library (as was used for all inversions), which provides mixed precision implementations of CG and BiCGstab solvers for the NVIDIA CUDA platform [7]. This provides a benchmark at the level of gauge-noise for all quantities with contributions from disconnected loops. 3. Results For all-to-all propagators, a general isovector two-point correlation function, C BA (p, t), for the creation of a particle at timeslice t with momentum p from the operator Γ A and its annihilation at timeslice t + t with the operator Γ B is given by, C BA (p, t) = L 3 T Tr(S F (y,t;x,t + t)γ B (p)s F (x,t + t;y,t)γ A (p)), (3.) x,y,t where S F (x,t;x,t ) is the propagator from spacetime point (x,t) to spacetime point (x,t ), spin and colour indices are suppressed and phases for momentum projections (and quark smearing operations) are incorporated into the definition of the operators Γ A and Γ B. For isoscalar quantities, disconnected loops give a contribution D BA (p, t) to the correlation function, D BA (p, t) = L 3 T Tr(S F (x,t;x,t)γ B (p)s F (y,t + t;y,t + t)γ A (p)), (3.2) x,y,t In our particular case of the η meson in an N f = 2 gauge ensemble, if we suppress all operator and momentum indices, C η (t) = C π (t) 2D(t). (3.3) For mesons on lattices with periodic boundary conditions C(t) e mt + e m(n T t) for t large (where N T is the lattice temporal extent). We can therefore analyze the ratio of the disconnected quark loop, D(t), and connected correlation function, C π (t), to extract the flavour-singlet pseudoscalar meson mass, D(t) C π (t) t A B e mη t + e mη (NT t) e m πt + e m π(n T t), (3.4) where m π and m η are the masses of the π and η mesons and A, B are additional fit parameters. m π can be determined separately to the % level and inserted as a prior leaving only 3 parameters in the fit function. This approach also accounts for independent smearing of the connected and disconnected loops. In the case of the exact evaluation, the only source of error comes from the statistical error of the gauge ensemble, and therefore we will employ this fact to assess the results obtained using 3

Ratio of Disconnected to Connected.... 5 5 2 25 3 t/a Figure : Ratio D(t) C π (t) for the η meson computed using exact approach (without smearing in red and with smearing in green colours). stochastic methods. We compare exact results for the ratio D(t)/C π (t) obtained with local quark field operators with those obtained with smeared quark fields in Fig.. Given that m π can be determined to the % level it is clear that the gauge noise derived from the disconnected loops in this quantity is large. A naive fitting of the data constrained between t min = 2a and t max = a gives a value for the mass am L η =.4 ±.4 and am S η =.4 ±.5 for local and smeared operators, respectively. Here we used am π =.3454(9). On the other hand, fitting in the range t min = 3a and t max = a gives us accordingly am L η =.5 ±.7 and am S η =.49 ±.9. If we look at the same quantities using the truncated solver method, the corresponding plots in logarithmic scale (for local and smeared operators) are given in Fig. 2. The results of fitting in the range t min = 2a and t max = a gives am L η =.42±.7 for local fields and am S η =.42±.6 for smeared fields, respectively. Fitting in the range t min = 3a and t max = a provides the following estimates for the pseudo-scalar mass: am L η =.57 ±. and am S η =.5 ±.. These results are summarized in Table 3. The value of am η =.4(5) is relatively consistent with the results obtained using N f = 2 twisted mass fermions at m π 5 MeV [8]. We have also considered the stochastic estimate of the disconnected diagrams using Z 2 noise and 2 different approaches. First, we examine the number of noise vectors required such that one can reach a level of stochastic accuracy consistent with the statistical noise of the gaugefield ensemble. To this end, we have inspected the trace of zero-momentum projected disconnected loop Tr(S F (x,x)γ), for a range of operators Γ. For example, for the particular case of a dilution approach with a γ 5 operator insertion, the left of Fig. 3 shows the dependency of the magnitude of the error in the trace on the number of noise vectors. This figure shows the number of inversions, N inv, required for each dilution scheme along the x-axis, e.g., full colour dilution with full spin dilution would 4

Ratio of Disconnected to Connected.... e-5 e-6 5 5 2 25 3 t/a Figure 2: Ratio D(t) C π (t) for the η meson computed using TSM (without smearing in red and with smearing in green colours). Table : The η mass using exact and the truncated solver method (TSM) for the evaluation of the disconnected loop. t min t max am L η am S η Exact 2.4 ±.4.4 ±.5 Exact 3.5 ±.7.49 ±.9 TSM 2.42 ±.7.42 ±.6 TSM 3.57 ±..5 ±. require 2 inversions (a factor of 3 for each colour and 4 for each spin, as described earlier). As a reference, we also plot the optimal statistical error (behaving as Ninv ) extrapolating from the first data point to show the expected behaviour of increasing the ensemble size. Finally we insert the gauge-level noise (from the exact calculation) at the point where this gauge error is consistent with overall error from the optimal error extrapolation. As one can see, in the case of the γ 5 -operator the dilution approach is consistent with the optimal statistical error and one needs at least 37 inversions to achieve gauge noise accuracy. A similar analysis can be done for other disconnected loops, on the right of Fig. we show results for an identity operator insertion, where the gauge noise can be reached with just 5 noise vectors and, consequently, dilution can have no beneficial effect for the measurement. Clearly, the size of the stochastic ensemble required is operator-dependent, as noted in Ref. []. In Fig. 4, we show similar plots for a γ γ 3 operator insertion. On the left we show an identical 5

.8.8.6.6.4 - -.4 -.6.4 - -.4 -.8 -.6 - -.8 Figure 3: Magnitude of errors for Tr(S F (x,x)γ). Right: for the γ 5 -operator (gauge level noise achievable with 37 inversions); Left: for the identity operator (gauge level noise achievable with 5 inversions)..6.4 - -.4.5..5 -.5 -. -.5 -.6 - Figure 4: Magnitude of errors for Tr(S F (x,x)γ γ 3 ). Left: for all dilution schemes; Right: for all dilution schemes which include full spin dilution (gauge level noise achievable with 74 inversions) plot to those in Fig. 3 with the exception that the gauge noise cannot be reached within a sample size of and the gauge noise is simply plotted near this limit for illustrative purposes. On the right we plot the same data for the specific cases where full spin dilution is used. As can be seen, spin dilution has, in this case, a dramatic effect that allows the achievement of gauge level noise within 74 inversions. This effect is likely due to the strong off-diagonal nature of this gamma combination in this basis and has been also been observed in other quantities. Also, we again observe that dilution behaves consistently with the optimal statistical error. 4. Summary We have computed the disconnected contribution to η meson mass using both exact and stochastic evaluation. Stability in the fit region has not been observed and the level of noise from the gaugefield ensemble is large, particularly at large separations. We have also compared the 6

truncated solver method against the exact approach in our attempt to evaluate the η meson mass and find results consistent though non-conclusive between the two approaches. This is of course due to the gaugefield ensemble noise inherent in the quantity for the sample. We analyzed efficiency of different noise reduction techniques using the gaugefield ensemble noise from the exact evaluation as a benchmark. We find that, on these lattices, for the stochastic methods the gauge noise becomes the dominant source of the error already for 37 inversions in the case of Tr(S F (x,x)γ 5 ) and just 5 inversions in the case of Tr(S F (x,x)). The statistical error from using the dilution approach appears to behave similarly to statistical noise in many cases for these types of operators. In particular cases however, such as Tr(S F (x,x)γ γ 3 ), spin dilution has been seen to have a significant effect. Acknowledgments Alan Ó Cais is supported by the Research Promotion Foundation of Cyprus under grant ΠPOΣEΛKYΣH/ΠPONE 38/9. The production runs were carried out on an 8-node Tesla cluster at the Cyprus Institute, funded under the same grant. Additional production runs were carried out on the Lincoln cluster at the National Center for Supercomputing Applications at the University of Illinois. References [] Bali G. S., Collins S. and Schafer A., Effective noise reduction techniques for disconnected loops in Lattice QCD, Comput. Phys. Commun. 8 (2) 57 [arxiv:9.397 [hep-lat]]. [2] Barros K., Babich R., Brower R., Clark M. and Rebbi C., Blasting through lattice calculations using CUDA, PoS LAT28 (28) 45 [arxiv:8.5365 [hep-lat]]. [3] Foley Justin et al, Practical all-to-all propagators for lattice QCD, Comput. Phys. Commun. 72 (25) 45 [arxiv:hep-lat/5523]. [4] Bali G. S., Collins S. and Schafer A., Disconnected contributions to hadronic structure: a new method for stochastic noise reduction, PoS. LAT27 (27) 4 [arxiv:79.327 [hep-lat]]. [5] P. Hagler, J. W. Negele, D. B. Renner, W. Schroers, T. Lippert and K. Schilling [LHPC collaboration and SESAM collaboration], Phys. Rev. D 68, 3455 (23) [arxiv:hep-lat/348]. [6] C. Alexandrou, G. Koutsou, J. W. Negele and A. Tsapalis, Phys. Rev. D 74, 3458 (26) [arxiv:hep-lat/657]. [7] Clark M. A., Babich R.,Barros K., Brower R. C. and Rebbi C., Solving Lattice QCD systems of equations using mixed precision solvers on GPUs, Comput. Phys. Commun. 8 (2) 57 [arxiv:9.39 [hep-lat]]. [8] K. Jansen, C. Michael and C. Urbach [ETM Collaboration], Eur. Phys. J. C 58 (28) 26 [arxiv:84.387 [hep-lat]]. 7