Parallel adaptive methods for Feynman loop integrals. Conference on Computational Physics (CCP 2011)

Outline Elise de Doncker 1 Fukuko Yuasa 2 1 Department of Computer Science, Western Michigan University, Kalamazoo MI 49008, U. S. 2 High Energy Accelerator Research Organization (KEK), Tsukuba, Ibaraki 305-0801, Japan Conference on Computational Physics (CCP 2011)

Outline Outline 1 Introduction 2 Iterated Integration Priority driven adaptive algorithm Iterated vs. standard multivariate integration Error tolerance interface 3 4 Feynman loop diagrams 5 6

Introduction Automatic numerical integration Obtain an approximation Q(f) to an integral If = f( x) d x D in order to satisfy a specified accuracy requirement for the error Ef = Qf If (and an error estimate Ef ) such that: Qf If Ef max { t a, t r If } for given integrand function f, region D and (absolute/relative) error tolerances t a and t r.

Iterated integral Introduction Priority driven adaptive algorithm Iterated vs. standard multivariate integration Error tolerance interface Integration over a product region D = D 1... D l, Iterated integral If = d x (1)... d x (l) f( x (1),..., x (l) ), D 1 D l implemented using lower-dimensional code across successive groups of dimensions, j = 1,...,l. E.g., 1D integration code (such as DQAGE from Quadpack[15]) can be used for 1D levels; or a combination of 1D and multivariate methods (such as DCUHRE [4]) across levels.

Outline Introduction Priority driven adaptive algorithm Iterated vs. standard multivariate integration Error tolerance interface 1 Introduction 2 Iterated Integration Priority driven adaptive algorithm Iterated vs. standard multivariate integration Error tolerance interface 3 4 Feynman loop diagrams 5 6

Priority driven adaptive algorithm Priority driven adaptive algorithm Iterated vs. standard multivariate integration Error tolerance interface Sample problem Algorithm on each iterated level Evaluate initial region & update results Initialize priority queue to empty while (evaluation limit not reached and estimated error too large) Retrieve region from priority queue Split region Evaluate subregions & update results Insert subregions into priority queue (2D) k w kf(x k, y k ) (Subregion approx.) (1D 1D) i u i j v jf(x i, y j ) 0 dy 2δy = (x+y 1) 2 +δ 2 ] 2δy (x+y 1) 2 +δ 2 Figure: (for δ = 0.1) 1 0 dx 1 1 0 dx [ 1 0 dy

Priority driven adaptive algorithm Iterated vs. standard multivariate integration Error tolerance interface Iterated vs. standard multivariate integration (a) (b) Figure: (a) Standard subdivision; (b) Iterated adaptive strategy for singularity on diagonal

Priority driven adaptive algorithm Iterated vs. standard multivariate integration Error tolerance interface Iterated vs. standard multivariate integration 1 0 dx 1 0 dy 2δy (x+y 1) 2 +δ 2, δ = 10 p DQAGE DQAGE DCUHRE p ABS. ERR. # EVAL. ABS. ERR. # EVAL. 1 0.00e+00 21255 2.06e-12 144165 2 2.40e-13 93135 5.96e-12 1998675 3 3.49e-13 208035 1.37e-12 21040551 4 1.58e-13 388125 8.04e-12 99999963 5 4.49e-13 561585 4.40e-07 99999963 6 1.69e-09 527205 3.38e-02 99999963 7 1.42e-10 686745 1.99e+00 99999963 8 3.94e-10 902145 3.04e+00 99999963 9 3.20e-08 106965 3.13e+00 99999963 10 4.32e-09 1964385 3.14e+00 99999963 11 1.87e-01 58651365 3.14e+00 99999963

Modifications Elise de Doncker for relative and Fukuko error Yuasa tolerances Parallel adaptive also methods result. for Introduction Error tolerance interface Priority driven adaptive algorithm Iterated vs. standard multivariate integration Error tolerance interface For double integration, If = I 1 = b 1 a 1 dx b 2 a 2 dy f(x, y), an error is incurred at both inner and outer integration [14], Q I b1 a 1 dx E 2 (x) + E 1 t (2) a b 2 a 2 + t (1) a. The error control can be implemented using estimated errors [8]. Extension to general l times iterated integral: Q I t (1) a + D 1 t (2) a +... + where D j is the volume of region D j. l 1 j = 1 D j t (l) a

(1) On the rule or points level: As in non-adaptive algorithms, e.g., in Monte-Carlo algorithms and in composite rules using grid or lattice points, If = D f k w kf( x k ) : computation of the f( x k ) evaluation points in parallel; (2) On the region level: As in adaptive methods, task pool algorithms, load balancing (distributed memory systems); or maintaining shared priority queue structure; (3) On the iterated integration level: we compute inner integrals in parallel (within adaptive algorithm), e.g. (in 2D): over subregion S (inner region D 2 ) S F(x)dx k w kf(x k ), with F(x k ) = D 2 f(x k, y)dy; (4) Distributed computation of integrals from reductions.

Outline Introduction 1 Introduction 2 Iterated Integration Priority driven adaptive algorithm Iterated vs. standard multivariate integration Error tolerance interface 3 4 Feynman loop diagrams 5 6

Higher order corrections (in addition to the lowest order/tree level) are required for accurate theoretical predictions of cross-sections for particle interactions. Loop diagrams need to be taken into account, necessitating the evaluation of loop integrals. L-loop integral with N internal lines I = Γ `N nl Z 1 NY 2 ( 1) N dx (4π) nl/2 i δ(1 X C N n(l+1)/2 x i ) (D iδc) N nl/2 0 i=1 C and D are polynomials determined by the topology of the corresponding diagram and physical parameters. The integral only exists in the limit as δ 0.

Sample loop diagrams m, (p2) e, - (p1) t, (p4) m 2 γ, (p1) m M m 4 m 3 m m, (p3) e, + (p2) m 1 t, (p3) produced by GRACEFIG produced by GRACEFIG g b e, - (p1) m1 e, - (p5) m5 g g t W t W H b e, + (p2) m2 m3 m4 Z, (p4) e, + (p3) produced by GRACEFIG produced by GRACEFIG Figure: One-loop vertex; Box e e + t t; Diagram for g g b bh; Pentagon for e e + e e + Z [7]

One-loop integral example Sample box integral (real part): 1 0 dx 1 x dy 1 x y dz g(x,y) (d 2 δ 2 ) [10] 0 0 (d 2 +δ 2 ) 2 Figure: Surface where (quadratic) d(x, y, z) = 0

1 0 dx 1 x 0 dy 1 x y 0 dz f(x, y, z) Figure: Plot of inner integral (as a fn. of x, y) for sample box function

Outline Introduction 1 Introduction 2 Iterated Integration Priority driven adaptive algorithm Iterated vs. standard multivariate integration Error tolerance interface 3 4 Feynman loop diagrams 5 6

Introduction Automatic packages are available for one-loop [17, 1, 2, 11, 16, 13, 3, 5, 12]. However, many diagrams are required for an interaction; analytic integration not possible in general (for higher order and for general mass configurations). Numerical Direct Computation (DCM), (e.g., [9, 18, 6, 7]): numerical iterated integration + extrapolation/ sequence acceleration for limit calculation (as δ 0) Reduction + DCM: reduction applied for one-loop through hexagon, DCM for resulting 2D (triangle) and 3D (box) integral Motivation for DCM on multi-core, particularly for 3D and higher dimensions

Hexagon reduction Reduction n-dimensional N-point function set of box and triangle functions distribution of work load Representation: I n N = d n k iπ n/2 1 N l=1 ((k r l) 2 m 2 l ) with external momenta p j and r l = l j=1 p j. The n-dimensional hexagon, pentagon and box functions (N = 6, 5, 4) are expressed in terms of n-dimensional triangle and n + 2-dimensional box functions. In non-exceptional kinematic conditions, N-point functions with N 6 can be expressed in terms of pentagon functions.

Reduction n-dimensional N-point function Reduction I n N = N κ=1 B κi N 1,κ + (N n 1) det(g) det(s) In+2 N, det(s) 0, G is the Gram matrix, rank(g) = min{4, N 1} and B κ = N λ=1 S 1 κλ, S κλ = (r λ r κ ) 2 + mλ 2 + m2 κ, 1 κ,λ N hexagon I6 n = lin. combination of six pentagon In 5 functions, pentagon I5 n = lin. combination of five box In 4 fncs. + O(ε), box I4 n = lin. combination of four triangle In 3 and a box In+2 4 Infrared singularities emerge in the box and triangle functions through poles in 1 ε = 2 4 n and can be handled through sector decomposition.

Iterated integral parallelization (DCM for 3D) Timing results Figure: Time (s) for singular function

Iterated integral parallelization Figure: Time (s) for non-scalar loop integral (box) [10]

Iterated integral parallelization Figure: Time (s) for high degree polynomial

The two-level integration was tested with various schemes for the error control of inner and outer integrations. Extensions to a small number of levels have proved efficient with good results for several applications, e.g., in high energy physics and for some problems in biometrical modeling (higher dimensions). Computation of the integration rule can be parallelized on successive levels. Further extensions/generalizations: hybrid systems (cluster/distributed), multi/many-core nodes; more complex, higher-dimensional DCM problems.

BIBLIOGRAPHY BAUER, C. In KEK Proceedings 2002-11 (2002), vol. [MZ-TH/02-04], pp. 179 185. Do Hoang Son, Ph.D. thesis at the Physics Department, Johannes Gutenberg Univ., 2003. BÉLANGER, G., BOUDJEMA, F., FUJIMOTO, J., ISHIKAWA, T., KANEKO, T., KATO, K., AND SHIMIZU, Y. Automatic calculations in high energy physics and GRACE at one-loop. Physics Reports 430 (2006), 117 209.

BERGER, C. F., BERN, Z., DIXON, L. J., FEBRES CORDERO, F., FORDE, D., GLEISBERG, T., ITA, H., KOSOWER, D. A., AND MAITRE, D. Next-to-leading order qcd predictions for w+3-jet distributions at hadron colliders. Phys. Rev. D 80 (2009), 074036. arxiv:0907.1984 [hep-ph]. BERNTSEN, J., ESPELID, T. O., AND GENZ, A. Algorithm 698: DCUHRE-an adaptive multidimensional integration routine for a vector of integrals. ACM Trans. Math. Softw. 17 (1991), 452 456. BINOTH, T., GUILLET, J.-P., HEINRICH, G., PILON, E., AND REITER, T.

Golem95: a numerical program to calculate one-loop tensor integrals with up to six external legs. Comput. Phys. Commun. 180 (2009), 2317. DE DONCKER, E., FUJIMOTO, J., HAMAGUCHI, N., ISHIKAWA, T., KURIHARA, Y., SHIMIZU, Y., AND YUASA, F. Transformation, reduction and extrapolation techniques for. Springer Lecture Notes in Computer Science (LNCS) 6017 (2010), 139 154. DE DONCKER, E., FUJIMOTO, J., HAMAGUCHI, N., ISHIKAWA, T., KURIHARA, Y., SHIMIZU, Y., AND YUASA, F. Quadpack computation of. Journal of Computational Science (JoCS) (2011), doi:10.1016/j.jocs.2011.06.003.

DE DONCKER, E., AND KAUGARS, K. Dimensional recursion for multivariate adaptive integration. Procedia Computer Science 1 (2010). DE DONCKER, E., SHIMIZU, Y., FUJIMOTO, J., AND YUASA, F. Computation of loop integrals using extrapolation. Computer Physics Communications 159 (2004), 145 156. DE DONCKER, E., SHIMIZU, Y., FUJIMOTO, J., YUASA, F., CUCOS, L., AND VAN VOORST, J. Loop integration results using numerical extrapolation for a non-scalar integral. Nuclear Instruments and in Physics Research Section A 539 (2004), 269 273. hep-ph/0405098.

GIELE, W. T., AND ZANDERIGHI, G. On the numerical evaluation of one-loop amplitudes: the gluonic case. JHEP 0806 (2008), 038. HEINRICH, G., OSSOLA, G., REITER, T., AND TRAMONTANO, F. Tensorial reconstruction at the integrand level. JHEP 1010 (2010), 105. arxiv:1008.2441 [hep-ph]. HIRSCHI, V., FREDERIX, R., FRIXIONE, S., GARZELLI, M. V., MALTONI, F., AND PITTAU, R. Automation of one-loop qcd corrections. arxiv:1103.0621 [hep-ph]. KAHANER, D., MOLER, C., AND NASH, S.

Numerical and Software. Prentice Hall, 1988. PIESSENS, R., DE DONCKER, E., ÜBERHUBER, C. W., AND KAHANER, D. K. QUADPACK, A Subroutine Package for Automatic Integration. Springer Series in Computational Mathematics. Springer-Verlag, 1983. VAN HAMEREN, A., PAPADOPOULOS, C. G., AND PITTAU, R. Automated one-loop calculations: a proof of concept. JHEP 0909 (2009), 106. VAN OLDENBORGH, G. J., AND VERMASEREN, J. A. M. Automatic loop calculations with feynarts, formcalc and looptools. Nucl. Phys. Proc. Suppl. 89 (2000), 231.

YUASA, F., ISHIKAWA, T., FUJIMOTO, J., HAMAGUCHI, N., DE DONCKER, E., AND SHIMIZU, Y. Numerical evaluation of Feynman integrals by a direct computation method. In XII Adv. Comp. and Anal. Tech. in Phys. Res. (2008). PoS (ACAT08) 122; arxiv:0904.2823.