DIFFERENTIAL CRITERIA FOR POSITIVE DEFINITENESS

PROCEEDINGS OF THE AMERICAN MATHEMATICAL SOCIETY

J. A. PALMER

Abstract. We show how the Mellin transform can be used to derive differential criteria for integral representations of functions as scale mixtures, and apply this idea to derive differential criteria for positive definiteness of functions in a manner similar to that of Gneiting (1999, ...). We also give a simple derivation of Williamson's result on multiply monotone functions (Williamson, 1956), with a limiting process giving the Bernstein-Widder theorem.

1. Introduction

Recently, Gneiting (1999, ...) has derived differential criteria for positive definiteness based on finding necessary and sufficient conditions that a function be representable as a scale mixture of a certain function known to be positive definite. The use of differential criteria for integral representations is also seen in the Bernstein-Widder theorem on completely monotone functions, and in the theorem of Schoenberg (194...) and Williamson (1956) on multiply monotone functions. In this paper we show how the Mellin transform can be used to facilitate the analysis of scale mixture representations and the derivation of differential criteria for such representations.

1.1. Scale mixtures. Let $f$ and $g$ be real-valued functions, continuous on $[0,\infty)$. We say that $f$ is a scale mixture of $g$ if there is a non-decreasing function $\mu$, bounded on $[0,\infty)$, such that
\[ f(x) = \int_0^\infty g(x/\xi)\, d\mu(\xi), \qquad x \in [0,\infty), \tag{1} \]
where the integral will generally be the Riemann-Stieltjes integral. If $\mu$ is also continuous, then $d\mu(\xi) = h(\xi)\,\xi^{-1}\, d\xi$, with $h$ non-negative and $h(\xi)/\xi$ integrable on $(0,\infty)$, and
\[ f(x) = \int_0^\infty g(x/\xi)\, h(\xi)\, \frac{d\xi}{\xi}. \]
We refer to the latter as the scale convolution of $g$ and $h$. It is related to ordinary convolution through the change of variables $x = e^{z}$ and $\xi = e^{t}$, which yields
\[ f \circ \exp = (g \circ \exp) * (h \circ \exp), \tag{2} \]
where $*$ denotes convolution and $\circ$ denotes composition.
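The link between scale convolution and ordinary convolution under $x = e^{z}$, $\xi = e^{t}$ can be checked numerically. The following is a minimal sketch, assuming the scale convolution is taken with respect to $d\xi/\xi$; the functions `g` and `h` (Gaussians in $\log x$) and all helper names are illustrative choices made here because they give a closed form, and are not taken from the paper.

```python
import math

def g(x):
    # Illustrative kernel (an assumption, not from the paper): Gaussian in log x.
    return math.exp(-0.5 * math.log(x) ** 2)

def h(xi):
    # Illustrative mixing density, chosen equal to g so a closed form exists.
    return math.exp(-0.5 * math.log(xi) ** 2)

def scale_convolution(x, t_min=-12.0, t_max=12.0, n=4000):
    """Trapezoid rule for int_0^inf g(x/xi) h(xi) dxi/xi on log-spaced nodes xi."""
    nodes = [math.exp(t_min + (t_max - t_min) * i / n) for i in range(n + 1)]
    vals = [g(x / xi) * h(xi) / xi for xi in nodes]
    return sum(0.5 * (vals[i] + vals[i + 1]) * (nodes[i + 1] - nodes[i])
               for i in range(n))

def log_domain_convolution(z, t_min=-12.0, t_max=12.0, n=4000):
    """Trapezoid rule for the ordinary convolution ((g o exp) * (h o exp))(z)."""
    dt = (t_max - t_min) / n
    total = 0.0
    for i in range(n + 1):
        t = t_min + i * dt
        weight = 0.5 if i in (0, n) else 1.0
        total += weight * g(math.exp(z - t)) * h(math.exp(t))
    return total * dt

def closed_form(x):
    # Convolving exp(-z^2/2) with itself gives sqrt(pi) * exp(-z^2/4), z = log x.
    return math.sqrt(math.pi) * math.exp(-0.25 * math.log(x) ** 2)

x = 2.0
direct = scale_convolution(x)                    # integral in the xi variable
via_logs = log_domain_convolution(math.log(x))   # integral in the t variable
print(direct, via_logs, closed_form(x))
```

Both quadratures approximate the same function value, which is the content of the change of variables; the closed form is specific to the Gaussian-in-log pair chosen here.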
We generally use the Riemann-Stieltjes integral rather than the Lebesgue integral since the latter requires absolute integrability. We will however use Lebesgue integrals when they exist, and write $f \in L(0,\infty)$ when $f$ is Lebesgue integrable on $(0,\infty)$.

The two types of convolution arise together in the consideration of the distribution function of sums and products of measurable functions. It is well known that the distribution function of a finite sum of independent terms is the (ordinary) convolution of the respective distribution functions. Similarly, if $X(\omega)$ and $Y(\omega)$ are independent non-negative $\mu$-measurable functions with continuously differentiable distribution

© 1997 American Mathematical Society

1.2. Examples. (i) The class of functions that are completely monotonic on $(0,\infty)$ is equivalent to the class of scale mixtures of $e^{-x}$ by the Bernstein-Widder theorem. (ii) The class of functions $f(r)$ such that $f(\|x\|)$ is positive definite on $\mathbb{R}^n$ is equivalent to the class of scale mixtures of $r^{-\nu} J_\nu(r)$, where $\nu = (n-2)/2$ [2].

In this paper we shall be interested primarily in the fact that scale mixtures preserve positive definiteness. Thus, a test determining whether a function $f$ is a scale mixture of a given positive definite function is also a sufficient test for positive definiteness of $f$.

1.3. The Mellin transform. It is well known that the Fourier and Laplace transforms can be used to solve integral equations of the convolution type. Given (2), these transforms can also be used to solve equations of the scale convolution type. The latter however can be solved without changing variables using the Mellin transform [5, 4, 1], which is defined by
\[ \mathcal{M}[f(x); s] \equiv \tilde f(s) \equiv \int_0^\infty x^{s-1} f(x)\, dx \tag{3} \]
for $s \in \mathbb{C}$ such that the integral is convergent. If $x^{k-1} g(x)$ and $x^{k-1} h(x) \in L(0,\infty)$ for some $k \in \mathbb{R}$, then [5, Thm. 44],
\[ f(x) = \int_0^\infty g(x/\xi)\, h(\xi)\, \frac{d\xi}{\xi} \;\Longrightarrow\; \tilde f(s) = \tilde g(s)\, \tilde h(s) \tag{4} \]
and $x^{k-1} f(x) \in L(0,\infty)$. Then we can solve for $h(x)$ by inverting the transform. If $x^{k-1} f(x) \in L(0,\infty)$, then (3) can be inverted almost everywhere using the formula [5, Thm. 8],
\[ \mathcal{M}^{-1}[\tilde f(s); x] \equiv \frac{1}{2\pi i} \int_{k-i\infty}^{k+i\infty} x^{-s}\, \tilde f(s)\, ds = \frac{f(x+) + f(x-)}{2}, \tag{5} \]
where $f(x+)$ and $f(x-)$ denote the right and left hand limits of $f$ at $x$. Thus, for example, when $h$ in (4) is continuous on $(0,\infty)$, we have
\[ h(x) = \mathcal{M}^{-1}\!\left[ \frac{\tilde f(s)}{\tilde g(s)}; x \right], \qquad x \in (0,\infty). \]

We can similarly solve integral equations of the form $f(x) = \int_0^\infty g(x\xi)\, h(\xi)\, d\xi$. First note the following two properties, which follow readily from the definition (3):
\[ \mathcal{M}[x^{a} f(x); s] = \tilde f(s + a), \qquad \mathcal{M}[f(x^{a}); s] = |a|^{-1}\, \tilde f(s/a), \quad a \neq 0. \tag{6} \]

functions $G(x)$ and $H(x)$, then for the distribution function $F$ of $X(\omega)Y(\omega)$ we have
\[ F(x) = \int_0^\infty G(x/\xi)\, dH(\xi) = \int_0^\infty H(x/\xi)\, dG(\xi), \]
and for the derivative $f(x) \equiv F'(x)$, with $g \equiv G'$ and $h \equiv H'$, we have
\[ f(x) = \int_0^\infty g(x/\xi)\, \xi^{-1} h(\xi)\, d\xi = \int_0^\infty h(x/\xi)\, \xi^{-1} g(\xi)\, d\xi. \]
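The definition (3), the two elementary properties in (6), and the product rule (4) can be sanity-checked with a few lines of quadrature. This is a rough sketch for real $s$ only, assuming the scale convolution in (4) is taken with respect to $d\xi/\xi$; the helper `mellin` and the test functions are hypothetical names and choices introduced here, not part of the paper.

```python
import math

def mellin(f, s, z_min=-40.0, z_max=40.0, n=20000):
    """Approximate M[f; s] = int_0^inf x^(s-1) f(x) dx for real s.

    With x = e^z the integral becomes int e^(s z) f(e^z) dz over the real
    line, evaluated by the trapezoid rule on [z_min, z_max].
    """
    dz = (z_max - z_min) / n
    total = 0.0
    for i in range(n + 1):
        z = z_min + i * dz
        weight = 0.5 if i in (0, n) else 1.0
        total += weight * math.exp(s * z) * f(math.exp(z))
    return total * dz

def decay(x):
    return math.exp(-x)  # test function with M[e^{-x}; s] = Gamma(s)

s, a = 1.5, 0.7
base = mellin(decay, s)                           # definition (3)
shifted = mellin(lambda x: x ** a * decay(x), s)  # first property in (6)
squared = mellin(lambda x: decay(x ** 2), s)      # second property in (6), a = 2

# Product rule (4) for a Gaussian-in-log pair (illustrative choice):
# g = h = exp(-(log x)^2 / 2) has scale convolution sqrt(pi) * exp(-(log x)^2 / 4).
kernel = lambda x: math.exp(-0.5 * math.log(x) ** 2)
convolved = lambda x: math.sqrt(math.pi) * math.exp(-0.25 * math.log(x) ** 2)
lhs = mellin(convolved, 0.5)
rhs = mellin(kernel, 0.5) ** 2
print(base, shifted, squared, lhs, rhs)
```

The substitution $x = e^{z}$ is used only to make the quadrature well behaved near $x = 0$ and for large $x$; it is the same change of variables that relates scale convolution to ordinary convolution.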

We thus have $\mathcal{M}[t^{-1} h(t^{-1}); s] = \tilde h(1 - s)$, and it follows that
\[ f(x) = \int_0^\infty g(x\xi)\, h(\xi)\, d\xi \;\Longrightarrow\; \tilde f(s) = \tilde g(s)\, \tilde h(1 - s). \tag{7} \]
The integral in (7) is also referred to as a scale mixture. Another basic property we shall use is the scaling property:
\[ \mathcal{M}[f(ax); s] = a^{-s}\, \tilde f(s), \qquad a > 0. \tag{8} \]

Thus far, we have only considered functions for which $x^{k-1} f(x) \in L(0,\infty)$ for some $k \in \mathbb{R}$. In order to justify the use of transforms of functions like $\cos(x)$, for which $x^{k-1} f(x)$ is not absolutely integrable for any $k \in \mathbb{R}$, we require the following theorem [5, Thm. 3], which applies to functions for which there exists a $k \in \mathbb{R}$ such that $x^{k-1+it} f(x)$ is uniformly integrable with respect to $t$.

Theorem. Let the integral $\int_0^\infty x^{s-1} f(x)\, dx \equiv \tilde f(s)$ be uniformly convergent (as the limits are approached independently) for $s = k + it$, $t$ in any finite interval. Then,
\[ \frac{1}{2\pi i} \lim_{\lambda \to \infty} \int_{k-i\lambda}^{k+i\lambda} \left(1 - \frac{|t|}{\lambda}\right) x^{-s}\, \tilde f(s)\, ds = \frac{f(x+) + f(x-)}{2} \tag{9} \]
for all $x > 0$ such that $f(x+)$ and $f(x-)$ exist.

We can take (9) as the most general definition of the Mellin inverse since the $(C,1)$ summation in (9) is equivalent to the ordinary summation in (5) when the latter exists.

Like the Laplace transform, the Mellin transform can be used to convert integrodifferential equations into algebraic equations. We use in particular the following relation. Let $D$ denote the differential operator. The transform of the operator $D^{n}$ is given by
\[ \mathcal{M}[D^{n} f(x); s] = (-1)^{n}\, \frac{\Gamma(s)}{\Gamma(s - n)}\, \tilde f(s - n), \tag{10} \]
and from (6), we have
\[ \mathcal{M}[(-x)^{n} D^{n} f(x); s] = \frac{\Gamma(s + n)}{\Gamma(s)}\, \tilde f(s). \tag{11} \]
These formulae are valid for negative $n$, where $D^{-n} \equiv I^{n}$ denotes the $n$th iterated integral, with $I \equiv \int_0^x$. For example, if $f$ is integrable on $(0, x)$, we have
\[ \mathcal{M}\!\left[ \int_0^x f(t)\, dt; s \right] = -s^{-1}\, \tilde f(s + 1). \tag{12} \]
For $f$ integrable on $(x, \infty)$, $x > 0$, we have
\[ \mathcal{M}\!\left[ \int_x^\infty f(t)\, dt; s \right] = s^{-1}\, \tilde f(s + 1). \tag{13} \]
We will also use the following result on differentiation in the transform domain:
\[ \mathcal{M}[(\log x)^{n} f(x); s] = \frac{d^{n}}{ds^{n}}\, \tilde f(s) \equiv \tilde f^{(n)}(s). \tag{14} \]
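The operational rules above (scaling, the product form of the scale mixture, the transforms of $(-x)^{n} D^{n} f$, of the two running integrals, and of $(\log x) f$) can each be checked on $f(x) = e^{-x}$, for which $\tilde f(s) = \Gamma(s)$. The sketch below is illustrative only: the quadrature helper `mellin` is a hypothetical name, $s$ is restricted to real values, and the test function is an assumption chosen because every right-hand side is available in closed form.

```python
import math

def mellin(f, s, z_min=-40.0, z_max=40.0, n=20000):
    """Trapezoid approximation of int_0^inf x^(s-1) f(x) dx via x = e^z (real s)."""
    dz = (z_max - z_min) / n
    total = 0.0
    for i in range(n + 1):
        z = z_min + i * dz
        weight = 0.5 if i in (0, n) else 1.0
        total += weight * math.exp(s * z) * f(math.exp(z))
    return total * dz

EULER_GAMMA = 0.5772156649015329  # Gamma'(1) = -EULER_GAMMA
s = 1.5

# Scaling: M[f(3x); s] = 3^(-s) * Gamma(s).
scaled = mellin(lambda x: math.exp(-3.0 * x), s)
# Product form: int_0^inf e^{-x xi} e^{-xi} dxi = 1/(1+x), transform Gamma(s) Gamma(1-s).
product_mix = mellin(lambda x: 1.0 / (1.0 + x), 0.5)
# (-x) f'(x) = x e^{-x} and (-x)^2 f''(x) = x^2 e^{-x}.
first = mellin(lambda x: x * math.exp(-x), s)
second = mellin(lambda x: x ** 2 * math.exp(-x), s)
# int_x^inf e^{-t} dt = e^{-x}, valid here for s > 0.
tail = mellin(lambda x: math.exp(-x), s)
# int_0^x e^{-t} dt = 1 - e^{-x}; the transform exists for -1 < s < 0.
head = mellin(lambda x: -math.expm1(-x), -0.5)
# (log x) e^{-x} at s = 1 gives Gamma'(1).
logged = mellin(lambda x: math.log(x) * math.exp(-x), 1.0)

print(scaled, product_mix, first, second, tail, head, logged)
```

Note that the rule for $\int_0^x f$ and the rule for $\int_x^\infty f$ hold on different strips of $s$, which is why the two checks use different values of $s$.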

Regarding the Mellin transform of Stieltjes integrals, we note that if $\mu$ is non-decreasing and bounded on $(0,\infty)$, and $\mu(x)$ is $O(x^{\epsilon})$, $\epsilon > 0$, as $x \to 0$, then $\tilde\mu(s)$ exists for $-\epsilon < \mathrm{Re}(s) < 0$, and
\[ \int_0^\infty x^{s-1}\, d\mu(x) = -(s - 1)\, \tilde\mu(s - 1) \tag{15} \]
in this region.

We define the class of functions $M_k$, $k \in \mathbb{R}$, to consist of all real-valued functions on $(0,\infty)$ such that either (i) $x^{k-1} f(x)$ is absolutely integrable on $(0,\infty)$, or (ii) $f$ is bounded on $[0,\infty)$, and $\int_0^\infty x^{k-1+it} f(x)\, dx$ is uniformly convergent for $t$ in any finite interval. If $f \in M_k$, then $\mathcal{M}^{-1}[\mathcal{M}[f(x); s]; x] = f(x)$ at all points of continuity of $f$, where the contour integral is taken on the line $\mathrm{Re}(s) = k$. Based on the preceding, we formulate the following theorem for the type of scale mixture considered in the sequel.

Lemma 1. If $f, g \in M_k$ for some $k \in \mathbb{R}$, then there exists a function $\mu$ non-decreasing and bounded on $(0,\infty)$ such that
\[ f(x) = \int_0^\infty g(x\xi)\, d\mu(\xi) \tag{16} \]
if and only if
\[ \mathcal{M}^{-1}\!\left[ -s^{-1}\, \frac{\tilde f(-s)}{\tilde g(-s)}; x \right] \tag{17} \]
is non-decreasing and bounded on $(0,\infty)$.

Proof. Suppose that (16) holds with $\mu$ non-decreasing and bounded on $(0,\infty)$, and let $\mu(0+) = 0$. Note that $\int_0^\infty x^{s-1}\, d\mu(x) = -(s - 1)\, \tilde\mu(s - 1)$. We have
\[ \int_0^\infty x^{s-1} \left[ \int_0^\infty g(x\xi)\, d\mu(\xi) \right] dx = \int_0^\infty \left[ \int_0^\infty x^{s-1} g(x\xi)\, dx \right] d\mu(\xi) = s\, \tilde g(s)\, \tilde\mu(-s), \]
where $s = k + it$, and the interchange in the order of integration is justified since $g \in M_k$ and thus either $g$ is bounded, in which case the bracketed integral on the left is absolutely convergent, or $x^{k-1} g(x)$ is absolutely integrable, in which case the bracketed integral on the right is absolutely convergent. Thus,
\[ -s^{-1}\, \frac{\tilde f(-s)}{\tilde g(-s)} = \tilde\mu(s), \]
and the Mellin inverse of the left side is equal to $\mu(x)$ at all points of continuity of $\mu$, and to $(\mu(x+) + \mu(x-))/2$ at points of discontinuity, and is thus non-decreasing and bounded. Conversely, suppose that (17) is non-decreasing and bounded on $[0,\infty)$. Then ...

We assume in the sequel that any non-decreasing bounded function $\mu$ used in an integral satisfies $\mu(0) = 0$ and is thus uniquely determined at points of continuity.
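The transform identity used in the proof, $\tilde f(s) = s\, \tilde g(s)\, \tilde\mu(-s)$, can be illustrated on a concrete mixture. The sketch below assumes $g(x) = e^{-x}$ and $\mu(\xi) = 1 - e^{-\xi}$, an illustrative pair chosen here (not taken from the paper) for which $f(x) = 1/(1 + x)$; the helpers `mellin` and `mixture` are hypothetical names, and $s$ is real.

```python
import math

def mellin(f, s, z_min=-40.0, z_max=40.0, n=20000):
    """Trapezoid approximation of int_0^inf x^(s-1) f(x) dx via x = e^z (real s)."""
    dz = (z_max - z_min) / n
    total = 0.0
    for i in range(n + 1):
        z = z_min + i * dz
        weight = 0.5 if i in (0, n) else 1.0
        total += weight * math.exp(s * z) * f(math.exp(z))
    return total * dz

def g(x):
    return math.exp(-x)

def mu(xi):
    # Non-decreasing, bounded, mu(0+) = 0, and O(xi) as xi -> 0.
    return -math.expm1(-xi)

def mixture(x, upper=40.0, n=40000):
    """Riemann-Stieltjes sum for int_0^inf g(x xi) dmu(xi) with midpoint tags."""
    step = upper / n
    total = 0.0
    for i in range(n):
        lo, hi = i * step, (i + 1) * step
        total += g(x * 0.5 * (lo + hi)) * (mu(hi) - mu(lo))
    return total

s = 0.5                                    # f and g both lie in M_k for k = 0.5
f_tilde = mellin(lambda x: 1.0 / (1.0 + x), s)
g_tilde = mellin(g, s)
mu_tilde_neg = mellin(mu, -s)              # mu~(-s), which exists for 0 < s < 1

print(mixture(2.0), 1.0 / 3.0)
print(f_tilde, s * g_tilde * mu_tilde_neg)
```

Only the forward direction is exercised here; evaluating the inverse transform in (17) would need a contour integral and is outside the scope of this sketch.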
2. Differential criteria for scale-mixture representations

The derivation of differential criteria for the representation of functions as scale mixtures is based on the ...

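2.1. Multiply monotone functions. A function is defined to be $n$-times monotone, $n \geq 2$, if $(-1)^{n} f^{(n-2)}(x)$ is non-negative, non-increasing, and convex for $x > 0$. It is shown in [3] that an $n$-times monotone function $f$ can be represented in the form
\[ f(t) = \int_0^\infty (1 - ut)_+^{n-1}\, d\gamma(u), \tag{18} \]
where $\gamma(t)$ is non-decreasing, and $(a)_+ \equiv \max(a, 0)$. The argument in [3] is based on the integration by parts development of the Taylor series expansion of $f$. Alternatively, using Lemma 1 to invert (18), we have
\[ \gamma(t) = \mathcal{M}^{-1}\!\left[ -s^{-1}\, \frac{\tilde f(-s)}{\mathcal{M}[(1 - t)_+^{n-1}; -s]}; t \right] \tag{19} \]
at all points of continuity. For the Mellin transform of $(1 - t)_+^{n-1}$, we have
\[ \mathcal{M}[(1 - t)_+^{n-1}; s] = \int_0^1 t^{s-1} (1 - t)^{n-1}\, dt = \frac{\Gamma(s)\,\Gamma(n)}{\Gamma(s + n)}, \]
where we identify the integral as the Beta function. Noting that $\mathcal{M}^{-1}[\tilde\gamma(-s); t] = \gamma(t^{-1})$, we have for (19),
\[ \gamma\!\left(\frac{1}{t}\right) = \frac{1}{\Gamma(n)}\, \mathcal{M}^{-1}\!\left[ \frac{\Gamma(s + n)}{s\, \Gamma(s)}\, \tilde f(s); t \right]. \tag{20} \]
Using (13), (15), and (10) we have
\[ \mathcal{M}\!\left[ \int_t^\infty (-u)^{n-1}\, df^{(n-1)}(u); s \right] = s^{-1} (-1)^{n-1} \int_0^\infty u^{s+n-1}\, df^{(n-1)}(u) = -s^{-1}\, \frac{(s + n - 1)\, \Gamma(s + n - 1)}{\Gamma(s)}\, \tilde f(s) = -s^{-1}\, \frac{\Gamma(s + n)}{\Gamma(s)}\, \tilde f(s). \]
Using this in (20), we have
\[ \gamma(t) = -\frac{1}{\Gamma(n)} \int_{1/t}^\infty (-u)^{n-1}\, df^{(n-1)}(u) = \frac{1}{\Gamma(n)} \int_{0+}^{t} (-u)^{1-n}\, df^{(n-1)}\!\left(\frac{1}{u}\right), \tag{21} \]
which is equivalent to the corresponding representation in [3]. Thus $f$ is $n$-times monotone if and only if $\gamma(t)$ defined by (21) is non-decreasing, i.e. $(-1)^{n} f^{(n-1)}(t)$ is non-decreasing on $(0,\infty)$.

2.2. Complete monotonicity. A function is defined to be completely monotonic if it satisfies the inequalities $(-1)^{n} f^{(n)}(x) \geq 0$ for $x > 0$, $n = 0, 1, 2, \ldots$. The Bernstein-Widder theorem states that a function $f$ is completely monotonic if and only if it can be represented in the form
\[ f(x) = \int_0^\infty e^{-tx}\, d\mu(t) \tag{22} \]
with $\mu$ non-decreasing on $(0,\infty)$. Since $\mathcal{M}[e^{-x}; s] = \Gamma(s)$, applying Lemma 1 to (22) with $g(x) = e^{-x}$ gives the following.

Lemma 2. $f$ is completely monotonic on $(0,\infty)$ if and only if
\[ \mathcal{M}^{-1}\!\left[ -s^{-1}\, \frac{\tilde f(-s)}{\Gamma(-s)}; t \right] \tag{23} \]
is non-decreasing and bounded on $(0,\infty)$.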
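The representation (18) and the formula for $\gamma$ can be illustrated on $f(t) = e^{-t}$, which is $n$-times monotone for every $n$. For this choice (an assumption made here for illustration), $df^{(n-1)}(u) = (-1)^{n} e^{-u}\, du$, so that $\gamma(t) = \Gamma(n)^{-1} \int_{1/t}^\infty u^{n-1} e^{-u}\, du$, which for integer $n$ has the finite-sum form used below. The function names are hypothetical, and the Beta-function identity is checked alongside by plain midpoint quadrature.

```python
import math

def gamma_measure(u, n):
    """gamma(u) for f(t) = exp(-t): (1/Gamma(n)) * int_{1/u}^inf v^(n-1) e^(-v) dv."""
    if u <= 0.0:
        return 0.0
    y = 1.0 / u
    # Closed form of the regularised upper incomplete gamma function for integer n.
    return math.exp(-y) * sum(y ** k / math.factorial(k) for k in range(n))

def williamson_mixture(t, n, steps=20000):
    """Riemann-Stieltjes sum for int_0^inf (1 - u t)_+^(n-1) dgamma(u).

    The integrand vanishes for u > 1/t, so the sum runs over (0, 1/t].
    """
    step = (1.0 / t) / steps
    total = 0.0
    for i in range(steps):
        lo, hi = i * step, (i + 1) * step
        mid = 0.5 * (lo + hi)
        total += (1.0 - mid * t) ** (n - 1) * (gamma_measure(hi, n) - gamma_measure(lo, n))
    return total

def beta_integral(s, n, steps=200000):
    """Midpoint rule for int_0^1 t^(s-1) (1 - t)^(n-1) dt."""
    step = 1.0 / steps
    return step * sum(((i + 0.5) * step) ** (s - 1) * (1.0 - (i + 0.5) * step) ** (n - 1)
                      for i in range(steps))

t = 0.5
reconstructed = [williamson_mixture(t, n) for n in (2, 3, 4)]
beta_value = beta_integral(1.5, 3)
print(reconstructed, math.exp(-t), beta_value)
```

Here `gamma_measure` is increasing in `u`, in line with the monotonicity criterion, and the Stieltjes sums recover $e^{-t}$ for each $n$ tried.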

3. Positive definiteness on $\mathbb{R}^n$

Let $P_n$ denote the class of functions positive definite on $\mathbb{R}^n$. Schoenberg [2] used the polar representation of radially symmetric functions (functions of $\|x\|$) and Bochner's theorem to show that a real-valued radially symmetric function $f$ on $\mathbb{R}^n$ is positive definite if and only if $f(x) = g(\|x\|)$ with
\[ g(r) = \int_0^\infty (ru)^{-\nu} J_\nu(ru)\, d\alpha(u), \]
where $\alpha(u)$ is non-decreasing and bounded for $u \geq 0$, and $\nu = (n - 2)/2$. Using the fact that [4, p. 63],
\[ \mathcal{M}[r^{-\nu} J_\nu(r); s] = 2^{\,s - n/2}\, \frac{\Gamma\!\left(\frac{s}{2}\right)}{\Gamma\!\left(\frac{n - s}{2}\right)}, \tag{24} \]
with Lemma 1 and the scaling property (8), we have
\[ 2^{-n/2}\, \alpha(2u) = \mathcal{M}^{-1}\!\left[ -s^{-1}\, \frac{\Gamma\!\left(\frac{n + s}{2}\right)}{\Gamma\!\left(-\frac{s}{2}\right)}\, \tilde g(-s); u \right] \]
at all points of continuity.

Lemma 3. $f \in P_n$ if and only if
\[ \mathcal{M}^{-1}\!\left[ \Gamma(s) \cos\!\left(\frac{\pi s}{2}\right) \tilde f(s); t \right] > 0, \qquad t > 0. \tag{25} \]

Theorem 1. $f \in P_n$ if and only if XXX P.

Theorem 2. $f$ is positive definite on $\mathbb{R}$ if and only if $\int \exp(-t \ldots)\, f(t)\, dt$, as a function of $x$, is completely monotonic.

4. Functions with compact support

Substituting $x = e^{-z}$ in (3), we see that the Mellin transform is equivalent to the bilateral Laplace transform of $f(e^{-x})$. Now let $f(x)$ be concentrated on $[0, 1]$, with $f(0) = 1$ and $f(1) = 0$. Then the support of $f(e^{-x})$ is $[0, \infty)$, and the Mellin transform of $f$ is equivalent to the one-sided Laplace transform of $f(e^{-x})$.

Theorem 3.

Theorem 4.
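The equivalence between the Mellin transform of a function concentrated on $[0, 1]$ and the one-sided Laplace transform of $f(e^{-z})$ can be seen on a small example. The sketch below assumes $f(x) = 1 - x$ on $[0, 1]$, a hypothetical choice made here only because both transforms equal $1/s - 1/(s + 1)$ in closed form; the helper names are likewise illustrative, and $s$ is real.

```python
import math

def f(x):
    # Illustrative function concentrated on [0, 1] with f(0) = 1 and f(1) = 0.
    return 1.0 - x if 0.0 <= x <= 1.0 else 0.0

def mellin_on_unit_interval(func, s, steps=100000):
    """Midpoint rule for int_0^1 x^(s-1) func(x) dx (s >= 1 keeps the integrand bounded)."""
    step = 1.0 / steps
    return step * sum(((i + 0.5) * step) ** (s - 1) * func((i + 0.5) * step)
                      for i in range(steps))

def one_sided_laplace(func, s, z_max=40.0, steps=100000):
    """Trapezoid rule for int_0^inf e^(-s z) func(z) dz, truncated at z_max."""
    step = z_max / steps
    total = 0.0
    for i in range(steps + 1):
        z = i * step
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * math.exp(-s * z) * func(z)
    return total * step

s = 2.0
mellin_value = mellin_on_unit_interval(f, s)
laplace_value = one_sided_laplace(lambda z: f(math.exp(-z)), s)
exact = 1.0 / s - 1.0 / (s + 1.0)
print(mellin_value, laplace_value, exact)
```

The two quadratures are related by the substitution $x = e^{-z}$, under which $(0, 1]$ maps to $[0, \infty)$.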

References

[1] L. Debnath. Integral transforms and their applications. CRC Press, 1995.
[2] I. J. Schoenberg. Metric spaces and completely monotone functions. Annals of Mathematics, 39(4):811-841, 1938.
[3] R. E. Williamson. Multiply monotone functions and their Laplace transforms. Duke Math. J., 23:189-207, 1956.
[4] I. N. Sneddon. The use of integral transforms. McGraw-Hill, 1972.
[5] E. C. Titchmarsh. Introduction to the theory of Fourier integrals. Oxford: Clarendon Press, 2nd edition, 1948.

Electrical and Computer Engineering Dept., Univ. of California San Diego
Current address: La Jolla, CA 92093
E-mail address: japalmer@ucsd.edu