Strictly positive definite functions on a real inner product space

Similar documents
Strictly Positive Definite Functions on a Real Inner Product Space

The Learning Problem and Regularization Class 03, 11 February 2004 Tomaso Poggio and Sayan Mukherjee

arxiv: v3 [math.ra] 10 Jun 2016

Smoothness and uniqueness in ridge function representation

Strictly Hermitian Positive Definite Functions

Non-absolutely monotonic functions which preserve non-negative definiteness

On Ridge Functions. Allan Pinkus. September 23, Technion. Allan Pinkus (Technion) Ridge Function September 23, / 27

Approximation of Multivariate Functions

On Euclidean distance matrices

BP -HOMOLOGY AND AN IMPLICATION FOR SYMMETRIC POLYNOMIALS. 1. Introduction and results

Reproducing Kernel Hilbert Spaces Class 03, 15 February 2006 Andrea Caponnetto

Reproducing Kernel Hilbert Spaces

Means of unitaries, conjugations, and the Friedrichs operator

FUNCTIONAL ANALYSIS LECTURE NOTES: COMPACT SETS AND FINITE-DIMENSIONAL SPACES. 1. Compact Sets

290 J.M. Carnicer, J.M. Pe~na basis (u 1 ; : : : ; u n ) consisting of minimally supported elements, yet also has a basis (v 1 ; : : : ; v n ) which f

Kernel Method: Data Analysis with Positive Definite Kernels

Reproducing Kernel Hilbert Spaces

Kernels for Multi task Learning

Complexity and regularization issues in kernel-based learning

Decomposition of Riesz frames and wavelets into a finite union of linearly independent sets

j=1 [We will show that the triangle inequality holds for each p-norm in Chapter 3 Section 6.] The 1-norm is A F = tr(a H A).

1 Directional Derivatives and Differentiability

Positive definite preserving linear transformations on symmetric matrix spaces

Packing, coding, and ground states From information theory to physics. Lecture III. Packing and energy minimization bounds in compact spaces

Course 212: Academic Year Section 1: Metric Spaces

THE ALTERNATIVE DUNFORD-PETTIS PROPERTY FOR SUBSPACES OF THE COMPACT OPERATORS

Section 3.9. Matrix Norm

Interpolation on lines by ridge functions

EXERCISES. a b = a + b l aq b = ab - (a + b) + 2. a b = a + b + 1 n0i) = oii + ii + fi. A. Examples of Rings. C. Ring of 2 x 2 Matrices

a i,1 a i,j a i,m be vectors with positive entries. The linear programming problem associated to A, b and c is to find all vectors sa b

Exercise Solutions to Functional Analysis

MODULE 8 Topics: Null space, range, column space, row space and rank of a matrix

3. Some tools for the analysis of sequential strategies based on a Gaussian process prior

Fundamentality of Ridge Functions

Random Feature Maps for Dot Product Kernels Supplementary Material

Uniqueness of the Solutions of Some Completion Problems

A CHARACTERIZATION OF STRICT LOCAL MINIMIZERS OF ORDER ONE FOR STATIC MINMAX PROBLEMS IN THE PARAMETRIC CONSTRAINT CASE

Hankel Operators plus Orthogonal Polynomials. Yield Combinatorial Identities

Elementary linear algebra

The spectra of super line multigraphs

Geometry and topology of continuous best and near best approximations

Kernel Methods. Outline

Some Properties of the Augmented Lagrangian in Cone Constrained Optimization

Lower Bounds for Approximation by MLP Neural Networks

Elements of Positive Definite Kernel and Reproducing Kernel Hilbert Space

Complete Nevanlinna-Pick Kernels

Lecture 7: Positive Semidefinite Matrices

2. Linear algebra. matrices and vectors. linear equations. range and nullspace of matrices. function of vectors, gradient and Hessian

On some positive definite functions

Kaczmarz algorithm in Hilbert space

An Interlacing Property of Eigenvalues of Strictly Totally Positive Matrices

arxiv: v1 [math.ca] 7 Jan 2015

MATH 225 Summer 2005 Linear Algebra II Solutions to Assignment 1 Due: Wednesday July 13, 2005

Boolean Inner-Product Spaces and Boolean Matrices

MATH SOLUTIONS TO PRACTICE MIDTERM LECTURE 1, SUMMER Given vector spaces V and W, V W is the vector space given by

A 2 G 2 A 1 A 1. (3) A double edge pointing from α i to α j if α i, α j are not perpendicular and α i 2 = 2 α j 2

9.1 Eigenvectors and Eigenvalues of a Linear Map

SCHUR IDEALS AND HOMOMORPHISMS OF THE SEMIDEFINITE CONE

Invertible Matrices over Idempotent Semirings

Reproducing Kernel Hilbert Spaces

Kernel Methods. Machine Learning A W VO

Convergence of a Generalized Gradient Selection Approach for the Decomposition Method

642:550, Summer 2004, Supplement 6 The Perron-Frobenius Theorem. Summer 2004

Assignment 1: From the Definition of Convexity to Helley Theorem

l(y j ) = 0 for all y j (1)

Strictly Positive Definite Functions on the Circle

EC 521 MATHEMATICAL METHODS FOR ECONOMICS. Lecture 1: Preliminaries

Szemerédi s Lemma for the Analyst

An Introduction to Linear Matrix Inequalities. Raktim Bhattacharya Aerospace Engineering, Texas A&M University

MATH 631: ALGEBRAIC GEOMETRY: HOMEWORK 1 SOLUTIONS

Problem Set 5: Solutions Math 201A: Fall 2016

Notes on uniform convergence

(a) For an accumulation point a of S, the number l is the limit of f(x) as x approaches a, or lim x a f(x) = l, iff

Chapter 2 Linear Transformations

RKHS, Mercer s theorem, Unbounded domains, Frames and Wavelets Class 22, 2004 Tomaso Poggio and Sayan Mukherjee

CSC Linear Programming and Combinatorial Optimization Lecture 10: Semidefinite Programming

Here each term has degree 2 (the sum of exponents is 2 for all summands). A quadratic form of three variables looks as

On the inverse matrix of the Laplacian and all ones matrix

A 3 3 DILATION COUNTEREXAMPLE

N-WEAKLY SUPERCYCLIC MATRICES

NEWMAN S INEQUALITY FOR INCREASING EXPONENTIAL SUMS

POSITIVE POLYNOMIALS LECTURE NOTES (09: 10/05/10) Contents

UNIQUENESS OF THE UNIFORM NORM

Topological vectorspaces

Some analysis problems 1. x x 2 +yn2, y > 0. g(y) := lim

Linear Algebra March 16, 2019

ENTANGLED STATES ARISING FROM INDECOMPOSABLE POSITIVE LINEAR MAPS. 1. Introduction

Real Analysis Math 131AH Rudin, Chapter #1. Dominique Abdi

CIS 520: Machine Learning Oct 09, Kernel Methods

A note on the generalization performance of kernel classifiers with margin. Theodoros Evgeniou and Massimiliano Pontil

Reproducing Kernel Hilbert Spaces

ANALYSIS QUALIFYING EXAM FALL 2017: SOLUTIONS. 1 cos(nx) lim. n 2 x 2. g n (x) = 1 cos(nx) n 2 x 2. x 2.

MATH 131A: REAL ANALYSIS (BIG IDEAS)

Vector Calculus. Lecture Notes

Finite dimensional topological vector spaces

Stability of Adjointable Mappings in Hilbert

Immerse Metric Space Homework

Copositive Plus Matrices

Theory of Positive Definite Kernel and Reproducing Kernel Hilbert Space

A GENERAL FORMULATION FOR SUPPORT VECTOR MACHINES. Wei Chu, S. Sathiya Keerthi, Chong Jin Ong

Transcription:

Advances in Computational Mathematics 20: 263 271, 2004. 2004 Kluwer Academic Publishers. Printed in the Netherlands. Strictly positive definite functions on a real inner product space Allan Pinkus Department of Mathematics, Technion, I. I. T., Haifa, 32000, Israel Received 5 March 2002; accepted 21 June 2002 Communicated by C.A. Micchelli If ft)= a k t k converges for all t R with all coefficients a k 0, then the function f x, y ) is positive definite on H H for any inner product space H.SetK ={k: a k > 0}. We show that f x, y ) is strictly positive definite if and only if K contains the index 0 plus an infinite number of even integers and an infinite number of odd integers. Keywords: strictly positive definite, inner product space AMS subject classification: 42A82, 41A05 1. Introduction Let H be any real inner product space,, the inner product thereon, and B a subset of H. We will say that a function f : R R is positive definite on B if for any choice of x 1,...,x n in B and any n, the matrix f x i, x j )) n is positive definite. Equivalently, for all x 1,...,x n in B and c 1,...,c n R c i f x i, x ) j c j 0. We say that a function f : R R is strictly positive definite on B if for any choice of distinct x 1,...,x n in B and any n, the matrix f x i, x j )) n is strictly positive definite. Equivalently, for x i as above, and every c 1,...,c n R, not all zero, c i f x i, x ) j c j > 0.

264 A. Pinkus / Strictly positive definite functions This is not the usual definition of a strictly) positive definite function which classically considers f x y ) rather than f x, y ).) We know that for any x 1,...,x n in B the matrix x i, x j ) n is itself positive definite. That is, given c 1,...,c n R c i x i, x j c j = ci x i,c j x j 2 = c i x i 0. It is natural to ask for conditions on a function f : R R such that f x, y ) is positive definite or, additionally, strictly positive definite on a particular B. It is known see, e.g., [7; 1, p. 159; 4]) that if for all positive definite matrices a ij ) n and for all n) the matrix f a ij )) n is positive definite, then f is necessarily of the form ft)= a k t k 1.1) with a k 0forallk, and where the series converges for all t R f is real entire, and absolutely monotone on R + ). Thus if we demand that f be positive definite on H where H is infinite-dimensional, or where H = R d with the usual inner product x, y = d x i y i and for all d, thenf is necessarily of the form 1.1). However there are H for which condition 1.1) is not necessary in order that f be positive definite on H. For example, if H = R with x,y =xy x,y R), then the functions ft) = t p and ft) = sgn t) t p give rise to positive definite functions for any p > 0. To the best of my knowledge, there is no complete characterization of positive definite functions on R d, for any fixed d. In this paper we will always assume that f has the form of 1.1). Our main result will be that among all such functions f is strictly positive definite on H if and only if the set K ={k: a k > 0} 1.2) contains the index 0 plus an infinite number of even integers and an infinite number of odd integers. This result is in fact valid for any B in any real inner product space H which contains an interval symmetric about the origin.

A. Pinkus / Strictly positive definite functions 265 The motivation for considering this question comes partially from certain papers in Learning Theory, see, e.g., [2,8 11]. Positive definite kernels Rx, y) = f x, y ) give rise to reproducing kernel Hilbert spaces RKHS). That is, with respect to some suitably chosen inner products, ) there is defined the reproducing space of functions g defined on B satisfying gy) = g ), f, y )) for y B. Iff x, y ) is positive definite, but not strictly positive definite, on B, then c i f x i, x ) j c j = 0 for some distinct x 1,...,x n B and c 1,...,c n not all zero. Moreover in this case 0 = c i f x i, x j ) c j = c i f, x i ), c j f, x j )) which immediately implies that = c i f, x i ) c i g x i) = 0 for every g in this RKHS. This is generally an undesirable property, especially if one wishes to use this RKHS as an approximating set. Thus it is preferable that the kernel be strictly positive definite. The approximating property just alluded to may have many forms. One possible form see [9,10] for something similar) is to demand that span { f, y ) : y B } be dense in CB) where B is any compact subset of H. If H = R d for any fixed d, and f is positive definite, then the property of f x, y ) being strictly positive definite is of course necessary for density. However it is not sufficient. Assuming B contains a neighborhood of the origin and f is of the form 1.1), then the necessary and sufficient condition for density, as above, is that K contains 0, an infinite set of even positive) indices {m i }, and an infinite set of odd indices {n i } such that 1 m i = 1 n i =. j=1

266 A. Pinkus / Strictly positive definite functions This is a Müntz condition. This result may be essentially found in [3]. Moreover, this result has nothing to do with positive definite functions. This density property holds for every real entire function f where K is now the set of integers associated with nonzero coefficients, i.e., the set of k for which a k 0andthea k may be negative). And of course there are other nonentire and nonpositive definite f for which the density property also holds. 2. Main result The main result of this paper is the following: Theorem. Assume ft)= with a k 0, all k, is convergent for all t, and a k t k K ={k: a k > 0}. Let H be any real inner product space and let B be a subset of H containing {αz: α c} for some c>0andz H, z =1. Then f is strictly positive definite on B if and only if K contains the index 0 plus an infinite number of even integers and an infinite number of odd integers. Proof. We will first prove the necessity in the case H = R and x,y = xy. We assume [ c, c] B for some c>0andletx 1,...,x n be n distinct real values in B. The matrix fxi x j ) ) n is strictly positive definite if and only if c i fx i x j )c j > 0 2.1) for every choice of c 1,...,c n not all zero. As ft)= a k t k is convergent for all t, we may rewrite 2.1) as a k c i x i x j ) k c j = a k c i x k i ) 2.

Since the a k 0forallk,wehave A. Pinkus / Strictly positive definite functions 267 ) 2 a k c i xi k 0. Our aim is to determine exact conditions on K implying strict positivity for all choices of c 1,...,c n not all zero. In other words, we want necessary and sufficient conditions on K such that there exist no choice of distinct points x 1,...,x n in B and c 1,...,c n not all zero such that c i xi k = 0 2.2) for all k K. To see that we must have 0 K, letx 1 = 0, c 1 0andn = 1. Then c 1 f0)c 1 = a 0 c 2 1 > 0 implies that a 0 > 0, i.e., 0 K. Assume K contains only a finite number, say m, of even indices. Then given any distinct strictly positive x 1,...,x in 0,c] there exist c 1,...,c, not all zero, such that for each k K which is even. Thus c i xi k = 0 c i xi k + c i x i ) k = 0 for each k K which is even, and every k odd. In other words we have found a sum of the form 2.2) with all ±x i B which vanishes for all k K. Similarly, if K contains only a finite number, say m, of odd indices, then given distinct strictly positive x 1,...,x in 0,c] there exist c 1,...,c, not all zero, such that for each k K which is odd. Thus c i xi k = 0 c i xi k c i x i ) k = 0 for each k K which is odd, and every k even. In other words, we have found a sum of the form 2.2) with ±x i B which vanishes for all k K. This proves that if fxy)

268 A. Pinkus / Strictly positive definite functions is strictly positive definite on B then K must contain the index 0 plus an infinite number of even integers and an infinite number of odd integers. Now assume we have any real inner product space H,andB H as in the statement of the theorem. We show that the necessity follows easily from the case H = R. Let us assume that the matrices f x i, x j )) n are strictly positive definite for all choices of distinct x 1,...,x n B and all n. Then for any distinct x 1,...,x n [ c, c], andalln, the matrices f xi z,x j z )) n = fx i x j ) ) n are strictly positive definite. Recall that z, z =1.) This is the previous case with B =[ c, c]. Thus necessarily K must contain the index 0 plus an infinite number of even integers and an infinite number of odd integers. We now prove the sufficiency in the case H = R. We assume that 0 K and K contains an infinite number of even integers and an infinite number of odd integers. We will show that f is strictly positive definite on all of R, and thus on every B R. From the above it suffices to show that there cannot exist distinct x 1,...,x n,andc 1,...,c n not all zero, some n, such that c i xi k = 0 for each k K. It is easily shown that since 0 K it in fact suffices to show that there cannot exist distinct nonzero x 1,...,x n, and nonzero c 1,...,c n,somen, such that c i xi k = 0 2.3) for each k K = K\{0}. Assume this is not the case, and such c i and x i as in 2.3) do exist. Let λ = max { x i : i = 1,...,n }, and Thus y i = x i λ. c i yi k = 0

A. Pinkus / Strictly positive definite functions 269 for an infinite number of even and odd) integers k. Given any ε>0there exists an N such that for k>n c i yi k <ε. Thus y i <1 c i yi k <ε y i =1 for an infinite number of even and odd integers k. The set {i: y i =1} contains either one or two values. If it contains one value i, then c i <εfor every ε > 0, a contradiction. If it contains two values i and j, then implying that c i ± c j <ε c i, c j <ε for every ε>0, which is again a contradiction. This contradiction to our assumption proves sufficiency in the case H = R. It remains to prove the sufficiency in the case of any real inner product space H. Our proof is based on the following result. Let x 1,...,x n be distinct points in H.Then there exist distinct real numbers b 1,...,b n and a positive definite matrix m ij ) n such that x i, x j = b i b j + m ij. We defer the proof of this result. As the m ij ) n is a positive definite matrix, we have c i x i, x j c j = c i b i b j c j + c i m ij c j c i b i b j c j. Similarly, it is well known that c i x i, x j k cj c i b i b j ) k c j. This is a consequence of inequalities between the Schur Hadamard) product of two positive definite matrices. See, e.g., [5, p. 310].) Thus c i f x i, x j ) c j = c i x i, x j k cj a k a k c i b i b j ) k c j = ) 2 a k c i bi k.

270 A. Pinkus / Strictly positive definite functions However as the b i are distinct real values and K contains 0 plus an infinite number of even integers and an infinite number of odd integers, we know that the last term in the above sequence of inequalities is strictly positive for all choices of c 1,...,c n not all zero. This proves the sufficiency. It remains to prove the following result. Proposition. Let H be any real inner product space, and x 1,...,x n be n distinct points in H. Then there exist distinct real numbers b 1,...,b n and a positive definite matrix m ij ) n such that x i, x j = bibj + mij. Proof. Set a ij = x i, x j. We first claim that as the x 1,...,x n are distinct points in H, the positive definite matrix A = a ij ) n has no two identical rows or columns). Assume this is not the case. Then for some i, j {1,...,n} we have whichisthesameas a ii = a ij = a ji = a jj x i, x i = x i, x j = x j, x j. It is now easily checked that x j, x j x i x i, x j x j = 0 which implies that x i = x j contradicting our assumption. The remaining steps of the proof may be found as an exercise in [6, p. 287]. For completeness we record it here. Since A is a positive definite matrix it may be decomposed as A = C T C where C = c kj ) m k=1 n j=1 is an m n matrix and ranka = m. Thus m a ij = c ki c kj, i,j = 1,...,n. k=1 As no two rows of A are identical, it follows that no two columns of C are identical. There therefore exists a v R m, v =1, for which vc = b with b = b 1,...,b n ) where the {b i } n are distinct.

A. Pinkus / Strictly positive definite functions 271 Now let V be any m m orthogonal matrix V T V = I) whose first row is the above v. Let U be the m m matrix whose first row is the above v and all its other entries are zero. Then A = C T C = C T V T VC = C T U T UC + C T V U) T V U)C since U T V U) = V U) T U = 0. From the above, it follows that C T U T UC) ij = b i b j and M = C T V U) T V U)C is positive definite. This completes the proof of the proposition. Remark. If B is bounded, then the above theorem will hold for every ft)= a kt k, a k 0, which converges in the ball of radius sup{ x : x B}. References [1] C. Berg, J.P.R. Christensen and P. Ressel, Harmonic Analysis on Semigroups Springer, New York, 1984). [2] C.J.C. Burges and D.J. Crisp, Uniqueness of the SVM solution, in: NIPS Neural Information Processing Systems), Vol. 12, eds. S.A. Solla, T.K. Leen and K.-R. Müller 2000) pp. 223 229. [3] W. Dahmen and C.A. Micchelli, Some remarks on ridge functions, Approx. Theory Appl. 3 1987) 139 143. [4] C.H. FitzGerald, C.A. Micchelli and A. Pinkus, Functions that preserve families of positive semidefinite matrices, Linear Algebra Appl. 221 1995) 83 102. [5] R. Horn and C.R. Johnson, Topics in Matrix Analysis Cambridge Univ. Press, Cambridge, 1991). [6] G. Pólya and G. Szegö, Problems and Theorems in Analysis, Vol. II Springer, Berlin, 1976). [7] I.J. Schoenberg, Positive definite functions on spheres, Duke Math. J. 9 1942) 96 108. [8] A.J. Smola, B. Schölkopf and K.-R. Müller, The connection between regularization operators and support vector kernels, Neural Networks 11 1998) 637 649. [9] I. Steinwart, On the influence of the kernel on the consistency of support vector machines, J. Machine Learning Res. 2 2001) 67 93. [10] I. Steinwart, Support vector machines are universally consistent, Preprint. [11] V.N. Vapnik, The Nature of Statistical Learning Theory Springer, New York, 1995).