L p Approximation of Sigma Pi Neural Networks

Similar documents
arxiv: v2 [cs.ne] 20 May 2016

THE information capacity is one of the most important

Approximation of Multivariate Functions

On Ridge Functions. Allan Pinkus. September 23, Technion. Allan Pinkus (Technion) Ridge Function September 23, / 27

Applied Mathematics Letters

Upper Bounds on the Number of Hidden Neurons in Feedforward Networks with Arbitrary Bounded Nonlinear Activation Functions

Another Proof of Nathanson s Theorems

2.2 Some Consequences of the Completeness Axiom

Locally convex spaces, the hyperplane separation theorem, and the Krein-Milman theorem

THE CANTOR GAME: WINNING STRATEGIES AND DETERMINACY. by arxiv: v1 [math.ca] 29 Jan 2017 MAGNUS D. LADUE

ON THE CONSTRUCTION OF DUALLY FLAT FINSLER METRICS

Your first day at work MATH 806 (Fall 2015)

Small Cycle Cover of 2-Connected Cubic Graphs

4 CONNECTED PROJECTIVE-PLANAR GRAPHS ARE HAMILTONIAN. Robin Thomas* Xingxing Yu**

arxiv: v1 [math.cv] 21 Sep 2007

Your first day at work MATH 806 (Fall 2015)

Part V. 17 Introduction: What are measures and why measurable sets. Lebesgue Integration Theory

THE DIRECT SUM, UNION AND INTERSECTION OF POSET MATROIDS

On the mean connected induced subgraph order of cographs

On Linear Operators with Closed Range

NAME: Mathematics 205A, Fall 2008, Final Examination. Answer Key

Comments and Corrections

NON-MONOTONICITY HEIGHT OF PM FUNCTIONS ON INTERVAL. 1. Introduction

The Drazin inverses of products and differences of orthogonal projections

Output Input Stability and Minimum-Phase Nonlinear Systems

Multiplication Operators with Closed Range in Operator Algebras

POINTWISE PRODUCTS OF UNIFORMLY CONTINUOUS FUNCTIONS

IN THIS PAPER, we consider a class of continuous-time recurrent

THE NEARLY ADDITIVE MAPS

Decomposing Bent Functions

Part III. 10 Topological Space Basics. Topological Spaces

Decomposition of Riesz frames and wavelets into a finite union of linearly independent sets

Notes on Integrable Functions and the Riesz Representation Theorem Math 8445, Winter 06, Professor J. Segert. f(x) = f + (x) + f (x).

FINITE AFFINE PLANES. An incidence structure α = (P α, L α ) is said to be a finite affine plane if the following axioms are

Second Order and Higher Order Equations Introduction

A Generalization of Bernoulli's Inequality

Continuous functions that are nowhere differentiable

Houston Journal of Mathematics. c 2008 University of Houston Volume 34, No. 4, 2008

Section II.1. Free Abelian Groups

RELATION BETWEEN SMALL FUNCTIONS WITH DIFFERENTIAL POLYNOMIALS GENERATED BY MEROMORPHIC SOLUTIONS OF HIGHER ORDER LINEAR DIFFERENTIAL EQUATIONS

Theorems. Theorem 1.11: Greatest-Lower-Bound Property. Theorem 1.20: The Archimedean property of. Theorem 1.21: -th Root of Real Numbers

Chapter 2 Metric Spaces

Optimum Sampling Vectors for Wiener Filter Noise Reduction

ON µ-compact SETS IN µ-spaces

Spaces of continuous functions

CHAPTER 7. Connectedness

On a Balanced Property of Compositions

Topology Proceedings. COPYRIGHT c by Topology Proceedings. All rights reserved.

Prediction-based adaptive control of a class of discrete-time nonlinear systems with nonlinear growth rate

Properties of the Integers

a + b = b + a and a b = b a. (a + b) + c = a + (b + c) and (a b) c = a (b c). a (b + c) = a b + a c and (a + b) c = a c + b c.

#A42 INTEGERS 10 (2010), ON THE ITERATION OF A FUNCTION RELATED TO EULER S

The Hausdorff measure of a Sierpinski-like fractal

ON OPTIMAL ESTIMATION PROBLEMS FOR NONLINEAR SYSTEMS AND THEIR APPROXIMATE SOLUTION. A. Alessandri C. Cervellera A.F. Grassia M.

EQUICONTINUITY AND SINGULARITIES OF FAMILIES OF MONOMIAL MAPPINGS

A MATROID EXTENSION RESULT

Neural-Network Methods for Boundary Value Problems with Irregular Boundaries

Convergence to Common Fixed Point for Two Asymptotically Quasi-nonexpansive Mappings in the Intermediate Sense in Banach Spaces

STRUCTURAL AND SPECTRAL PROPERTIES OF k-quasi- -PARANORMAL OPERATORS. Fei Zuo and Hongliang Zuo

Normed Vector Spaces and Double Duals

METRIC HEIGHTS ON AN ABELIAN GROUP

Finite-dimensional spaces. C n is the space of n-tuples x = (x 1,..., x n ) of complex numbers. It is a Hilbert space with the inner product

Ring Sums, Bridges and Fundamental Sets

Oscillation Criteria for Delay Neutral Difference Equations with Positive and Negative Coefficients. Contents

Math 117: Topology of the Real Numbers

Delay-dependent Stability Analysis for Markovian Jump Systems with Interval Time-varying-delays

GRAY codes were found by Gray [15] and introduced

Strictly Positive Definite Functions on the Circle

Stability of a Class of Singular Difference Equations

Global Asymptotic Stability of a General Class of Recurrent Neural Networks With Time-Varying Delays

Results on stability of linear systems with time varying delay

Direction of Movement of the Element of Minimal Norm in a Moving Convex Set

CHAPTER 8: EXPLORING R

THE STONE-WEIERSTRASS THEOREM AND ITS APPLICATIONS TO L 2 SPACES

SUPPLEMENT TO CHAPTER 3

Chapter 1. Vectors, Matrices, and Linear Spaces

1030 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 56, NO. 5, MAY 2011

The best generalised inverse of the linear operator in normed linear space

MULTIPLE SOLUTIONS FOR AN INDEFINITE KIRCHHOFF-TYPE EQUATION WITH SIGN-CHANGING POTENTIAL

NEWMAN S INEQUALITY FOR INCREASING EXPONENTIAL SUMS

Set-orderedness as a generalization of k-orderedness and cyclability

Wittmann Type Strong Laws of Large Numbers for Blockwise m-negatively Associated Random Variables

Research Article Product of Extended Cesàro Operator and Composition Operator from Lipschitz Space to F p, q, s Space on the Unit Ball

Stabilization and Controllability for the Transmission Wave Equation

GENERALIZED CANTOR SETS AND SETS OF SUMS OF CONVERGENT ALTERNATING SERIES

A Complete Stability Analysis of Planar Discrete-Time Linear Systems Under Saturation

PM functions, their characteristic intervals and iterative roots

Some topics in analysis related to Banach algebras, 2

Chapter 3: Baire category and open mapping theorems

HAIYUN ZHOU, RAVI P. AGARWAL, YEOL JE CHO, AND YONG SOO KIM

arxiv: v2 [math.co] 19 Jun 2018

Bulletin of the. Iranian Mathematical Society

Multilayer feedforward networks are universal approximators

On the bang-bang property of time optimal controls for infinite dimensional linear systems

Asymptotic behavior for sums of non-identically distributed random variables

CS489/698: Intro to ML

Math 550 Notes. Chapter 2. Jesse Crawford. Department of Mathematics Tarleton State University. Fall 2010

1 Topology Definition of a topology Basis (Base) of a topology The subspace topology & the product topology on X Y 3

THEOREMS, ETC., FOR MATH 515

An introduction to some aspects of functional analysis

Transcription:

IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 11, NO. 6, NOVEMBER 2000 1485 L p Approximation of Sigma Pi Neural Networks Yue-hu Luo and Shi-yi Shen Abstract A feedforward Sigma Pi neural networks with a single hidden layer of neural is given by =1 =1 where. In this paper, we investigate the approximation of arbitrary functions : by a Sigma-Pi neural networks in the norm. For an locally integrable function ( ) can approximation any given function, if and only if ( ) can not be written in the form 1 =1 =0 (ln ). Index Terms approximation, neural network, Sigma Pi function. I. INTRODUCTION ONE OF the most interesting questions connection neural networks is the approximation capability of neural networks. There have been many papers related to this topic. For the reference one can read [1] [13] and the references given there. In this paper, the -approximation capability of Sigma Pi neural networks is investigated. In mathematical terminology, it is to find some conditions such under which all the linear combinations of elements are dense in for any compact set in, where is a variant, are constants. Let be a function on. is said to be a SP-function if all functions of the following form: are dense in for any compact set in. is said to be an SP-function if all functions of the following form: are dense in for any compact set in. is said to be an TW-function if all functions of the form (2) are dense in for any compact set in. It has been proved in [9] and [10] if is continuous on then is a SP-function if and only if can not be written in the following form: Manuscript received October 8, 1998; revised June 3, 1999. Y.-h. Luo is with the Department of Mathematics, Nanjin University of Science and Technology, Nanjin, China, 210094. S.-y. Shen is with the Department of Mathemathics, Nankai University, Tianjin, China, 300071. Publisher Item Identifier S 1045-9227(00)09849-0. (1) (2) (3) Recently, the present authors in [15] have proved the previous result still holds under the condition is locally Riemman integrable, i.e., if is locally Riemman integrable on then is an SP-function if and only if can not be written in the form (3). Therefore, if the previous conditions are satisfied, then is an SP-function. It is well known an Locally integrable function on may neither be continuous, nor be locally Riemman integrable. It is natural to ask what is characteristic condition for an locally integrable function to be an SP-function. The purpose of this paper is to answer this question and give a necessary and sufficient condition to solve the problem. The remainder of this paper is organized as follows. Our main results are presented in Section II. In order to prove the main results, several lemmas are given in Section III. Finally, the proof of the main results is presented in Section IV. II. MAIN RESULTS The main results in this paper are as follows. Theorem 1: Let be locally integrable on with. Then is an SP-function if and only if can not be written in the following form on (a.e): where are constants. The following Corollary 2 is an immediate consequence of Theorem 1. Corollary 2: Let be locally integrable on with. Then is an TW-function if and only if is not a polynomial on (a.e). Remark 3: Hornik in [4] and M. Leshno et al. in [3] proved is a TW-function if is essentially locally bounded and nonpolynomial on. Cheng in [12] proved is an TW-function if is locally integrable and belongs to (i.e., the integral is existent for any rapidly decreasing infinitely differentiable function on ). Corollary 2 is a generalization of these results. III. SOME NOTATIONS AND LEMMAS For,, an -multi-index, i.e., and each coordinate of is a nonnegative integer, let,,, if each, and be a function on defined by (4) (5) 1045 9227/00$10.00 2000 IEEE

1486 IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 11, NO. 6, NOVEMBER 2000 For a function on and a function on, let be the ort of, i.e.,. Let where is a variant. 1 And let be defined by (6) (7) (8) Lemma 1 [9]: Let be a compact set in and. If, then belongs to the closure of in. Lemma 2 [9], [10]: Let. Then the following statements are equivalent: 1) the closure of in is equal to for any compact set in ; 2) cannot be written in the form (3). Lemma 3: Suppose and. Then can not be written in the form (4) on the interval, if and only if for any -multiindex there exists an such Proof: Necessity. Otherwise, there exists an such (10) -multiindex (11) For a function on and a function on, and are defined by (12) and (9) for any integer with, where. By some computation, we see (12) is equivalent to respectively. Let be the set consisting of all infinitely continuously differentiable functions on. For a set in, let is is compact and is compact and is compact and locally integrable on If is -times differential at, is the partial derivative of at, i.e., If is locally integrable on, then is the general derivative, i.e., is a linear functional on defined by In order to prove Theorem 1, we need the following Lemmas. Lemma 3 Lemma 6 can be proved by the argument used in [3], [4], [9], [10], [15]. Lemma 1 is Corollary 4 in [9], Lemma 2 is Theorem 10 in [9] (or Corollary in [10]). (13) where s are constants depending on. In other words, is a solution of the following differential equation of order : (14) It is easy to verify for any with and,, and are solutions of (14). Thus any solution of (14) may be written in the form (4) because,, and are linearly independent. This completes the proof of the necessity. Sufficiency. It is easy to see satisfies (12) if can be written in the form (4) on, which completes the proof of sufficiency. Lemma 4: Let, and. If can not be written in the form (4) on, then there exists a such can not be written in the form (4) on. Proof: Otherwise, may be written in the form (4) on for any. Since, applying Lemma 1 to, we see for any there exists an -multiindex such 1 (x) 6= 0is satisfied

IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 11, NO. 6, NOVEMBER 2000 1487 (15) where,,. Since is a Frechét space (see, e.g., [14, example 1.46]), it follows from (15) and the Bair s category theory there exits at least a such contains a nonempty open set,. Letting be an integer with and applying Lemma 3 to each element in,we see can be written in the form (4) with on for any. Take such converges to in. Such a sequence of functions is clearly existent. Let be the subspace spanned by,. It is obvious is of finite dimension, hence is closed. According to the previous conclusions proved, belongs to the subspace. Since, the closeness of implies, i.e., can be written in the form (4) on, which is a contradiction! The proof is then complete. Lemma 5: 1) Let,, be a compact set in and,. Then the closure of in ; 2) Let, and be a compact set in. Then the closure of in. Proof (1): From the assumption, we may assume with. From the integrability of,wehave (16) Let. For any positive integer, let : be a partition of with. It is clear belongs to. Thus, it follows from (16) and the following inequality: (17) This complete the proof of the conclusion (1). The proof of the conclusion (2) is similar to of the conclusion (1). And the detail is omitted. Lemma 6: Let and be a nonnegative integer,. Then the following statements are equivalent: 1) for any compact set in ; 2) for any -multi-index, there exists a such (18) 3) for any -multi-index, there exists a such (18) is satisfied. Proof: (2) (3) is obvious. (1) (2): Otherwise, there exists an -multi-index such. Let be a compact with. Then, for any and with, we have (19) (20) Taking into account the assumption (1), (20) implies,, which is clearly impossible since. This completes the proof of (1) (2). (3) (1): For any -multi-index, let be an -multiindex with and. According to the assumption, there exists a such (18) is satisfied. Let be the function defined by. Then belongs to according to Lemma 5. It is easy to see (18) implies. Thus, by Lemma 5 and Lemma 1,. Hence, is dense in. The proof of (3) (1) is complete. IV. THE PROOF OF THE THEOREM The Proof of Theorem 1: Necessity: Otherwise, can be written in the form (4). In this case, it can be verified may be written in the form following form: (21)

1488 IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 11, NO. 6, NOVEMBER 2000 where is a constant and each is an locally integrable function on. [In fact, it is easy to see can be written in the form (21) with for any with and, which yield (21).] By (21), a straightforward calculation shows the general derivative of satisfies the following equation: for any If If, we have, we have, from (25), we have (27) Hence, by Lemma 4, there exists a compact set in such, a contradiction. This completes the proof of the necessity. Sufficiency: Without loss of generality, we may assume,. Otherwise, we may choose a such possess such a property. If can not be written in the form (4) on some interval with, then Lemma 4 implies there exists a such can not be written in the form (4) on. It is easy to verify. Thus, by Lemma 3 and Lemma 6, is dense in. If may be written in the form (4) on each interval with, may be written in the form (4) on and on, respectively. Thus, may be written in the following form: where (22) where,if,,if, and are constants. Cases I: for any with. Then is continuous on and can not written in the form (3). Thus it from Lemma 3 and Lemma 6 is dense in. Cases II: There exists an integer with such. Let be the largest with and. For any, take such. Let, where is or. It is easy to see Moreover it can be verified, for, (28) (29) For any, from (26) (29), there exits a such (23) (24) which yields (30) (25) where is the number of elements in the set. Let satisfy. If, then. Thus, from (24), we have (26) Hence, is dense in according to Lemma 6, which implies is dense in. The proof of the sufficiency is then complete. V. CONCLUSION In this paper, the problem on the approximation of several variables by Sigma Pi neural networks is investigated. A nec-

IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 11, NO. 6, NOVEMBER 2000 1489 essary and sufficient condition for an locally integrable function to be qualified as an active function in Sigma Pi neural networks is obtained. ACKNOWLEDGMENT The authors wish to express their gratefulness to the reviewers for their valuable comments and suggestion on revising this paper. REFERENCES [1] Y. Ito, Approximation of continuous function on by linear combinations of shifted rotations of a sigmiodal function without scaling, Neural Networks, vol. 5, no. 5, pp. 105 115, 1992. [2] H. N. Mhaskar, Approximation by superposition of sigmoidal and radial functions, Advances Appl. Math., vol. 13, pp. 350 373, 1992. [3] M. Leshno et al., Multilayer feedforward networks with a nonpolynomial activation function can approximate any function, Neural Networks, vol. 4, no. 6, pp. 861 867, 1993. [4] K. Hornik, Some new results on neural-network approximation, Neural Networks, vol. 6, no. 6, pp. 1096 1072, 1993. [5] B. Lenze, Note on a density question for neural works, Numer. Func. Optim., vol. 15, no. 7 & 8, pp. 909 913, 1994. [6] T. P. Chen, H. Chen, and R.-w. Liu, Approximation capability in C( ) by multilayer feedforward network and problems, IEEE Trans. Neural Networks, vol. 6, pp. 25 31, Jan. 1995. [7] T. P. Chen and H. Chen, Universal approximation to nonlinear operators by neural networks with arbitrary activation functions and its application to dynamical systems, IEEE Trans. Neural Networks, vol. 6, pp. 911 917, July 1995. [8] J. G. Attali and G. Pagès, Approximations by a multilayer perceptron: A new approach, Neural Networks, vol. 10, no. 6, pp. 1069 1081, 1997. [9] A. Pinkus, TDI-subspace of C( ) and some density problems from neural networks, J. Approx. Theory, vol. 85, pp. 269 287, 1996. [10] T. P. Chen and X. W. Wu, Characteristics of activation function in Sigma pi neural networks, Fudan Daxie Xie Bao, vol. 36, no. 6, pp. 639 644, 1997. [11] T. P. Chen and H. Chen, University approximation capability to function of EBF neural networks with arbitrary activation functions, Circuits Syst. Signal Processing, vol. 15, no. 5, pp. 671 683, 1996. [12] T. P. Chen, Approximation problems in systems from neural networks, Sci. China, vol. 36, no. 4, pp. 414 421, 1994. [13] C. M. Li and T. P. Chen, Approximation problems in Sigma Pi neural networks, Chinese Sci. Bull., vol. 41, no. 8, pp. 1072 1074, 1996. [14] W. Rudin, Functional Analysis. New York: McGraw-Hill, 1973. [15] Y. H. Luo and S. Y. Shen, Approximation problems of Sigma Pi neural networks, Appl. Math. J. Chinese Uni. Ser. A, vol. 15, no. 1, pp. 107 112, 2000.