An improved convergence theorem for the Newton method under relaxed continuity assumptions

Similar documents
AN ITERATIVE METHOD FOR COMPUTING ZEROS OF OPERATORS SATISFYING AUTONOMOUS DIFFERENTIAL EQUATIONS

Convergence of a Third-order family of methods in Banach spaces

The Journal of Nonlinear Sciences and Applications

The Newton-Kantorovich (NK) Method

Newton Method with Adaptive Step-Size for Under-Determined Systems of Equations

Journal of Computational and Applied Mathematics

New w-convergence Conditions for the Newton-Kantorovich Method

On the influence of center-lipschitz conditions in the convergence analysis of multi-point iterative methods

On the Midpoint Method for Solving Generalized Equations

A FIXED POINT PROOF OF THE CONVERGENCE OF A NEWTON-TYPE METHOD

YURI LEVIN AND ADI BEN-ISRAEL

Fixed point iteration Numerical Analysis Math 465/565

EXPANDING THE CONVERGENCE DOMAIN FOR CHUN-STANICA-NETA FAMILY OF THIRD ORDER METHODS IN BANACH SPACES

Cycling in Newton s Method

ANTIPODAL FIXED POINT THEORY FOR VOLTERRA MAPS

Nonlinear equations. Norms for R n. Convergence orders for iterative methods

Convergence analysis of a one-step intermediate Newton iterative scheme

Numerical Methods for Large-Scale Nonlinear Systems

Functional Analysis. Franck Sueur Metric spaces Definitions Completeness Compactness Separability...

j=1 r 1 x 1 x n. r m r j (x) r j r j (x) r j (x). r j x k

Cubic regularization of Newton s method for convex problems with constraints

arxiv: v1 [math.na] 25 Sep 2012

Kantorovich s Majorants Principle for Newton s Method

Suppose that the approximate solutions of Eq. (1) satisfy the condition (3). Then (1) if η = 0 in the algorithm Trust Region, then lim inf.

Convergence Conditions for the Secant Method

On a Class of Multidimensional Optimal Transportation Problems

x 2 x n r n J(x + t(x x ))(x x )dt. For warming-up we start with methods for solving a single equation of one variable.

Spectral gradient projection method for solving nonlinear monotone equations

A NOTE ON Q-ORDER OF CONVERGENCE

Some Properties of the Augmented Lagrangian in Cone Constrained Optimization

An Iteratively Regularized Projection Method with Quadratic Convergence for Nonlinear Ill-posed Problems

Optimality Conditions for Nonsmooth Convex Optimization

Existence and uniqueness of solutions for nonlinear ODEs

THE INVERSE FUNCTION THEOREM

Unconstrained optimization

An introduction to Mathematical Theory of Control

YURI LEVIN, MIKHAIL NEDIAK, AND ADI BEN-ISRAEL

Bulletin of the Transilvania University of Braşov Vol 10(59), No Series III: Mathematics, Informatics, Physics, 63-76

MATH 202B - Problem Set 5

EQUIVALENCE OF TOPOLOGIES AND BOREL FIELDS FOR COUNTABLY-HILBERT SPACES

Convergence Rates in Regularization for Nonlinear Ill-Posed Equations Involving m-accretive Mappings in Banach Spaces

How large is the class of operator equations solvable by a DSM Newton-type method?

Iterative Solution of a Matrix Riccati Equation Arising in Stochastic Control

Inverse Function Theorem

Subdifferential representation of convex functions: refinements and applications

DYNAMICS ON THE CIRCLE I

Exercise Solutions to Functional Analysis

FIXED POINT ITERATIONS

at time t, in dimension d. The index i varies in a countable set I. We call configuration the family, denoted generically by Φ: U (x i (t) x j (t))

1.4 The Jacobian of a map

A NEW ITERATIVE METHOD FOR THE SPLIT COMMON FIXED POINT PROBLEM IN HILBERT SPACES. Fenghui Wang

Journal of Complexity. New general convergence theory for iterative processes and its applications to Newton Kantorovich type theorems

THIRD-ORDER FAMILY OF METHODS IN BANACH SPACES

17 Solution of Nonlinear Systems

X. Linearization and Newton s Method

Accelerating the cubic regularization of Newton s method on convex problems

On intermediate value theorem in ordered Banach spaces for noncompact and discontinuous mappings

CHAPTER 2: CONVEX SETS AND CONCAVE FUNCTIONS. W. Erwin Diewert January 31, 2008.

L p -convergence of Bernstein Kantorovich-type operators. by Michele Campiti (Bari) and Giorgio Metafune (Lecce)

Viscosity approximation methods for the implicit midpoint rule of asymptotically nonexpansive mappings in Hilbert spaces

Extended And Unified Local Convergence For Newton-Kantorovich Method Under w Conditions With Applications

Newton-type Methods for Solving the Nonsmooth Equations with Finitely Many Maximum Functions

CONVERGENCE PROPERTIES OF COMBINED RELAXATION METHODS

A LEFSCHETZ FIXED POINT THEOREM FOR ADMISSIBLE MAPS IN FRÉCHET SPACES

Metric Spaces and Topology

ITERATED FUNCTION SYSTEMS WITH CONTINUOUS PLACE DEPENDENT PROBABILITIES

Chapter 3: Baire category and open mapping theorems

Sharp estimation of local convergence radius for the Picard iteration

CONVERGENCE BEHAVIOUR OF INEXACT NEWTON METHODS

MAT 544 Problem Set 2 Solutions

A derivative-free nonmonotone line search and its application to the spectral residual method

ASYMPTOTICALLY NONEXPANSIVE MAPPINGS IN MODULAR FUNCTION SPACES ABSTRACT

Some Inequalities for Commutators of Bounded Linear Operators in Hilbert Spaces

Maria Cameron. f(x) = 1 n

University of Houston, Department of Mathematics Numerical Analysis, Fall 2005

THE INVERSE FUNCTION THEOREM FOR LIPSCHITZ MAPS

MATH202 Introduction to Analysis (2007 Fall and 2008 Spring) Tutorial Note #12

Search Directions for Unconstrained Optimization

THE NEARLY ADDITIVE MAPS

Proofs for Large Sample Properties of Generalized Method of Moments Estimators

On nonexpansive and accretive operators in Banach spaces

DIFFERENT ITERATIVE METHODS. Nazli Karaca and Isa Yildirim

Solutions for Homework Assignment 2

Two-Step Iteration Scheme for Nonexpansive Mappings in Banach Space

8 Barrier Methods for Constrained Optimization

1. Nonlinear Equations. This lecture note excerpted parts from Michael Heath and Max Gunzburger. f(x) = 0

It follows from the above inequalities that for c C 1

A projection-type method for generalized variational inequalities with dual solutions

Improved Complexity of a Homotopy Method for Locating an Approximate Zero

Approximating solutions of nonlinear second order ordinary differential equations via Dhage iteration principle

SOME INEQUALITIES FOR COMMUTATORS OF BOUNDED LINEAR OPERATORS IN HILBERT SPACES. S. S. Dragomir

GENERAL NONCONVEX SPLIT VARIATIONAL INEQUALITY PROBLEMS. Jong Kyu Kim, Salahuddin, and Won Hee Lim

The Approximation of Some Invertible Operators Using Newton s Method in Banach Spaces

EXISTENCE OF SET-INTERPOLATING AND ENERGY- MINIMIZING CURVES

MAT 570 REAL ANALYSIS LECTURE NOTES. Contents. 1. Sets Functions Countability Axiom of choice Equivalence relations 9

An Accelerated Hybrid Proximal Extragradient Method for Convex Optimization and its Implications to Second-Order Methods

The small ball property in Banach spaces (quantitative results)

QUADRATIC MAJORIZATION 1. INTRODUCTION

MATH3283W LECTURE NOTES: WEEK 6 = 5 13, = 2 5, 1 13

Frame expansions in separable Banach spaces

Transcription:

An improved convergence theorem for the Newton method under relaxed continuity assumptions Andrei Dubin ITEP, 117218, BCheremushinsaya 25, Moscow, Russia Abstract In the framewor of the majorization technique, an improved condition is proposed for the semilocal convergence of the Newton method under the mild assumption that the derivative F ( x) of the involved operator Fx ( ) is continuous Our starting point is the Argyros representation of the optimal upper bound for the distance x 1 x between the adjacent members of the Newton sequence { x } The major novel element of our proposal is the optimally reconstructed «first integral» approximation 1 ( ) to the recurrence relation defining the scalar majorizing sequence { } Compared to the previous results of Argyros, it enables one to obtain a weaer convergence condition that leads to a better bound on the location of the solution of the equation Fx ( ) and allows for a wider choice of initial guesses x In the simplest case of the Lipschitz continuous operator F ( x), the new convergence condition improves the famous Kantorovich condition provided that l / l (6 4 2), where l and l stand for the center-lipschitz and Lipschitz constants respectively Keywords: Newton method, Banach space, relaxed continuity assumptions, majorization technique, convergence conditions, Kantorovich theorem 1 Introduction Resolution of many different problems of the fundamental and applied mathematics involves the Newton method generating the corresponding sequence{ x } through the recurrence relation x x F x F x T x,, ( 1-1 ) 1 1 ( ( )) ( ) ( ) where F( x) Y is a continuously differentiable operator which defines a mapping DF X Y between an open subset D F of a Banach space X into a given Banach space Y The application of this method is built on the so-called Newton-Kantorovich (NK) theorem due to Kantorovich, see [ 1 ] and [ 2 ] for the original proof and further references The major limitation of this famous theorem is that one has to impose rather stringy smoothness restriction that the operator Fx ( ) is twice differentiable or at least that the Frechet derivative F ( x) of Fx ( ) is Lipschitz continuous The latter restriction cannot be satisfied in many interesting applied problems Therefore, the practically important objective is to obtain Kantorovich-type theorems for the Newton method relaxing as much as possible the assumptions about the degree of the continuity of the operator F ( y)

Recently, an interesting new proposal [ 3 ],[ 4 ] in this direction is made presuming only the continuity of F ( x) The employed majorization technique is built on the novel upper bound for the distance x 1 x The bound is expressed in terms of certain continuity measure depending only on the relative distances xq x of x q ( q, 1) from the initial point x The subtle feature which calls for an optimization is that the resulting convergence conditions introduce a larger number of elementary restrictions compared to the NK theorem To accomplish such optimization of [ 3 ],[ 4 ], we put forward a new implementation of the majorization technique It optimally introduces the approximate «first integral» representation [ 2 ] of the recurrence relation defining the majorizing sequence Under the same computational cost, the proposed technique enables one to improve the convergence condition of [ 4 ] formulating the results under the relaxed continuity assumptions in accordance with the pattern of the NK theorem 2 Improved convergence theorem 21 «First integral» form of the recurrence relations To formulate and prove the improved convergence theorem, we derive such implementation of the general majorization condition x x,, ( 2-1 ) 1 1 that the scalar majorizing sequence { } is generated by the recurrence relation in the so-called «first integral» form [ 2 ],[ 1 ]: 1 ( ),, ( 2-2 ) where the continuous non-decreasing function ( ) assumes the form 1 ( ) 2 1 ( ) ( l) dl ( 2-3 ) Here, the continuous non-decreasing function () r facilitates the affine-invariant upper bound1 1 ( F( x )) ( F( x) F( x )) ( r), x x r, r R, ( 2-4 ) where B( x, R) DF while B( x, R ) denotes the closed ball of the center x and the radius R The (uniform) continuity of F ( x) on B( x, R ) is implemented through the restriction () Note that the characteristic feature of the bound ( 2-4 ) is that this continuity measure is centered at x, ie depends on the relative distance x x of x from the stationary point identified with x Being easier to compute, the continuity measure () is generically smaller than its conventional non-centered counterpart () r and, furthermore, ( r) ( r) in a variety of practically interesting cases [ 4 ] 1 The matrix norm is presumed to be submultiplicative and consistent with the corresponding vector norm

22 Statement of the theorem The set Consider a continuously differentiable operator F( x) : DF X Y such that for some x DF Assume that the condition ( 2-4 ) and the restriction 1 ( F ( x )) L( X, Y) ( F( x )) F( x ) x x are valid for some constants, R and a continuous 1 1 non-decreasing function ( r) with () Presume also that ( 2-3 ) defines such continuous non-decreasing function ( ) that the (associated to ( 2-2 )) fixed point equation ( ) ( 2-5 ) has a minimal solution satisfying the restrictions R, B( x, R) DF where Bx (, ) denotes the closed ball of the center x and the radius Finally, let the scalar sequence { } be generated by the relation ( 2-2 ) supplemented by the initial conditions, 1 () Theorem Under the set of the conditions, the Newton sequence { x} { T x} is welldefined, remains in Bx (, ) and converges to a solution x of the equation Fx ( ) Moreover, the majorization estimates ( 2-1 ) and x x,, ( 2-6 ) are valid where the scalar sequence { }, being non-decreasing, converges to the minimal solution of ( 2-5 ) In the above stated theorem, the central role is played by the condition that there exists a solution of the equation ( 2-5 ) To mae contact with the standard formulation of the NK theorem, it is sufficient to observe that the NK theorem may be reformulated in terms of the similar requirement If 2l 1, the NK fixed point equation l 2 / 2 has a solution (for ) where l denotes the Lipschitz constant in the affine-invariant framewor (eg, see [ 4 ]) Let us also note without proof that a minor modification of the arguments in [ 4 ] enables one to verify that the considered in the theorem solution x is unique in the open ball Bx (, ) 23 Proof of the theorem Assume that there is such a majorizing scalar sequence { } that converges to a limiting point lim R and facilitates the upper bound ( 2-1 ) once { x} B( x, R) DF Then, the standard arguments [ 2 ] verify that the Newton sequence { x } complies with the Cauchy criterion and, therefore, converges to a limiting point x so that the condition ( 2-6 ) is valid and x, x B( x, ) for In turn, the continuity of Fx ( ) implies that Fx ( ) Once the sequence { } is generated by the recurrence relation ( 2-2 ), the convenient sufficient conditions for the existence of the limiting point are proposed by Kantorovich [ 1 ]

The proof of Theorem 1 of Section 2 in Chapter XVIII of [ 1 ] includes the verification of the following useful Lemma Presume that the fixed point equation ( 2-5 ) has a minimal solution [, R] and the continuous function ( ) is non-decreasing when [, R] As long as { } is generated by ( 2-2 ) with the considered initial conditions, this sequence is nondecreasing and converges to so that { } [, R] Next, let us prove that, under the set of the restrictions, the majorization condition ( 2-1 ) is valid where the continuous non-decreasing function ( ) is defined on [, R ] by ( 2-3 ) As a starting point of the proof, we use the Rheinboldt majorization technique [ 2 ] It introduces such a function ( t, s, r) that, being continuous and non-decreasing in t, s, r, optimally implements the upper bound that x, Tx D DF Here, d( x, y) x y d T x Tx d Tx x d Tx x d x x presuming 2 (, ) ( (, ), (, ), (, )), T( x) Tx, T( T( x)) 2 T x, x denotes the starting point of the Newton sequence { x } D D defined by ( 1-1 ) and one identifies D B( x, R) In turn, the latter upper bound is converted into the recurrence relation [ 2 ]: u u ( u u, u, u ), 1 ( 2-7 ) 1 1 1 Given the initial conditions u and u1 1, it defines such scalar nondecreasing sequence { u } that facilitates the estimate x 1 x u 1 u for As ( 2-7 ) is generically too complex to be analyzed exactly, we propose the following general approach Given the upper bound, () the idea is to optimally replace () by such its majorant () that admits the so-called «first-integral» representation [ 2 ] (Section 125): ( t, t r, r) ( t, t r, r) ( t r) ( r), r [, R], t [, R r], ( 2-8 ) where () r is restricted to be a continuous and non-decreasing function It is to be compared with the more ambitious suggestion [ 2 ] to try to derive such representation directly for () that is not implemented so far for the Newton method under relaxed continuity assumptions Given ( 2-8 ) and ( 2-2 ) together with the monotonicity of, () it is straightforward to justify by induction that u u 1 1 for 1 if u and u1 1 () In turn, combining it with the estimate x 1 x u 1 u, one reproduces ( 2-1 ) As for the justification of the inequality u u 1 1, note first that it is obviously valid for 1 Assume that this equality holds true for 1 q which implies that u when 1 q Then, q1 q q q1 q q1 q q1 q1 q F u u (,, ) ( ) ( ) that completes the justification It remains to optimally reconstruct the explicit form of the relevant pattern of () and then derive such its majorant () that, complying with ( 2-8 ), verifies the expression ( 2-3 ) for ( ) The standard transformations lead to r t r, ( t, s, r) ( s) ( l) dl ( r) t ( s) (1 ( )) 1 s, ( 2-9 )

that, in fact, is the particular case of the general expression obtained in [ 3 ] for a generic Newton-lie method (see the equation (34) where one is to identify a and w ( r) w ( r) ( r) ) Thus defined ( t, s, r) is continuous and non-decreasing in t, s, r due to 1 the continuity and monotonicity of () r and the verified below relation ( ) ],1[ In turn, rt rt r r, ( 2-1 ) ( t, s, r ) 2 ( s ) ( l ) dl 2 ( s ) ( l ) dl 2 ( r ) ( l ) dl ( t, s, r ) where it is presumed that r s (with ( r) ( s) ) in accordance with the subsequent identification r t s In sum, ( 2-1 ) and ( 2-8 ) lead to ( 2-3 ) Finally, to prove that thus defined functions ( ) and ( t, s, r) are non-negative, it is sufficient to demonstrate that ( ) ],1[ which also implies that 1 ( F( x )) F( x ) for 1 (ie, the sequence { x } is well-defined) In turn, this property of ( ) directly follows from the pattern ( 2-3 ) of ( 2-5 ) and the imposed restriction ], R] 3 Application to the case of the Lipschitz continuous operator F ( x) When F ( x) is Lipschitz continuous, the upper bound ( 2-4 ) is introduced with () r lr where l is the center-lipschitz constant [ 4 ] In this case, the implementation ( 2-3 ) of ( 2-5 ) is reduced to the quadratic in equation 2 2 2 l (1 l) with the discriminant 2 D( l ) (1 l ) 8 l ( l ) 6l 1 For the latter equation to possess a solution, the secondary quadratic function D( ) should be non-negative which constrains that l 3 2 2 (ie, max (3 2 2) / l ) where 3 2 2,171 is the minimal root of the equation D( ) It is straightforward to demonstrate that the corresponding minimal solution satisfies the required restriction when 1 The Kantorovich condition [ 1 ] reads l,5 ( max (2 l) ) where l stands for the standard Lipschitz constant [ 4 ] formulated in the affine-invariant way The above condition l 3 2 2 is weaer if l / l2(3 2 2) when ( l ) (3 2 2) (2 l) 1 1 max max It is noteworthy that the critical value 6 4 2,343 of the ratio l / l is fairly moderate because it may often be that l / 1 l (see [ 4 ]) The price to pay for this advantage is the linear (rather than quadratic as in the Kantorovich case) convergence of the majorizing sequence { } 4 Comparison with the Argyros convergence theorem The proposal of [ 4 ] (Section 27) derives certain approximate estimates starting directly from the bound () of ( 2-9 ) The single condition of the existence of the minimal solution ], R] of the fixed point equation ( 2-5 ) is replaced in [ 4 ] by the two different restrictions The first one requires the existence of a solution r of equation f () r r and, in our terms,

1 l dl f ( ) 1 ( ) ( ) ( ) ( ), [, R] ( 4-1 ) In the derivation of the inequality f ( ) ( ), the monotonicity of ( ) is taen into where ( ) is introduced by ( 2-3 ) In particular, f ( ) ( ) for ], R] when and ( ) is strictly increasing (with ( ) for ) As a result, compared to the first restriction of [ 4 ], our condition is weaer leading to a finer localization of the limiting point x B( x, ) and a larger maximal admissible value max of the upper bound ( F( x )) F( x ) 1 restricting the choice of the initial point x As for the second restriction of [ 4 ], in our terms it q( r ) 2 ( r ) / (1 ( r )) 1 that is not generically necessitated by the first restriction reads In particular, given ( 4-1 ), the counterpart of our condition l 3 2 2,171 may be obtained in the form l,1 while in [ 3 ] it is argued that l (2 3) / 2,134 using a slightly different technique Both these upper bounds are less favorable than our bound 5 References [ 1 ] LV Kantorovich, GP Ailov, Functional Analysis, Pergamon Press, Oxford, 1982 [ 2 ] JM Ortega, WC Rheinboldt, Iterative Solution of Nonlinear Equations in Several Variables, Academic Press, New Yor, 2 [ 3 ] I K Argyros, Relaxing the convergence conditions for Newton-lie methods, J Appl Math and Computing, 21 (26), No 1-2, 119 126 [ 4 ] I K Argyros, Convergence and Applications of Newton-Type Iterations, Springer, New Yor, 28