arxiv: v1 [math.na] 25 Sep 2012

Similar documents
Kantorovich s Majorants Principle for Newton s Method

arxiv: v1 [math.fa] 16 Jun 2011

Journal of Complexity. New general convergence theory for iterative processes and its applications to Newton Kantorovich type theorems

On well definedness of the Central Path

Maximal Monotone Operators with a Unique Extension to the Bidual

New w-convergence Conditions for the Newton-Kantorovich Method

EXPANDING THE CONVERGENCE DOMAIN FOR CHUN-STANICA-NETA FAMILY OF THIRD ORDER METHODS IN BANACH SPACES

An Accelerated Hybrid Proximal Extragradient Method for Convex Optimization and its Implications to Second-Order Methods

Maximal Monotonicity, Conjugation and the Duality Product in Non-Reflexive Banach Spaces

2 (Bonus). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure?

Real Analysis Math 131AH Rudin, Chapter #1. Dominique Abdi

AN ITERATIVE METHOD FOR COMPUTING ZEROS OF OPERATORS SATISFYING AUTONOMOUS DIFFERENTIAL EQUATIONS

arxiv: v3 [math.oc] 18 Apr 2012

Convergence analysis of a one-step intermediate Newton iterative scheme

arxiv: v1 [math.oc] 21 May 2015

B. Appendix B. Topological vector spaces

The Journal of Nonlinear Sciences and Applications

On the convergence properties of the projected gradient method for convex optimization

On the complexity of the hybrid proximal extragradient method for the iterates and the ergodic mean

Nonlinear equations. Norms for R n. Convergence orders for iterative methods

DO NOT OPEN THIS QUESTION BOOKLET UNTIL YOU ARE TOLD TO DO SO

Your first day at work MATH 806 (Fall 2015)

Newton Method with Adaptive Step-Size for Under-Determined Systems of Equations

1. Nonlinear Equations. This lecture note excerpted parts from Michael Heath and Max Gunzburger. f(x) = 0

Introduction to Real Analysis Alternative Chapter 1

Locally convex spaces, the hyperplane separation theorem, and the Krein-Milman theorem

Continuity of convex functions in normed spaces

The Newton-Kantorovich (NK) Method

An improved convergence theorem for the Newton method under relaxed continuity assumptions

Monotone Point-to-Set Vector Fields

l(y j ) = 0 for all y j (1)

A convergence result for an Outer Approximation Scheme

The effect of calmness on the solution set of systems of nonlinear equations

3 (Due ). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure?

Convex Optimization Notes

Convex Analysis and Economic Theory AY Elementary properties of convex functions

Cubic regularization of Newton s method for convex problems with constraints

Journal of Convex Analysis (accepted for publication) A HYBRID PROJECTION PROXIMAL POINT ALGORITHM. M. V. Solodov and B. F.

MATH 202B - Problem Set 5

WEAK CONVERGENCE OF RESOLVENTS OF MAXIMAL MONOTONE OPERATORS AND MOSCO CONVERGENCE

Course 212: Academic Year Section 1: Metric Spaces

Math 328 Course Notes

Mathematics for Economists

Math 209B Homework 2

1 Directional Derivatives and Differentiability

d(x n, x) d(x n, x nk ) + d(x nk, x) where we chose any fixed k > N

(convex combination!). Use convexity of f and multiply by the common denominator to get. Interchanging the role of x and y, we obtain that f is ( 2M ε

Induced Norms, States, and Numerical Ranges

Introduction to Proofs

Lecture 5: The Bellman Equation

Merit functions and error bounds for generalized variational inequalities

Maximal monotone operators, convex functions and a special family of enlargements.

Projections onto convex sets on the sphere

Convergence of a Third-order family of methods in Banach spaces

March 25, 2010 CHAPTER 2: LIMITS AND CONTINUITY OF FUNCTIONS IN EUCLIDEAN SPACE

MATH 131A: REAL ANALYSIS (BIG IDEAS)

Series Solutions. 8.1 Taylor Polynomials

L p Spaces and Convexity

Functions of Several Variables

Chapter 8. P-adic numbers. 8.1 Absolute values

1. Bounded linear maps. A linear map T : E F of real Banach

THEOREMS, ETC., FOR MATH 515

YURI LEVIN AND ADI BEN-ISRAEL

Notes on Distributions

Richard S. Palais Department of Mathematics Brandeis University Waltham, MA The Magic of Iteration

Pacific Journal of Optimization (Vol. 2, No. 3, September 2006) ABSTRACT

Lecture Notes in Advanced Calculus 1 (80315) Raz Kupferman Institute of Mathematics The Hebrew University

A Strongly Convergent Method for Nonsmooth Convex Minimization in Hilbert Spaces

1/12/05: sec 3.1 and my article: How good is the Lebesgue measure?, Math. Intelligencer 11(2) (1989),

A NOTE ON LINEAR FUNCTIONAL NORMS

Iterative Solution of a Matrix Riccati Equation Arising in Stochastic Control

An inexact subgradient algorithm for Equilibrium Problems

ON THE NEWTON KANTOROVICH THEOREM

APPLICATIONS OF DIFFERENTIABILITY IN R n.

The Arzelà-Ascoli Theorem

Elementary linear algebra

SOLUTIONS TO SOME PROBLEMS

AN INEXACT HYBRID GENERALIZED PROXIMAL POINT ALGORITHM AND SOME NEW RESULTS ON THE THEORY OF BREGMAN FUNCTIONS. M. V. Solodov and B. F.

AN INEXACT HYBRID GENERALIZED PROXIMAL POINT ALGORITHM AND SOME NEW RESULTS ON THE THEORY OF BREGMAN FUNCTIONS. May 14, 1998 (Revised March 12, 1999)

Math 341: Convex Geometry. Xi Chen

MATHS 730 FC Lecture Notes March 5, Introduction

A FIXED POINT PROOF OF THE CONVERGENCE OF A NEWTON-TYPE METHOD

NOTES ON VECTOR-VALUED INTEGRATION MATH 581, SPRING 2017

THE INVERSE FUNCTION THEOREM

Functional Analysis. Franck Sueur Metric spaces Definitions Completeness Compactness Separability...

THE INVERSE FUNCTION THEOREM FOR LIPSCHITZ MAPS

Journal of Convex Analysis Vol. 14, No. 2, March 2007 AN EXPLICIT DESCENT METHOD FOR BILEVEL CONVEX OPTIMIZATION. Mikhail Solodov. September 12, 2005

Journal of Computational and Applied Mathematics

arxiv:math/ v1 [math.fa] 21 Mar 2000

FIXED POINT ITERATIONS

Convex Geometry. Carsten Schütt

Handout on Newton s Method for Systems

Extended And Unified Local Convergence For Newton-Kantorovich Method Under w Conditions With Applications

Upper sign-continuity for equilibrium problems

Your first day at work MATH 806 (Fall 2015)

Weak sharp minima on Riemannian manifolds 1

Existence results for quasi-equilibrium problems

On intermediate value theorem in ordered Banach spaces for noncompact and discontinuous mappings

MA677 Assignment #3 Morgan Schreffler Due 09/19/12 Exercise 1 Using Hölder s inequality, prove Minkowski s inequality for f, g L p (R d ), p 1:

Transcription:

Kantorovich s Theorem on Newton s Method arxiv:1209.5704v1 [math.na] 25 Sep 2012 O. P. Ferreira B. F. Svaiter March 09, 2007 Abstract In this work we present a simplifyed proof of Kantorovich s Theorem on Newton s Method. This analysis uses a technique which has already been used for obtaining new extensions of this theorem. AMSC: 49M15, 90C30. 1 Introduction Kantorovich s Theorem assumes semi-local conditions to ensure existence and uniqueness of a solution of a nonlinear equation F(x) = 0, where F is a differentiable application between Banach spaces [5, 6, 7, 12]. This theorem uses constructively Newton method and also guarantee convergence to a solution of this iterative procedure. Apart from the elegance of this theorem, it has many theoretical and practical applications, in [10] we can find a reviews of recent applications and in [11] an application in interior point methods. This theorem has also many extensions, some of then encompassing previously unrelated results see [1, 13]. Some of these generalizations and extensions are quite recent, because in the last few year the Kantorovich s Theorem has been the subject of intense research, see [1, 2, 3, 10, 11, 13]. The aim of this paper is to present a new technique for the analysis of the Kantorovich s Theorem. This technique, was introduced in [2] and since then it has been used for obtaining new extensions of Kantorovich s Theorem see [1, 3]. Here, it will be used to present a simplified proof of its classical formulation. The main idea is to define good regions for Newton method, by comparing the nonlinear function F with its scalar majorant function. Once these good regions are obtained, an invariant set for Newton method is also obtained and there, Newton iteration can be repeated indefinitely. IME/UFG, Campus II- Caixa Postal 131, CEP 74001-970 - Goiânia, GO, Brazil (Email:orizon@mat.ufg.br). The author was supported in part by FUNAPE/UFG, PADCT- CNPq, PRONEX Optimization(FAPERJ/CNPq), CNPq Grant 302618/2005-8, CNPq Grant 475647/2006-8 and IMPA. IMPA, Estrada Dona Castorina, 110, Jardim Botânico, CEP 22460-320 - Rio de Janeiro, RJ, Brazil (E-mail:benar@impa.br). The author was supported in part by CNPq Grant 301200/93-9(RN), CNPq Grant 475647/2006-8 and by PRONEX Optimization(FAPERJ/CNPq). 1

The following notation is used throughout our presentation. et X be a Banach space. The open and closed ball at x X are denoted, respectively by B(x,r) = {y X; x y < r} and B[x,r] = {y X; x y r}. For the Frechet derivative of a mapping F we use the notation F and for the Dual space of X we use X. First, let us recall Kantorovich s theorem on Newton s method in its classical formulation, see [4, 6, 8, 9, 11]. Theorem 1. et X, Y be Banach spaces, C X and F : C Y a continuous function, continuously differentiable on int(c). Take x 0 int(c),, b > 0 and suppose that 1) F (x 0 ) is non-singular, 2) F (x 0 ) 1 [F (y) F (x)] x y for any x,y C, 3) F (x 0 ) 1 F(x 0 ) b, 4) 2b 1. Define If t := 1 1 2b, t := 1+ 1 2b. (1) B[x 0,t ] C, then the sequences {x k } generated by Newton s Method for solving F(x) = 0 with starting point x 0, x k+1 = x k F (x k ) 1 F(x k ), k = 0,1,, (2) is well defined, is contained in B(x 0,t ), converges to a point x B[x 0,t ] which is the unique zero of F in B[x 0,t ] and x x k+1 1 2 x x k, k = 0,1,. (3) Moreover, if assumption 4 holds as an strict inequality, i.e. 2b < 1, then x x k+1 1 θ2k 1+θ 2k 2 1 2b x x k 2 2 1 2b x x k 2, k = 0,1,, (4) where θ := t /t < 1, and x is the unique zero of F in B[x 0,ρ] for any ρ such that t ρ < t, B[x 0,ρ] C. Note that under assumption 1-4, convergence of {x k } is Q-linear, according to (3). The additional assumption 2b < 1 guarantee Q-quadratic convergence, according to (4). This additional assumption also guarantee that x is the unique zero of F in B(x 0,t ), whenever B(x 0,t ) C. From now on, we assume that the hypotheses of Theorem 1 hold, with the exception of 2b < 1 which will be considered to hold only when explicitly stated. 2

2 Kantorovich s Theorem for a scalar quadratic function In this section we analyze Newton method applied to solve the scalar equation f(t) = 0, for f : R R f(t) = 2 t2 t+b. (5) The analysis to be performed can also be viewed as Kantorovich s theorem for function f. This function and the sequence generated by Newton method for solving f(t) = 0 with starting point t 0, t 0 := 0, (6) both will play an important rule in the analysis of Theorem 1. Note that the assumptions of Theorem 1 are satisfied in the very particular case F = f, X = Y = C = R, x 0 = t 0. The roots of f are t and t, as defined in (1). As b, > 0, 0 < t t, with strict inequality between t and t if and only if 2b < 1. Hence t is the unique root of f in B[t 0,t ], if 2b < 1, then t is the unique root of f in B(t 0,t ). So, the existence and uniqueness part of Theorem 1 for zeros of f holds. Proposition 2. The scalar function f has a smallest nonnegative root t (0,1/]. Moreover, for any t [0,t ) f(t) > 0, f (t) (t t ) < 0. Proof. For the first statement, it remains to prove that t 1/, which is a trivial consequence of the assumptions on b and. As f(0) > 0, f shall be strictly positive in [0,t ). For the last inequalities, use the inequality t 1/ and (5) to obtain f (t) = t 1 = (t 1/) (t t ). Now, the last inequality follows directly from the assumption t < t. According to Proposition 2, f (t) 0 for all t [0,t ). Therefore, Newton iteration is well defined in [0,t ). et us call it n f, n f : [0,t ) R t t f(t)/f (t). (7) Note that, up to now, only one single iteration of newton method is well defined in [0,t ). In principle, Newton iteration could map some t [0,t ) in to 1/. In such a case, the second iterate for t would be not defined. Now, we shall prove that Newton iteration can be repeated indefinitely at any starting point in [0,t ). 3

Proposition 3. For any t [0,t ) t n f (t) = 2f (t) (t t) 2, t < n f (t) < t. In particular, n f maps [0,t ) in [0,t ). Proof. Take t [0,t ). As f is a second-degree polynomial and f(t ) = 0, 0 = f(t)+f (t)(t t)+ 2 (t t) 2. Dividing by f (t) we obtain, after direct rearranging t t+f(t)/f (t) = 2f (t) (t t) 2. Note that, by (7), the left hand side of the above equation is t n f (t), which proves the first equality. Using Proposition 2 we have f(t) > 0 and f (t) < 0. Combining these inequalities with definition (7) and the first equality in the proposition, respectively, we obtain t < n f (t) < t. The last statement of the Proposition follows directly from these inequalities. Proposition 3 shows, in particular, that for any t [0,t ), the sequence {n k f (t)}, n 0 ( f (t) = t, nk+1 f (t) = n f n k f (t) ), k = 0,1,. iswelldefined, strictlyincreasing,remainsin[0,t )andso,isconvergent. Therefore, Newton method for solving f(t) = 0 with starting point t 0 = 0 ( see (6)) generates an infinite sequence {t k = n k f (t 0)}, which can be also defined as t 0 = 0, t k+1 = n f (t k ), k = 0,1,. (8) As we already observed, this sequence is strictly increasing, remains in [0,t ) and converges. Corollary 4. The sequence {t k } is well defined, strictly increasing and is contained in [0,t ). Moreover, it converges Q-linearly to t, as follows t t k+1 = 2f (t k ) (t t k ) 2 1 2 (t t k ), k = 0,1,. (9) If 2b < 1, then the sequence {t k } converge Q-quadratically as follows t t k+1 = 1 θ2k 1+θ 2k 2 1 2b (t t k ) 2 where θ = t /t < 1. 2 1 2b (t t k ) 2, k = 0,1,, (10) 4

Proof. The first statement of the corollary have already been proved. Using Proposition 3, we have for any k t n f (t k ) = 2f (t k ) (t t k ) 2, which combined with (8), yields the equality on (9). As t k [0,t ), using Proposition 2 we have f (t k ) (t k t ) < 0. The multiplication of the first above inequality by (t t k )/(2f (t k )) < 0 yields the inequality in (9). Now suppose that 2b < 1 or equivalently t < t. A closed expression for t k is available ( see, e.g. [[9], Appendix F], [4]) see the Appendix A. In this case 2 1 2b t k = t θ2k, k = 0,1,. 1 θ 2k From above equation we have that 1 2b f (t k ) = 1+θ2k, k = 0,1,. 1 θ 2k a Therefore, to obtain the equality in (10) combine the equality in (9) and latter equality. As (1 θ 2k )/(1+θ 2k ) 1 the inequality in (10) follows. 3 Simplifying assumption and convergence Newton method is invariant under (non-singular) linear transformations. This fact will be used to simplify our analysis. We claim that it is enough to prove Theorem 1 for the case X = Y and F (x 0 ) = I. Indeed, if F (x 0 ) I, define G = F (x 0 ) 1 F. Then, the domain, the roots, the domain of the derivative and the points where the derivative is non-singular are the same for F and G. Moreover, Newton method applied to F(x) = 0 is equivalent to Newton methods applied to G(x) = 0, i.e., at the points where F (x) is nonsingular, G (x) 1 G(x) = F (x) 1 F(x), x G (x) 1 G(x) = x F (x) 1 F(x). Finally, G will satisfy the same assumptions wich F satisfy. So, from now one we assume X = Y, F (x 0 ) = I. (11) Note that this assumption simplifies conditions 2 and 3 of Theorem 1. Proposition 5. If 0 t < t and x B(x 0,t), then F (x) is non-singular and F (x) 1 1/ f (t). 5

Proof. Recall that t 1/. Hence, 0 t < 1/. Using (11) and assumption 2, with x and x 0 we have F (x) I t < 1. Hence, using Banach s emma, we conclude that F (x) is non-singular and F (x) 1 1/(1 t). To end the proof, use (5) to obtain f (t) = 1 t for 0 t < t. The error in the first order approximation of F at point x int(c) can be estimated in any y C, whenever the line segment with extreme points x,y lays in C. Since balls are convex, we have: Proposition 6. If x B(x 0,R) and y B[x 0,R] C, then Proof. Define, for θ [0,1], F(y) [ F(x)+F (x)(y x) ] 2 y x 2. y(θ) = x+θ(y x), R(θ) = F(y(θ)) [F(x)+F (x)(y(θ) x)]. We shall estimate R(1). From Hahn-Banach Theorem, there exists ξ X such that ξ = 1, ξ(r(1)) = R(1). Define, for θ [0,1], g(θ) = ξ(r(θ)). Direct calculation yields, for θ [0,1) ( ) dg dθ (θ) = ξ F (y(θ)) F (x). In particular, g is C 1 on [0,1). Using assumption 2, we have dg (θ) θ y x. dθ To end the prove, note that ξ(r(0)) = 0 and perform direct integration on the above inequality. Proposition 5 guarantee non-singularity of F, and so well definedness of Newton iteration map for solving F(x) = 0 in B(x 0,t ). et us call N F the Newton iteration map (for F(x) = 0) in that region N F : B(x 0,t ) X x x F (x) 1 F(x). (12) 6

One can apply a single Newton iteration on any x B(x 0,t ) to obtain N F (x) which may not belong to B(x 0,t ), or even may not belong to the domain of F. To ensure that Newton iterations may be repeated indefinitely from x 0, we need some additional results. First, define some subsets of B(x 0,t ) in which, as we shall prove, Newton iteration (12) is well behaved. K(t) := {x B[x 0,t] : F(x) f(t)}, t [0,t ), (13) K := K(t). (14) t [0,t ) emma 7. For any t [0,t ) and x K(t), 1. F (x) 1 F(x) f(t)/f (t), 2. x 0 N F (x) n f (t), 3. F(N F (x)) f(n f (t)). In particular, N F (K(t)) K(n f (t)), t [0,t ) and N F maps K in K, i.e., N F (K) K. Proof. Take x K(t). Using Proposition 5 and (13) we conclude that F (x) is non-singular, F (x) 1 1/ f (t), F(x) f(t). Hence, F (x) 1 F(x) F (x) 1 F(x) f(t)/ f (t), which combined with the inequality f (t) < 0 yields item 1. Toproveitem2useitem1, triangularinequalityanddefinition(12)toobtain x 0 N F (x) x 0 x + F (x) 1 F(x) t f(t)/f (t). To end the prove of item 2, combine the above equation with definition (7). Fromitem 2 andproposition3, N F (x) B(x 0,t ). So, Proposition6implies F(NF (x)) [ F(x)+F (x)(n F (x) x) ] F (x) 1 F(x) 2. 2 Note that by (12) F(x)+F (x)(n F (x) x) = 0. Combining last two equations, item 1 and identity f(n f (t)) = (f(t)/f (t)) 2 /2 (which follows from (5) and (7)) we conclude that item 3 also holds. Since t < n f (t) < t (Proposition 3), using also items 2 and 3 we have that N F (x) K(n f (t)). As x is an arbitrary element of K(t), we have N F (K(t)) K(n f (t)). To prove the last inclusion, take x K. Then x K(t) for some t [0,t ), which readily implies N F (x) K(n f (t)) K. 7

The last inclusion in emma 7 shows that for any x K, the sequence {N k F (x)}, N 0 F(x) = x, N k+1 F (x) = N F ( N k F (x) ), k = 0,1,, is well defined and remains in K. The assumptions of Theorem 1 guarantee x 0 K(0) K. (15) Therefore, the sequence {x k = N k F (x 0)} is well defined and remains in K. This sequence can be also defined as x 0 = 0, x k+1 = N F (x k ), k = 0,1,. (16) which happens to be the same sequence specified in (2), Theorem 1. Proposition 8. The sequence {x k } is well defined, is contained in B(x 0,t ) and x k K(t k ), k = 0,1,. (17) Moreover, {x k } converges to a point x B[x 0,t ], and F(x ) = 0. x x k t t k, k = 0,1,, Proof. Well definedness of the sequence {x k } was already proved. We also conclude that this sequence remains in K. As K B(x 0,t ) (see (13) and (14)), {x k } also remains in B(x 0,t ). As t 0 = 0, the first inclusion in (15) can also be written as x 0 K(t 0 ). So, (17) holds for k = 0. To complete the proof of (17) use induction in k, (16), Proposition 7 and equation (8). Combining (17) with item 1 of emma 7, (16) and (8) we obtain x k+1 x k t k+1 t k, k = 0,1,. (18) As {t k } converges and k=0 t k+1 t k < we conclude that {x k } is a Cauchy sequence. So, {x k } converges to some x B[x 0,t ]. Moreover, (18) implies Note that x x k t j+1 t j = t t k, k = 0,1,. (19) j=k F(x k ) = F (x k 1 )[x k 1 x k ]. As F (x) is bounded by 1+t in B(x 0,t ) last equation implies that lim F(x k) = 0. k Now, using the continuity of F in B[x 0,t ] we have that F(x ) = 0. 8

4 Uniqueness and convergence rate To prove uniqueness and estimate the convergence rate, another auxiliary result will be needed. Proposition 9. Take x,y X, t,v 0. If x x 0 t < t, y x 0 R, F(y) = 0, f(v) 0 and B[x 0,R] C, then Proof. Note from (12) y N F (x) [v n f (t)] y x 2 (v t) 2. y N F (x) = F (x) 1 [F(x)+F (x)(y x)]. As F(y) = 0, using also Proposition 6 we obtain F (x) 1 [F(x)+F (x)(y x)] 2 y x 2, and from Proposition 5 F (x) 1 1/ f (t). Combining these equations we have y N F (x) y x 2 2 f (v t)2 (t) (v t) 2. As f (t) < 0 and f(v) 0, using also (7) we have v n f (t) = 1 f (t) [ f(t) f (t)(v t)] 1 f (t) [f(v) f(t) f (t)(v t)] = 2 f (t) (v t)2. Combining the two above inequalities we obtain the desired result. Corollary 10. If y B[x 0,t ] and F(y) = 0, then y x k+1 t t k+1 (t t k ) 2 y x k 2, y x k t t k, k = 0,1,. In particular, x is the unique zero of F in B[x 0,t ]. Proof. Take an arbitrary k. From Proposition 8 we have x k K(t k ). So, x k x 0 t k and we can apply Proposition 9 with x = x k, t = t k and v = t, to obtain y N F (x k ) [t n f (t k )] y x k 2 (t t k ) 2. 9

The first inequality now follows from the above inequality, (16) and (8). We will prove the second inequality by induction. For k = 0 this inequality holds, because y B[x 0,t ] and t 0 = 0. Now, assume that the inequality holds for some k, y x k t t k. Combining the above inequality with the first inequality of the corollary, we have that y x k+1 t t k+1, wich concludes the induction. We already know that x B[x 0,t ] and F(x ) = 0. Since {x k } converges to x and {t k } converges to t, using the second inequality of the corollary we conclude y = x. Therefore, x is the unique zero of F in B[x 0,t ]. Corollary 11. The sequences {x k } and {t k } satisfy In particular, x x k+1 t t k+1 (t t k ) 2 x x k 2, k = 0,1,. (20) Additionally, if 2b < 1 then x x k+1 1 2 x x k, k = 0,1,. (21) x x k+1 1 θ2k 1+θ 2k 2 1 2b x x k 2 2 1 2b x x k 2, k = 0,1,. (22) Proof. According to Proposition 8, x B[x 0,t ] and F(x ) = 0. To prove equation (20) apply Corollary 10 with y = x. Note that, by (9) in Corollary 4, and Proposition 8, for any k (t t k+1 )/(t t k ) 1/2 and x x k /(t t k ) 1. Combining these inequalities with (20) we have (21). Now, assume that b < 1/2 holds. Then, (10) in Corollary 4 and (20) imply (22) and the corollary is proved. Corollary 12. If 2b < 1, t ρ < t and B[x 0,ρ] C then x is the unique zero of F in B[x 0,ρ]. Proof. Assume that there exists y C such that y x 0 < ρ and F(y ) = 0. Using Proposition 6 with x = x 0 and y = y (recall that F (x 0 ) = I ) we obtain that F(x 0 )+y x 0 2 y x 0 2. Triangle inequality and assumption 3 of Theorem 1 yield F(x 0 )+y x 0 y x 0 F(x 0 ) y x 0 b. 10

Combining the above inequalities we obtain 2 y x 0 2 y x 0 b, which is equivalent to f( y x 0 ) 0. As y x 0 ρ < t last inequality implies that y x 0 t. Therefore, from Corollary 10 and assumption F(y ) = 0, we conclude that y = x. Therefore, it follows from Proposition 8, Corollary 10, Corollary 11 and Corollary 12 that all statements in Theorem 1 are valid. 4.1 Appendix: A closed formula for t k Note that f(t) = (/2)(t t )(t t ), and f (t) = (/2)[(t t )+(t t )]. Using the above equations and (8), t k+1 t = (t k t ) (t k t )(t k t ) (t k t )+(t k t ) = (t k t ) 2 (t k t )+(t k t ). By similar manipulations, we have t k+1 t = (t k t ) 2 (t k t )+(t k t ). Combing two latter equality we obtain that ( ) 2 t k+1 t tk t =. t k+1 t t k t Suppose that 2b < 1. In this case, t < t. Hence, using the definition θ := t /t < 1, and induction in k we have t k t t k t = θ 2k. After some algebraic manipulation in above equality we obtain hat t k = t θ 2k t θ 2k 1 = t θ2k 2 1 2b 1 θ 2k, k = 0,1,. References [1] Alvarez, F., Botle, J. and Munier, J., A Unifying ocal Convergence Result for Newton s Method in Riemannian Manifolds INRIA, Rapport de recherche, N. 5381, (2004). [2] Ferreira, O. P. and Svaiter, B. F., Kantorovich s Theorem on Newton s method in Riemannian Manifolds Journal of Complexity, 18, (2002), 304 329. 11

[3] Ferreira, O. P. and Svaiter, B. F., Kantorovich s Majorants Principle for Newton s Method, to appear in Optimization Methods and software (2006). [4] Gragg, W. B. and Tapia, R. A. Optimal Error Bound for Newton- Kantorovich Theorem, SIAM J. Numer. Anal., 11, 1 (1974), 10-13. [5] Kantorovich,. V. On Newton s method for functional equations, Dokl. Akad. Nauk. SSSR, 59 (1948), 1237-1240. [6] Kantorovich,. V., and Akilov, G. P., Functional analysis in normed spaces, Oxford, Pergamon (1964). [7] Ortega, J. M., The Newton-Kantorovich Theorem, The American Mathematical Monthly, 75, 6 (1968), 658 660. [8] Ortega, J. M., and Rheimboldt, W. C., Interactive solution of nonlinear equations in several variables, New York: Academic Press (1970). [9] Ostrowski, A. M., Solution of equations and systems of equations, Academic Press, New York (1996). [10] Polyak, B. T. Newton-Kantorovich method and its global convergence, Journal of Mathematical Science, 133, 4 (2006), 1513-1523. [11] Potra, Florian A., The Kantorovich Theorem and interior point methods Mathematical Programming, 102, 1 (2005), 47 70. [12] Tapia, R. A.,The Kantorovich Theorem for Newton s Method, The American Mathematical Monthly, 78, 4 (1971), 389 392. [13] Wang, X., Convergence of Newton s method and inverse function theorem in Banach space, Math. Comp. 68, 225 (1999), pp.169-186. 12