v = -!g(x 0 ) Ûg Ûx 1 Ûx 2 Ú If we work out the details in the partial derivatives, we get a pleasing result. n Ûx k, i x i - 2 b k

Similar documents
Introduction to Optimization Techniques. How to Solve Equations

Differentiable Convex Functions

Introduction to Machine Learning DIS10

6.3 Testing Series With Positive Terms

Infinite Sequences and Series

THE SOLUTION OF NONLINEAR EQUATIONS f( x ) = 0.

Find a formula for the exponential function whose graph is given , 1 2,16 1, 6

The Phi Power Series

Notes on iteration and Newton s method. Iteration

Notes for Lecture 11

Eigenvalues and Eigenvectors

1 Generating functions for balls in boxes

NUMERICAL METHODS FOR SOLVING EQUATIONS

Mon Apr Second derivative test, and maybe another conic diagonalization example. Announcements: Warm-up Exercise:

Seunghee Ye Ma 8: Week 5 Oct 28

Roberto s Notes on Series Chapter 2: Convergence tests Section 7. Alternating series

Math 113, Calculus II Winter 2007 Final Exam Solutions

Notes for Lecture 5. 1 Grover Search. 1.1 The Setting. 1.2 Motivation. Lecture 5 (September 26, 2018)

Definitions and Theorems. where x are the decision variables. c, b, and a are constant coefficients.

Recurrence Relations

Machine Learning for Data Science (CS 4786)

Once we have a sequence of numbers, the next thing to do is to sum them up. Given a sequence (a n ) n=1

10/2/ , 5.9, Jacob Hays Amit Pillay James DeFelice

Math 210A Homework 1

MA131 - Analysis 1. Workbook 3 Sequences II

The Method of Least Squares. To understand least squares fitting of data.

Polynomial Functions and Their Graphs

PAPER : IIT-JAM 2010

The Binomial Theorem

September 2012 C1 Note. C1 Notes (Edexcel) Copyright - For AS, A2 notes and IGCSE / GCSE worksheets 1

CS321. Numerical Analysis and Computing

Basic Iterative Methods. Basic Iterative Methods

Math 257: Finite difference methods

Infinite Series. Definition. An infinite series is an expression of the form. Where the numbers u k are called the terms of the series.

Lecture 3: Catalan Numbers

Ma 530 Infinite Series I

Practice Problems: Taylor and Maclaurin Series

REVISION SHEET FP1 (MEI) ALGEBRA. Identities In mathematics, an identity is a statement which is true for all values of the variables it contains.

CS537. Numerical Analysis and Computing

Optimization Methods MIT 2.098/6.255/ Final exam

Ray-triangle intersection

Lecture 6: Integration and the Mean Value Theorem. slope =

LINEAR ALGEBRA. Paul Dawkins

Solutions to Homework 7

Series Review. a i converges if lim. i=1. a i. lim S n = lim i=1. 2 k(k + 2) converges. k=1. k=1

5.6 Absolute Convergence and The Ratio and Root Tests

Math 120 Answers for Homework 23

5.1. The Rayleigh s quotient. Definition 49. Let A = A be a self-adjoint matrix. quotient is the function. R(x) = x,ax, for x = 0.

Apply change-of-basis formula to rewrite x as a linear combination of eigenvectors v j.

Sequences A sequence of numbers is a function whose domain is the positive integers. We can see that the sequence

Solutions to Final Exam Review Problems

NUMERICAL METHODS COURSEWORK INFORMAL NOTES ON NUMERICAL INTEGRATION COURSEWORK

Tridiagonal reduction redux

Practice Test Problems for Test IV, with Solutions

Section 11.6 Absolute and Conditional Convergence, Root and Ratio Tests

MATH 10550, EXAM 3 SOLUTIONS

Root Finding COS 323

In algebra one spends much time finding common denominators and thus simplifying rational expressions. For example:

Math F15 Rahman

Complex Analysis Spring 2001 Homework I Solution

Machine Learning for Data Science (CS 4786)

Building Sequences and Series with a Spreadsheet (Create)

11. FINITE FIELDS. Example 1: The following tables define addition and multiplication for a field of order 4.

Forces: Calculating Them, and Using Them Shobhana Narasimhan JNCASR, Bangalore, India

5.3 Preconditioning. - M is easy to deal with in parallel (reduced approximate direct solver)

REVISION SHEET FP1 (MEI) ALGEBRA. Identities In mathematics, an identity is a statement which is true for all values of the variables it contains.

Nonlinear regression

Root Finding COS 323

We are mainly going to be concerned with power series in x, such as. (x)} converges - that is, lims N n

Note: we can take the Real and Imaginary part of the Schrödinger equation and write it in a way similar to the electromagnetic field. p = n!

( 1) n (4x + 1) n. n=0

Sequences, Mathematical Induction, and Recursion. CSE 2353 Discrete Computational Structures Spring 2018

First, note that the LS residuals are orthogonal to the regressors. X Xb X y = 0 ( normal equations ; (k 1) ) So,

Section A assesses the Units Numerical Analysis 1 and 2 Section B assesses the Unit Mathematics for Applied Mathematics

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

Random Matrices with Blocks of Intermediate Scale Strongly Correlated Band Matrices

Topic 9 - Taylor and MacLaurin Series

1 Duality revisited. AM 221: Advanced Optimization Spring 2016

CHAPTER 10 INFINITE SEQUENCES AND SERIES

Mathematics 116 HWK 21 Solutions 8.2 p580

A widely used display of protein shapes is based on the coordinates of the alpha carbons - - C α

A sequence of numbers is a function whose domain is the positive integers. We can see that the sequence

Chapter 2 The Solution of Numerical Algebraic and Transcendental Equations

Iterative Techniques for Solving Ax b -(3.8). Assume that the system has a unique solution. Let x be the solution. Then x A 1 b.

15.081J/6.251J Introduction to Mathematical Programming. Lecture 21: Primal Barrier Interior Point Algorithm

f(x) dx as we do. 2x dx x also diverges. Solution: We compute 2x dx lim

Discrete Mathematics for CS Spring 2007 Luca Trevisan Lecture 22

Solutions to Final Exam

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

(a) (b) All real numbers. (c) All real numbers. (d) None. to show the. (a) 3. (b) [ 7, 1) (c) ( 7, 1) (d) At x = 7. (a) (b)

P1 Chapter 8 :: Binomial Expansion

Zeros of Polynomials

Part I: Covers Sequence through Series Comparison Tests

Math 475, Problem Set #12: Answers

Physics 116A Solutions to Homework Set #1 Winter Boas, problem Use equation 1.8 to find a fraction describing

Complex Numbers Solutions

Assignment 1 : Real Numbers, Sequences. for n 1. Show that (x n ) converges. Further, by observing that x n+2 + x n+1

AP Calculus BC Review Applications of Derivatives (Chapter 4) and f,

Inverse Matrix. A meaning that matrix B is an inverse of matrix A.

Math 220B Final Exam Solutions March 18, 2002

Transcription:

The Method of Steepest Descet This is the quadratic fuctio from to that is costructed to have a miimum at the x that solves the system A x = b: g(x) = <x,ax> - 2<x,b> I the method of steepest descet, we pick a startig poit x 0 that we thik is close to the solutio. We start our search for the solutio by headig i the directio of steepest descet. This directio is the egative of the gradiet of g(x) at the startig poit x 0. v = -!g Because g(x) has a particularly simple form, we ca compute its gradiet explicitly ad otice that somethig iterestig happes whe we do:!g = Ûx 1 Ûx 2 Ú (x Ûx 0 ) If we work out the details i the partial derivatives, we get a pleasig result. This leads to (x) = 2 a Ûx k, i x i - 2 b k k i = 1!g = Ûx 1 Ûx 2 Ú (x Ûx 0 ) = 2 (A x 0 - b) = -2 r 0 where r 0 is the residual associated with x 0. 1

Now that we have a search directio, we ca costruct a ray startig at x 0 ad headig i the directio of v = -2 r 0. We wat the poit alog that ray that miimizes the quadratic form g(x). h(t) = g(x + t v) This fuctio has its miimum at the poit where h (t) = 0: h (t) = -2 <v,b-ax> + 2 t <v,av> = 0 t = <v,b-ax> <v,av> = <v,r> <v,av> Usig that value of t to compute x + t v gives us our ext poit. We the compute a residual at that poit to establish our search directio ad repeat the process. The big problem with the method of steepest descet is that it typically takes too may iteratios of the method to coverge to the solutio of the system. I fact, it takes so may iteratios that we ed up doig more work tha if we had just solved the system by Gauss elimiatio i the first place. The method does have two big advatages. The first is that the method ca be modified to coverge more quickly, as we shall see below. The secod is that a versio of this method works for oliear systems. There is o equivalet to Gauss elimiatio for oliear systems of equatios, so steepest descet is a primary method for dealig with oliear systems. The Cojugate Directio ad Cojugate Gradiet Methods The problem with the method of steepest descet is that the search directios it uses v = -2 r lead to bad covergece behavior. We ca fix this problem by pickig better search directios. The cojugate directio method solves a by system by usig a set of search vectors v (1), v (2),, v () that have a special property. The vectors v (k) are selected be A orthogoal. Vectors v (j) ad v (k) for j k are A orthogoal if <v (j),a v (k) > = 0 The problem with the cojugate directio method is that we typically wo't have a set of A orthogoal vectors just lyig aroud. We eed some scheme to geerate them. 2

The cojugate gradiet method uses a clever scheme to geerate the A orthogoal set of vectors. The process begis by pickig a startig poit for the search ad usig the steepest descet directio v (1) = r (0) as its first search directio. After coductig the first lie search we lad at the secod poit x (1) ad compute a residual r (1) there. For subsequet iteratios, we seek a ew search directio that satisfies the relatioship v (k) = r (k-1) + s k-1 v (k-1) The trick is to select s k-1 so that v (k-1) ad v (k) are A orthogoal: <v (k-1),a(r (k-1) + s k-1 v (k-1) )> = 0 <v (k-1),ar (k-1) > + s k-1 <v (k-1),a v (k-1) > = 0 s k-1 = - <v(k-1),a r (k-1) > <v (k-1),a v (k-1) > After iteratios of this scheme, we will arrive at the solutio of the origial system. Summary of the Method We have ow worked out all the details, but it might be useful to summarize ad codese our fidigs. g(x) = <x,ax> - 2<x,b> x (0) = our startig guess r (k) = b - A x (k) v (1) = r (0) 3

t k = <v(k),r (k-1) > <v (k),av (k) > x (k) = x (k-1) + t k v (k) s k = - <v(k),a r (k) > <v (k),a v (k) > v (k) = r (k-1) + s k-1 v (k-1) Acceleratig Covergece The cojugate gradiet method is effective, but we would also like to make it fast. Oe way to make it faster is to arrage for more rapid covergece of the sequece geerated by the method. It turs out that the mai thig that affects the speed of covergece of the sequece is the matrix A ad its eigevalue, eigevector structure. I a effort to make A behave more icely, a commo techique is to precoditio the matrix A by doig A $ = C -1 A (C -1 ) t for some appropriately chose C. We the use the cojugate gradiet method to compute a approximate solutio for the system where A $ x $ = b $ $ -1 b = C b x $ = C t x We use the latter equatio to solve for x at the ed of the process: x = (C t ) -1 x $ What are some ways to select precoditioig matrices C? Oe method is to set 4

1 C i, j = A i, i 0 i = j i j A secod method is to do a Cholesky decompositio o A: A = LL T C = L I practice, the actual Cholesky decompositio is ot used, because it is too expesive to compute. For A with a great may 0 etries, we ca compute a "approximate" Cholesky decompositio by doig 1. Force L to have the same patter of o-zero terms as A. 2. Use the Cholesky formulas to compute oly those etries of L that we thik should be ozero. Sice the case of positive defiite A with may 0 etries arises frequetly i applicatios, this is a useful approach. 5