Chapter 3. Linear and Nonlinear Systems

Similar documents
Chapter 1: Systems of linear equations and matrices. Section 1.1: Introduction to systems of linear equations

Section 9.2: Matrices. Definition: A matrix A consists of a rectangular array of numbers, or elements, arranged in m rows and n columns.

Linear Algebra. Matrices Operations. Consider, for example, a system of equations such as x + 2y z + 4w = 0, 3x 4y + 2z 6w = 0, x 3y 2z + w = 0.

MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS. + + x 1 x 2. x n 8 (4) 3 4 2

Section 9.2: Matrices.. a m1 a m2 a mn

CS 246 Review of Linear Algebra 01/17/19

Basic Concepts in Linear Algebra

Matrices and Linear Algebra

Review of Basic Concepts in Linear Algebra

Review of Linear Algebra

Math 489AB Exercises for Chapter 2 Fall Section 2.3

Linear Algebra and Matrix Inversion

Chapter 5. Linear Algebra. A linear (algebraic) equation in. unknowns, x 1, x 2,..., x n, is. an equation of the form

Introduction to Matrices

Knowledge Discovery and Data Mining 1 (VO) ( )

Foundations of Matrix Analysis

Econ Slides from Lecture 7

A matrix is a rectangular array of. objects arranged in rows and columns. The objects are called the entries. is called the size of the matrix, and

A matrix is a rectangular array of. objects arranged in rows and columns. The objects are called the entries. is called the size of the matrix, and

1 Matrices and Systems of Linear Equations. a 1n a 2n

Linear Algebra March 16, 2019

Jim Lambers MAT 610 Summer Session Lecture 1 Notes

Phys 201. Matrices and Determinants

MATH2210 Notebook 2 Spring 2018

Introduction. Vectors and Matrices. Vectors [1] Vectors [2]

Math 4A Notes. Written by Victoria Kala Last updated June 11, 2017

Chapter 4 - MATRIX ALGEBRA. ... a 2j... a 2n. a i1 a i2... a ij... a in

Math Linear Algebra Final Exam Review Sheet

Linear Algebra. The analysis of many models in the social sciences reduces to the study of systems of equations.

1 Multiply Eq. E i by λ 0: (λe i ) (E i ) 2 Multiply Eq. E j by λ and add to Eq. E i : (E i + λe j ) (E i )

Matrix & Linear Algebra

Solving Linear Systems of Equations

CPE 310: Numerical Analysis for Engineers

Systems of Linear Equations. By: Tri Atmojo Kusmayadi and Mardiyana Mathematics Education Sebelas Maret University

and let s calculate the image of some vectors under the transformation T.

Elementary Linear Algebra

Numerical Linear Algebra Homework Assignment - Week 2

Study Notes on Matrices & Determinants for GATE 2017

Math 313 Chapter 1 Review

Chapter 7. Tridiagonal linear systems. Solving tridiagonal systems of equations. and subdiagonal. E.g. a 21 a 22 a A =

4. Determinants.

JUST THE MATHS UNIT NUMBER 9.8. MATRICES 8 (Characteristic properties) & (Similarity transformations) A.J.Hobson

A Review of Matrix Analysis

Fundamentals of Engineering Analysis (650163)

Matrices and systems of linear equations

Linear Algebra: Matrix Eigenvalue Problems

Eigen Values and Eigen Vectors - GATE Study Material in PDF

Elementary maths for GMT

CHAPTER 3. Matrix Eigenvalue Problems

Review Let A, B, and C be matrices of the same size, and let r and s be scalars. Then

Prepared by: M. S. KumarSwamy, TGT(Maths) Page

Matrix Algebra. Matrix Algebra. Chapter 8 - S&B

AN ELEMENTARY PROOF OF THE SPECTRAL RADIUS FORMULA FOR MATRICES

Topics. Vectors (column matrices): Vector addition and scalar multiplication The matrix of a linear function y Ax The elements of a matrix A : A ij

Systems of Linear Equations and Matrices

MATRICES. a m,1 a m,n A =

Eigenvalues and Eigenvectors

Systems of Linear Equations and Matrices

POLI270 - Linear Algebra

Chapter 1 Matrices and Systems of Equations

Introduction to Matrices

MAT 610: Numerical Linear Algebra. James V. Lambers

3 Matrix Algebra. 3.1 Operations on matrices

Chapter 3. Determinants and Eigenvalues

MATH 315 Linear Algebra Homework #1 Assigned: August 20, 2018

Computational Methods CMSC/AMSC/MAPL 460. Eigenvalues and Eigenvectors. Ramani Duraiswami, Dept. of Computer Science

Linear Algebra and Matrices

MATRICES AND ITS APPLICATIONS

MATRICES. knowledge on matrices Knowledge on matrix operations. Matrix as a tool of solving linear equations with two or three unknowns.

0.1 Rational Canonical Forms

Linear Algebra Highlights

MATH 240 Spring, Chapter 1: Linear Equations and Matrices

Notes on Linear Algebra and Matrix Theory

2. Linear algebra. matrices and vectors. linear equations. range and nullspace of matrices. function of vectors, gradient and Hessian

Math Camp Lecture 4: Linear Algebra. Xiao Yu Wang. Aug 2010 MIT. Xiao Yu Wang (MIT) Math Camp /10 1 / 88

LECTURE NOTES ELEMENTARY NUMERICAL METHODS. Eusebius Doedel

MTH 464: Computational Linear Algebra

Numerical Analysis Lecture Notes

ELE/MCE 503 Linear Algebra Facts Fall 2018

Determinants by Cofactor Expansion (III)

Linear Systems and Matrices

MAT Linear Algebra Collection of sample exams

Introduction to Matrix Algebra

Matrices and Vectors. Definition of Matrix. An MxN matrix A is a two-dimensional array of numbers A =

GAUSSIAN ELIMINATION AND LU DECOMPOSITION (SUPPLEMENT FOR MA511)

Online Exercises for Linear Algebra XM511

Linear Algebra review Powers of a diagonalizable matrix Spectral decomposition

Matrix Arithmetic. j=1

Chapter 2: Matrices and Linear Systems

Finite Mathematics Chapter 2. where a, b, c, d, h, and k are real numbers and neither a and b nor c and d are both zero.

OHSx XM511 Linear Algebra: Solutions to Online True/False Exercises

Eigenvalues and Eigenvectors

Matrix operations Linear Algebra with Computer Science Application

Chapter 3 Transformations

Math Bootcamp An p-dimensional vector is p numbers put together. Written as. x 1 x =. x p

Linear Algebra: Lecture notes from Kolman and Hill 9th edition.

Chapter 4 No. 4.0 Answer True or False to the following. Give reasons for your answers.

Numerical Linear Algebra

Chapter 2 Notes, Linear Algebra 5e Lay

Linear Algebra Primer

Transcription:

59 An expert is someone who knows some of the worst mistakes that can be made in his subject, and how to avoid them Werner Heisenberg (1901-1976) Chapter 3 Linear and Nonlinear Systems In this chapter we investigate direct methods and iterative methods for solving linear systems of equations of the form E 1 : a 11 x 1 + a 12 x 2 + + a 1j x j + + a 1n x n = b 1 E 2 : a 21 x 1 + a 22 x 2 + + a 2j x j + + a 2n x n = b 2 E i : a i1 x 1 + a i2 x 2 + + a ij x j + + x in x n = b i (31) E m : a m1 x 1 + a m2 x 2 + + a mj x j + + a mn x n = b m where the coefficient in the ith row and jth column is denoted by a ij for i =1,,m and j =1,n These coefficients of the system are assumed to be known and the quantities on the right-hand side b i for i =1,m are also assumed to be known The equations are labeled E 1,E 2,,E m and the problem is to solve for, if possible, the unknowns x 1,x 2,,x n This system represents m linear equations from which n unknowns must be determined The system is referred to as a m n (read m by n) linear system of equations since it has m equations with n unknowns, with m not necessarily equal to n The above system of equations can be written in alternative forms One compact form is to write the system of equations as E i : n! a ij x j = b i, i =1, 2,,m (32) j=1 The system of equations (31) can also be expressed in the matrix form Ax = b (33)

60 where a 11 a 12 a 1n a A =[a ij ]= 21 a 22 a 2n, x = a m1 a m2 a mn x 1 x 2 x n 1 x n, b = b 2 b 1 b m, (34) Here it is to be understood that i, j are integers in the range 1 i m and 1 j n with a ij denoting the matrix element in the ith row and jth column The matrix A is called a m n (read m by n) matrix The special n 1 matrix x is called a n-dimensional column vector with x i, 1 i n, the element in the ith row Similarly, the special m 1 matrix b is called a m-dimensional column vector with b k denoting the element in the kth row, for 1 k m We will examine direct methods of solution and iterative methods for solving such systems We also investigate methods for solving nonlinear systems of equations of the form E 1 : E 2 : E m : f 1 (x 1,x 2,,x n )=0 f 2 (x 1,x 2,,x n )=0 f m (x 1,x 2,,x n )=0 where f i, for i =1,,m, represent known continuous functions of the variables x 1,x 2,,x n The problem is to Þnd, if possible, some numerical procedure for determining the unknown quantities x 1,x 2,,x n which satisfy all of the nonlinear equations in the system of equations (35) Note that the system of equations (31) is a special case of the more general system of equations (35) The system of nonlinear equations (35) can be written in the vector form f(x) =0 where x = x 1 x 2 x n and f(x) = f 1 (x 1,x 2,,x n ) f 2 (x 1,x 2,,x n f m (x 1,x 2,,x n ) (35) (36) with 0 being an m-dimensional column vector of zeros We develop several numerical methods which are applicable for solving systems of the above types We desire to develop numerical methods for solving the above systems of equations in cases where the numbers m and n are large In some applied problems it is not unusual for the number of unknowns x 1,x 2,,x n

61 to be very large, say n>10 5 or n>10 6 For illustrative purposes and examples of the numerical techniques we will use much smaller values for n Preliminaries In dealing with the matrix form associated with the equations (32) there are occasions when it is convenient to make use of special matrices and special functions associated with these matrices Some of these special matrices and functions are as follows ( 1, if i = j (i) The n n identity matrix I =[δ ij ],whereδ ij = The identity 0, if i "= j matrix has 1 s down the main diagonal and 0 s everywhere else A 3 3 identity matrix has the form I = 1 0 0 0 1 0 0 0 1 (ii) If A is a n n square matrix and there exists a matrix B with the property that BA = AB = I, thenb is called the inverse of A and is written B = A 1 That is, the inverse matrix A 1 has the property that AA 1 = A 1 A = I If A 1 exists, then A is said to be nonsingular If the matrix A does not have an inverse, then the matrix A is said to be singular In the special case m = n and A is a square matrix having an inverse, then the solution to the matrix system (33) can be symbolically obtained by multiplying both sides of the matrix equation (33) by the inverse of A to obtain A 1 Ax = A 1 b which simpliþes to x = I x = A 1 b This technique for solving a n n linear system of equations is only recommendedinthecasewheren is small This is because the number of multiply and divided operations needed to calculate A 1 increases like n 3 and so the inverse matrix calculation becomes very lengthy and burdensome when n is large (iii) Associated with a m n matrix A =[a ij ] is the n m transpose matrix denoted by A T =[a ji ] and formed by interchanging the rows and columns of the m n matrix A Note that column vectors such as the x given in equation (34) can be expressed as x =[x 1,x 2,,x n ] T The transpose is used to determine if a matrix is symmetric A square matrix is said to be symmetric if A T = A The matrix product A T A always produces a square matrix The transpose

62 operation satisþes the following properties (i) (AB) T =B T A T (ii) (A 1 ) T =(A T ) 1 (iii) (A T ) T =A The transpose of a product is the product of the transpose matrices in reverse order When A 1 exists, then the transpose of an inverse is the inverse of the transpose The transpose of the transpose matrix returns the original matrix (iv) A n n lower triangular matrix L =[" ij ] has the form " 11 " 21 " 22 L = " 31 " 32 " 33 " n1 " n2 " n3 " nn " ij =0for j>i (37) with all zeros above the main diagonal In the special case the diagonal elements of a lower triangular matrix are all 1 s, then L is called a unit lower triangular matrix (v) A n n upper triangular matrix U =[u ij ] has the form U = u 11 u 12 u 13 u 1n u 22 u 23 u 2n u 33 u 3n u nn u ij =0for i>j (38) with all zeros below the main diagonal In the special case the diagonal elements of an upper triangular matrix are all 1 s, then U is called a unit upper triangular matrix (vi) A n n square matrix A =[a ij ] with the property that ( 0, for i + s j, 1 <s<n a ij = 0, for j + t i, 1 <t<n (39) is said to be a banded matrix with band width w = s + t 1 For example, the 5 5 tridiagonal matrix a 11 a 12 0 0 0 a 21 a 22 a 23 0 0 0 a 32 a 33 a 34 0 0 0 a 43 a 44 a 45 0 0 0 a 54 a 55 (310)

63 is a banded matrix with band width 3 Tridiagonal matrices occur quite frequently The elements in a tridiagonal matrix are along the diagonal, the subdiagonal and superdiagonal Tridiagonal system of equations are easily solved Tridiagonal matrices are sometimes denoted using the notation T = tridiagonal(#a, # b, #c) where #a =(a 1,a 2,a 3,a 4,,a n 1,a n ) is vector along the subdiagonal # b =(b1,b 2,b 3,b 4,,b n 1,b n ) is vector along the diagonal #c =(c 1,c 2,c 3,c 4,,c n 1,c n ) is vector along the superdiagonal This is a shorthand notation for the tridiagonal matrix b 1 c 1 T =[t ij ]= a 2 b 2 c 2 a 3 b 3 c 3 a n 1 b n 1 c n 1 a n b n with ( tij =0, i > j +2 t ij =0, j > i+2 Another matrix which is nice to work with is the diagonal matrix with band widthoneanexampleofa5 5 diagonal matrix is a 11 0 0 0 0 0 a 22 0 0 0 D = 0 0 a 33 0 0 0 0 0 a 44 0 0 0 0 0 a 55 (vii) A n n matrix A with diagonal elements a ii for i =1, 2,,n and satisfying a ii (311) n! a ij for i =1, 2,n (312) j=1 j!=i is said to be diagonally dominant (viii) A vector norm % % is a mapping from R n to R which is a measure of distance associated with vectors x =[x 1,x 2,,x n ] T Vector norms have the following properties: (a) % x % 0 for all x R n (b) % αx %= α % x % where α is a scalar R

64 (c) % x + y % %x % + % y % for all x, y R n (d) % x %= 0if and only if x =0 The Euclidean norm or " 2 norm is used most often and is deþned ) n * 1/2! % x % 2 = x 2 i (313) i=1 This norm represents the distance of the point (x 1,x 2,,x n ) from the origin Other vector norms can be deþned so long as they obey the above properties Two other vector norms used quite frequently are the " norm and " p norm deþned respectively as % x % =max x i 1 i n ) n * 1/p! (314) % x % p = x i p i=1 (ix) A matrix norm % A % associated with a n n matrix A =[a ij ] is any real-valued function % % which satisþes the properties: (a) % A % 0 (b) % αa % α % A % where α is a scalar R (c) % A + B % %A % + % B %, wherea, B are n n matrices (d) % AB % %A %% B % (e) % A %= 0if and only if a ij =0for all i, j values The quantity % A B % is used to measure the nearness of two n n matrices Let % % denote a vector norm, then the natural matrix norm of a n n matrix A is deþned % A %= max $x$=1 % Ax % (315) For example, the " 2 norm of the n n matrix A would be represented and the " norm of A would be represented % A % 2 = max $x$ 2 =1 % Ax % 2 (316) % A % = max % Ax % (317) $x$ =1 It can be shown, see the Bronson reference, that if A =(a ij ) is a n n matrix, then % A % = max 1 i n j=1 n! a ij

65 (x) The characteristic polynomial associated with the real n n matrix A is deþnedintermsofadeterminantandgivenby p(λ) =det [A λi] (318) where λ is a scalar and I is the n n identity matrix The values of λ which satisfy the characteristic equation p(λ) =0are called eigenvalues associated with the matrix A Nonzero vectors x with the property that Ax = λx are called eigenvectors of A corresponding to the eigenvalue λ (xi) The spectral radius ρ(a) of the n n matrix A is deþned ρ(a) =max λ (319) where λ is an eigenvalue of A If an eigenvalue is complex with the value λ = λ 1 + iλ 2,then λ = + λ 2 1 + λ2 2 It can be shown that the spectral radius has the properties (i) ρ(a) % A % for any matrix norm % % (ii), ρ(a T A)=% A % 2 (xii) The condition number K(A) associated with a nonsingular matrix A is deþned K(A) =% A %% A 1 % (320) where % % is a natural norm A matrix A is said to be well behaved or wellconditioned if its condition number is close to unity It is called ill behaved or ill-conditioned if the condition number is far from unity Great care should be taken when working with ill-conditioned matrices Eigenvalues and Eigenvectors A n n square matrix A can be used as an operator to transform a nonzero n 1 column vector #x One can imagine an input-output system such as the one illustrated in the Þgure 3-1 Figure 3-1 An example of a linear operator

66 Those nonzero vectors #x having the special property that the output is proportional to the input must satisfy A#x = λ#x (321) where λ is a scalar proportionality constant The special nonzero vectors #x satisfying equation (321) are called eigenvectors and the corresponding scalars λ are called eigenvalues The equation (321) is equivalent to the homogeneous system A#x λ#x =(A λi)#x = #0 (322) where I is the n n identity matrix The equation (322) has a nonzero solution if and only if det (A λi) = A λi =0 (323) The equation (323) when expanded is a polynomial equation in λ of degree n having the form C(λ) = A λi =( λ) n + c n 1 ( λ) n 1 + + c 1 ( λ)+c 0 =0 (324) which is called the characteristic equation associated with the square matrix A The solutions of equation (324) give the eigenvalues associated with the n n square matrix A For any given eigenvalue λ, thematrixa λi is a singular matrix such that the homogeneous equations (322) produce a nonzero eigenvector #x Note that if #x is an eigenvector, then any nonzero constant times #x is also an eigenvector Example 3-1 (Eigenvalues and Eigenvectors) Find the eigenvalues and eigenvectors associated with the matrix A = - 1 0 1 1 Solution: We construct the homogeneous system (A λi)#x = #0 or The characteristic equation is found to be - 1 λ 0 1 1 λ - x1 = x 2 C(λ) = / 1 λ 0 1 1 λ / =(1 λ)2 =0-0 (325) 0

67 The eigenvalues are λ 1 =1and λ 2 =1 For λ =1, the equation (325) reduces to - 0 0 1 0 - x1 = x 2-0 0 - Therefore, 0 #x = k is an eigenvector for any nonzero constant k 1 Elementary Row Operations To solve the system of equations (31) one is allowed to perform any of the following elementary row operations on the system of equations (i)anequationinrowi can be multiplied by a nonzero constant α That is, equation E i is replaced by the equation αe i This is denoted by the notation (αe i E i ) and is read, The constant α times equation E i replaces the equation E i (ii) Equation E j can be replaced by a multiple of equation E i added to equation E j This can be expressed using the above notation as (αe i + E j E j ) where i "= j (iii) Any two equations can be interchanged This is denoted by the notation (E i E j ) where i "= j Example 3-2 (Elementary row matrices) Row operations performed upon the identity matrix I produces elementary row matrices E Consider the 3 3 identity matrix I = examples of elementary row matrices are: (i) Interchanging rows 2 and 3 gives the elementary row matrix E 1 = (ii) Multiplying row 3 by the scalar 5 gives the elementary row matrix E 2 = 1 0 0 0 1 0 0 0 1 1 0 0 0 0 1 0 1 0 1 0 0 0 1 0 0 0 5 1 0 0 6 1 0 0 0 1 (iii) Multiplying row 1 by 6 and adding the result to row 2 gives the elementary row matrix E 3 = Note that if A = E 1 A = d e f a b c g h i a b c d e f g h i, E 2 A = is a 3 3 matrix, then a b c d e f, E 3 A = 5g 5h 5i, thensome d +6a e+6b f +6c a b c g h i

68 Observe that the elementary row operations recorded in the matrices E 1,E 2,E 3 have been applied to the matrix A A shorthand notation for recording the row operations performed upon a linear system of equations is to write down the coefficient matrix A of the linear system Ax = b and then append to the right-hand side of A the column vector b The resulting array is called an augmented matrix and is written a 11 a 12 a 1n a [A b] = 21 a 22 a 2n a m1 a m2 a mn Our objective is to perform row operations upon the resulting array and try to reduce the array A to an upper triangular form The row operations are then recorded in the augmented column vector For example, consider the 2 3 augmented array ) * a 11 a 12 b 1, a 21 a 22 b 2 with a 11 nonzero, where we multiply the Þrst row by a 21 /a 11 and add the result to row 2 to obtain the triangular system ) a 11 a 12 b 1 a 21 a 22 b 2 * E 1 : a 21 a 11 E 1 + E 2 E 2 : b 1 b 2 b m ) * a 11 a 12 b 1 0 c 22 d 2 where c 22 = a 22 a 21 a 12 /a 11 and d 2 = b 2 a 21 b 1 /a 11 The nonzero element a 11 is called a pivot element in the diagonalization process The resulting upper triangular systemcanthenbesolvedbybacksubstitutionmethods Gaussian Elimination The Gaussian elimination method reduces a matrix to upper triangular (or lower triangular) and then uses back substitution to solve for the unknowns The method is illustrated using an example Example 3-3 (Gaussian elimination method) Solve the system of equations E 1 : 3x 1 2x 2 + x 3 =8 E 2 : 4x 1 + x 2 3x 3 =3 E 3 : x 1 +5x 2 4x 3 =5 (326)