lecture 16: Introduction to Least Squares Approximation

Similar documents
We will begin by supplying the proof to (a).

1.3 Continuous Functions and Riemann Sums

Week 13 Notes: 1) Riemann Sum. Aim: Compute Area Under a Graph. Suppose we want to find out the area of a graph, like the one on the right:

0 otherwise. sin( nx)sin( kx) 0 otherwise. cos( nx) sin( kx) dx 0 for all integers n, k.

Lecture 19: Continuous Least Squares Approximation

Review of the Riemann Integral

FOURIER SERIES PART I: DEFINITIONS AND EXAMPLES. To a 2π-periodic function f(x) we will associate a trigonometric series. a n cos(nx) + b n sin(nx),

1. (25 points) Use the limit definition of the definite integral and the sum formulas to compute. [1 x + x2

( ) k ( ) 1 T n 1 x = xk. Geometric series obtained directly from the definition. = 1 1 x. See also Scalars 9.1 ADV-1: lim n.

EVALUATING DEFINITE INTEGRALS

MATH 104 FINAL SOLUTIONS. 1. (2 points each) Mark each of the following as True or False. No justification is required. y n = x 1 + x x n n

Sequence and Series of Functions

Approximations of Definite Integrals

Chapter 5. The Riemann Integral. 5.1 The Riemann integral Partitions and lower and upper integrals. Note: 1.5 lectures

Limit of a function:

INFINITE SERIES. ,... having infinite number of terms is called infinite sequence and its indicated sum, i.e., a 1

Chapter 7 Infinite Series

MAS221 Analysis, Semester 2 Exercises

Chapter System of Equations

Taylor Polynomials. The Tangent Line. (a, f (a)) and has the same slope as the curve y = f (x) at that point. It is the best

General properties of definite integrals

Vectors. Vectors in Plane ( 2

The Definite Integral

The Basic Properties of the Integral

Math 104: Final exam solutions

lecture 24: Gaussian quadrature rules: fundamentals

Quadrature Methods for Numerical Integration

MATRIX ALGEBRA, Systems Linear Equations

POWER SERIES R. E. SHOWALTER

THE NATIONAL UNIVERSITY OF IRELAND, CORK COLÁISTE NA hollscoile, CORCAIGH UNIVERSITY COLLEGE, CORK SUMMER EXAMINATION 2005 FIRST ENGINEERING

Definite Integral. The Left and Right Sums

INTEGRATION TECHNIQUES (TRIG, LOG, EXP FUNCTIONS)

B. Examples 1. Finite Sums finite sums are an example of Riemann Sums in which each subinterval has the same length and the same x i

The Weierstrass Approximation Theorem

Approximate Integration

Convergence rates of approximate sums of Riemann integrals

MA123, Chapter 9: Computing some integrals (pp )

Test Info. Test may change slightly.

UNIVERSITY OF BRISTOL. Examination for the Degrees of B.Sc. and M.Sci. (Level C/4) ANALYSIS 1B, SOLUTIONS MATH (Paper Code MATH-10006)

Numbers (Part I) -- Solutions

The Reimann Integral is a formal limit definition of a definite integral

Infinite Series Sequences: terms nth term Listing Terms of a Sequence 2 n recursively defined n+1 Pattern Recognition for Sequences Ex:

Calculus II Homework: The Integral Test and Estimation of Sums Page 1

Remarks: (a) The Dirac delta is the function zero on the domain R {0}.

Linear Programming. Preliminaries

1 Tangent Line Problem

10.5 Test Info. Test may change slightly.

Course 121, , Test III (JF Hilary Term)

n 2 + 3n + 1 4n = n2 + 3n + 1 n n 2 = n + 1

PROGRESSIONS AND SERIES

3.7 The Lebesgue integral

18.01 Calculus Jason Starr Fall 2005

Closed Newton-Cotes Integration

MTH 146 Class 16 Notes

Numerical Solution of Fuzzy Fredholm Integral Equations of the Second Kind using Bernstein Polynomials

Lecture 2: Matrix Algebra

Convergence rates of approximate sums of Riemann integrals

Crushed Notes on MATH132: Calculus

F x = 2x λy 2 z 3 = 0 (1) F y = 2y λ2xyz 3 = 0 (2) F z = 2z λ3xy 2 z 2 = 0 (3) F λ = (xy 2 z 3 2) = 0. (4) 2z 3xy 2 z 2. 2x y 2 z 3 = 2y 2xyz 3 = ) 2

Statistics for Financial Engineering Session 1: Linear Algebra Review March 18 th, 2006

INTEGRATION IN THEORY

The Definite Riemann Integral

( ) dx ; f ( x ) is height and Δx is

f(bx) dx = f dx = dx l dx f(0) log b x a + l log b a 2ɛ log b a.

Riemann Integration. Chapter 1

f(t)dt 2δ f(x) f(t)dt 0 and b f(t)dt = 0 gives F (b) = 0. Since F is increasing, this means that

y udv uv y v du 7.1 INTEGRATION BY PARTS

Schrödinger Equation Via Laplace-Beltrami Operator

Review of Sections

Background 1. Cramer-Rao inequality

Principles of Mathematical Analysis

Math 3B Midterm Review

Math 140B - Notes. Neil Donaldson. September 2, 2009

BC Calculus Path to a Five Problems

Probability and Stochastic Processes: A Friendly Introduction for Electrical and Computer Engineers Roy D. Yates and David J.

A GENERALIZATION OF GAUSS THEOREM ON QUADRATIC FORMS

Important Facts You Need To Know/Review:

( a n ) converges or diverges.

Topic 4 Fourier Series. Today

Frequency-domain Characteristics of Discrete-time LTI Systems

Some New Iterative Methods Based on Composite Trapezoidal Rule for Solving Nonlinear Equations

M3P14 EXAMPLE SHEET 1 SOLUTIONS

Notes 17 Sturm-Liouville Theory

In an algebraic expression of the form (1), like terms are terms with the same power of the variables (in this case

A GENERAL METHOD FOR SOLVING ORDINARY DIFFERENTIAL EQUATIONS: THE FROBENIUS (OR SERIES) METHOD

MATH 104: INTRODUCTORY ANALYSIS SPRING 2008/09 PROBLEM SET 10 SOLUTIONS. f m. and. f m = 0. and x i = a + i. a + i. a + n 2. n(n + 1) = a(b a) +

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

Numerical Solutions of Fredholm Integral Equations Using Bernstein Polynomials

DETERMINANT. = 0. The expression a 1. is called a determinant of the second order, and is denoted by : y + c 1

10.5 Power Series. In this section, we are going to start talking about power series. A power series is a series of the form

[ 20 ] 1. Inequality exists only between two real numbers (not complex numbers). 2. If a be any real number then one and only one of there hold.

SUTCLIFFE S NOTES: CALCULUS 2 SWOKOWSKI S CHAPTER 11

Solution of the exam in TMA4212 Monday 23rd May 2013 Time: 9:00 13:00

2.1.1 Definition The Z-transform of a sequence x [n] is simply defined as (2.1) X re x k re x k r

Similar idea to multiplication in N, C. Divide and conquer approach provides unexpected improvements. Naïve matrix multiplication

Exploring the Rate of Convergence of Approximations to the Riemann Integral

Simpson s 1/3 rd Rule of Integration

Fast Fourier Transform 1) Legendre s Interpolation 2) Vandermonde Matrix 3) Roots of Unity 4) Polynomial Evaluation

Certain sufficient conditions on N, p n, q n k summability of orthogonal series

ANALYSIS HW 3. f(x + y) = f(x) + f(y) for all real x, y. Demonstration: Let f be such a function. Since f is smooth, f exists.

Transcription:

97 lecture 16: Itroductio to Lest Squres Approximtio.4 Lest squres pproximtio The miimx criterio is ituitive objective for pproximtig fuctio. However, i my cses it is more ppelig (for both computtio d for the give pplictio) to fid pproximtio to f tht miimizes the itegrl of the squre of the error. Give f C[, b], fid P P such tht 1/ 1/. ( f (x) P (x)) dx mi ( f (x) p(x)) dx pp This is exmple of lest squres problem..4.1 Ier products for fuctio spces To fcilitte the developmet of lest squres pproximtio theory, we itroduce forml structure for C[, b]. First, recogize tht C[, b] is lier spce: y lier combitio of cotiuous fuctios o [, b] must itself be cotiuous o [, b]. Defiitio.. The ier product of the fuctios f, g C[, b] is h f, gi f (x)g(x) dx. The ier product stisfies the followig bsic xioms: h f + g, hi h f, hi + hg, hi for ll f, g, h C[, b] d ll IR; h f, gi hg, f i for ll f, g C[, b]; h f, f i for ll f C[, b]. With this ier product we ssocite the orm For simplicity we re ssumig tht f d g re rel-vlued. To hdle complex-vlued fuctios, oe geerlizes the ier product to h f, gi f (x)g(x) dx, which the gives h f, gi hg, f i. 1/. k f k : h f, f i 1/ f (x) dx This is ofte clled the L orm, where the superscript i L refers to the fct tht the itegrd ivolves the squre of the fuctio f ; the L stds for Lebesgue, comig from the fct tht this ier product c be geerlized from C[, b] to the set of ll fuctios tht re squre-itegrble, i the sese of Lebesgue itegrtio. By restrictig our ttetio to cotiuous fuctios, we dodge the mesuretheoretic complexities. The Lebesgue theory gives more robust defiitio of the itegrl th the covetiol Riem pproch. With such otios oe c exted lest squres pproximtio beyod C[, b], to more exotic fuctio spces.

98.4. Lest squres miimiztio vi clculus We re ow redy to solve the lest squres problem. We shll cll the optiml polyomil P P, i.e., k f P k mi pp k f pk. We c solve this miimiztio problem usig bsic clculus. Cosider this exmple for 1, where we optimize the error over polyomils of the form p(x) c + c 1 x. The polyomil tht miimizes k f pk will lso miimize its squre, k f pk. For y give p P 1, defie the error fuctio E(c, c 1 ) : k f (x) (c + c 1 x)k L ( f (x) c c 1 x) dx f (x) f (x)(c + c 1 x)+(c + c c 1 x + c 1 x ) dx f (x) dx c f (x) dx To fid the optiml polyomil, P, optimize E over c d c 1, i.e., fid the vlues of c d c 1 for which c 1 xf(x) dx + c (b )+c c 1 (b )+ 1 c 1 (b ). First, compute. c c 1 Z b f (x) dx + c (b )+c c 1 (b ) Z b xf(x) dx + c (b )+c c 1 (b ). 1 Settig these prtil derivtives equl to zero yields c (b )+c 1 (b ) f (x) dx c (b )+c 1 (b ) xf(x) dx. These equtios, lier i the ukows c d c 1, c be writte i the mtrix form " (b ) b # b (b ) #" c c 1 4 R b R b f (x) dx xf(x) dx Whe b 6 this system lwys hs uique solutio. The resultig c d c 1 re the coefficiets for the moomil-bsis expsio of the lest squres pproximtio P P 1 to f o [, b]. 5.

99 Exmple.4 ( f (x) e x ). Apply this result to f (x) e x for x [, 1]. Sice Z 1 e x dx e 1, Z 1 xe x dx e x (x 1) 1 x 1, we must solve the system " 1 #" c # " e # 1 c 1. The desired solutio is c 4e 1, c 1 18 6e. Figure.7 compres f to this lest squres pproximtio P d the miimx pproximtio p computed erlier..5 1.5 1.5.1...4.5.6.7.8.9 1 x Figure.7: Top: Approximtio of f (x) e x (blue) over x [, 1] vi lest squres (P, show i red) d miimx (p, show s gry lie). Bottom: Error curves for lest squres, f P (red), d miimx, f p (gry) pproximtio. While the curves hve similr shpe, ote tht the red curve does ot tti its mximum devitio from f t + poits, while the gry oe does.. error, f (x) p(x).15.1.5 -.5 -.1 lest squres miimx -.15.1...4.5.6.7.8.9 1 x We c see from the plots i Figure.7 tht the pproximtio looks decet to the eye, but the error is ot terribly smll. We c I fct, k f P k.677.... This is decrese tht error by icresig the degree of the pproximtig polyomil. Just s we used -by- lier system to fid the best lier pproximtio, geerl ( + 1)-by-( + 1) lier system c be costructed to yield the degree- lest squres pproximtio. ideed smller th the -orm error of the miimx pproximtio p : k f p k.78....

1.4. Geerl polyomil bses Note tht we performed the bove miimiztio i the moomil bsis: p(x) c + c 1 x is lier combitio of 1 d x. Our experiece with iterpoltio suggests tht differet choices for the bsis my yield pproximtio lgorithms with superior umericl properties. Thus, we develop the form of the pproximtig polyomil i rbitrry bsis. Suppose {f k } is bsis for P. Ay p P c be writte s p(x) The error expressio tkes the form c k f k (x). E(c,...,c ) : k f (x) p(x)k L f (x) c k f k (x) dx h f, f i c k h f, f k i + ` To miimize E, we seek criticl vlues of c [c,...,c +1 ] T IR +1, i.e., we wt coefficiets where the grdiet of E with respect to c is zero: r c E. To compute this grdiet, evlute / for j,..., : h f, f i c k h f, f k i + h f, f j i + c j hf j, f j i + k6j ` c k c j hf k, f j i + ` `6j c j c`hf j, f`i + I this lst lie, we hve broke the double sum o the previous lie ito four prts: oe tht cotis c j, two tht coti c j (c k c j for k 6 j; c j c` for ` 6 j), d oe (the double sum) tht does ot ivolve c j t ll. This decompositio mkes it esier to compute the derivtive: c j c hf j, f j i + j k6j c k c j hf k, f j i + c j hf j, f j i + c j hf j, f j i + k6j k6j ` `6j c j c`hf j, f`i + c k hf k, f j i + c k hf k, f j i. ` `6j ` k6j `6j c`hf j, f`i +. ` k6j `6j

11 These terms cotribute to / to give (.1) h f, f i + j c k hf k, f j i. To miimize E, set / for j,...,, which gives the + 1 equtios (.1) c k hf k, f j i h f, f j i, j,...,, i the + 1 ukows c,...,c. Sice these equtios re lier i the ukows, write them i mtrix form: hf, f i hf, f 1 i hf, f i c h f, f i hf 1, f i hf 1, f 1 i. c 1. h f, f 6. 1 i, 4.... 7 6 7 6 5 4 5 4. 7 5 hf, f i hf, f 1 i hf, f i h f, f i which we deote Gc b. The mtrix G is clled the Grm mtrix. Usig this mtrix-vector ottio, we c ccumulte the prtil derivtives formuls (.1) for E ito the grdiet r c E Gc b. Sice c is criticl poit if d oly if r c E(c), we must sk: How my criticl poits re there? Equivletly, how my c solve Gc b? If c is criticl poit, is it (locl or eve globl) miimum? We will swer the first questio by showig tht G is ivertible, d hece E hs uique criticl poit. To swer the secod questio, we must ispect the Hessi c r c E r c (r c E)G. The criticl poit c is locl miimum if d oly if the Hessi is symmetric positive defiite. The symmetry of the ier product implies hf j, f k i hf k, f j i, d hece G is symmetric. (I this cse, symmetry lso follows from the equivlece of mixed prtil deritivtes.) The followig theorem cofirms tht G is ideed positive defiite. A mtrix G is positive defiite provided z Gz > for ll z 6.