Chapter 0 of Calculus ++, Differential calculus with several variables

Similar documents
Midterm 1 Review. Distance = (x 1 x 0 ) 2 + (y 1 y 0 ) 2.

Vector Functions & Space Curves MATH 2110Q

MITOCW ocw-18_02-f07-lec02_220k

MATH 12 CLASS 5 NOTES, SEP

Chapter 5 of Calculus ++ : Description and prediction of motion

MATH 423/ Note that the algebraic operations on the right hand side are vector subtraction and scalar multiplication.

CHAPTER 6 VECTOR CALCULUS. We ve spent a lot of time so far just looking at all the different ways you can graph

Chapter 1 of Calculus ++ : Differential calculus with several variables

Course Notes Math 275 Boise State University. Shari Ultman

Span and Linear Independence

MATH The Chain Rule Fall 2016 A vector function of a vector variable is a function F: R n R m. In practice, if x 1, x n is the input,

x 1. x n i + x 2 j (x 1, x 2, x 3 ) = x 1 j + x 3

Section 14.1 Vector Functions and Space Curves

4.1 Distance and Length

Linear Algebra and Robot Modeling

Vector Geometry. Chapter 5

MATH 32A: MIDTERM 1 REVIEW. 1. Vectors. v v = 1 22

n=0 ( 1)n /(n + 1) converges, but not

3 = arccos. A a and b are parallel, B a and b are perpendicular, C a and b are normalized, or D this is always true.

Chapter 7. Homogeneous equations with constant coefficients

Vector equations of lines in the plane and 3-space (uses vector addition & scalar multiplication).

Topic 14 Notes Jeremy Orloff

Vector Calculus. Lecture Notes

October 25, 2013 INNER PRODUCT SPACES

Worksheet 1.8: Geometry of Vector Derivatives

Chapter 8. Rigid transformations

SOLUTIONS TO SECOND PRACTICE EXAM Math 21a, Spring 2003

Euclidean Spaces. Euclidean Spaces. Chapter 10 -S&B

Distances in R 3. Last time we figured out the (parametric) equation of a line and the (scalar) equation of a plane:

Vectors and Coordinate Systems

REVIEW OF DIFFERENTIAL CALCULUS

1 Vectors and 3-Dimensional Geometry

Linear Algebra for Beginners Open Doors to Great Careers. Richard Han

1.1 Single Variable Calculus versus Multivariable Calculus Rectangular Coordinate Systems... 4

Week 4: Differentiation for Functions of Several Variables

Linear Algebra, Summer 2011, pt. 2

mathematical objects can be described via equations, functions, graphs, parameterization in R, R, and R.

Integrals. D. DeTurck. January 1, University of Pennsylvania. D. DeTurck Math A: Integrals 1 / 61

Notes on Green s Theorem Northwestern, Spring 2013

Solutions to Selected Questions from Denis Sevee s Vector Geometry. (Updated )

Solving Systems of Equations Row Reduction

Homework 2 Solutions

22m:033 Notes: 1.8 Linear Transformations

Multivariable Calculus Notes. Faraad Armwood. Fall: Chapter 1: Vectors, Dot Product, Cross Product, Planes, Cylindrical & Spherical Coordinates

Final Review Sheet. B = (1, 1 + 3x, 1 + x 2 ) then 2 + 3x + 6x 2

APPM 2350, Summer 2018: Exam 1 June 15, 2018

MATH 167: APPLIED LINEAR ALGEBRA Least-Squares

4 Partial Differentiation

Distance in the Plane

Topic 2-2: Derivatives of Vector Functions. Textbook: Section 13.2, 13.4

Solutions to Math 51 First Exam October 13, 2015

Math (P)Review Part II:

Chapter 1. Linear Equations

LB 220 Homework 4 Solutions

Introduction. Chapter Points, Vectors and Coordinate Systems

Introduction to Algebra: The First Week

Study guide for Exam 1. by William H. Meeks III October 26, 2012

Predicting the future with Newton s Second Law

Section Vector Functions and Space Curves

Span & Linear Independence (Pop Quiz)

Math 317 M1A, October 8th, 2010 page 1 of 7 Name:

MATH 12 CLASS 4 NOTES, SEP

Linear Programming and its Extensions Prof. Prabha Shrama Department of Mathematics and Statistics Indian Institute of Technology, Kanpur

2. Two Dimensional Kinematics

Electromagnetic Theory Prof. D. K. Ghosh Department of Physics Indian Institute of Technology, Bombay

Mechanics, Heat, Oscillations and Waves Prof. V. Balakrishnan Department of Physics Indian Institute of Technology, Madras

DEPARTMENT OF MATHEMATICS AND STATISTICS UNIVERSITY OF MASSACHUSETTS. MATH 233 SOME SOLUTIONS TO EXAM 1 Fall 2018

Linear Algebra (MATH ) Spring 2011 Final Exam Practice Problem Solutions

Notes on multivariable calculus

The Basics of Physics with Calculus Part II. AP Physics C

(arrows denote positive direction)

Section 1.8/1.9. Linear Transformations

Calculus Vector Principia Mathematica. Lynne Ryan Associate Professor Mathematics Blue Ridge Community College

Introduction to Vectors

Vector Spaces, Orthogonality, and Linear Least Squares

Homogeneous Linear Systems and Their General Solutions

Linear Algebra. Preliminary Lecture Notes

MAT 272 Test 1 Review. 1. Let P = (1,1) and Q = (2,3). Find the unit vector u that has the same

a11 a A = : a 21 a 22

Chapter 6. Second order differential equations

36 What is Linear Algebra?

MATH 22A: LINEAR ALGEBRA Chapter 4

Name Solutions Linear Algebra; Test 3. Throughout the test simplify all answers except where stated otherwise.

Introduction - Motivation. Many phenomena (physical, chemical, biological, etc.) are model by differential equations. f f(x + h) f(x) (x) = lim

Math 308 Midterm Answers and Comments July 18, Part A. Short answer questions

Linear Independence Reading: Lay 1.7

DIFFERENTIAL EQUATIONS

The Calculus of Vec- tors

22 Approximations - the method of least squares (1)

Exam 1 Review SOLUTIONS

Linear Algebra Exam 1 Spring 2007

Limits for parametric and polar curves

PYTHAGOREAN TRIPLES KEITH CONRAD

Linear Algebra. Preliminary Lecture Notes

Matrix-Vector Products and the Matrix Equation Ax = b

Making the grade: Part II

VANDERBILT UNIVERSITY. MATH 2300 MULTIVARIABLE CALCULUS Practice Test 1 Solutions

1 Review of the dot product

Implicit Differentiation Applying Implicit Differentiation Applying Implicit Differentiation Page [1 of 5]

Sequences and infinite series

Transcription:

Chapter of Calculus ++, Differential calculus with several variables Background material by Eric A Carlen Professor of Mathematics Georgia Tech Spring 6 c 6 by the author, all rights reserved -

Table of Contents Section : Lines and planes in IR The description of lines and planes in IR by equations - The parametric description of lines and planes in IR -5 Recovering equations from a parameterization -7 4 Lines intersecting planes - 5 Lines reflecting off planes -5 6 Distance problems -7 Section : The distance between two lines in IR An algebraic point of view - A differential geometric point of view - Comparing the two points of view -4 Section : Functions from IR to IR n Curves in IR n -6 Velocity and acceleration -8 -

Section : Lines and planes in IR We have two ways of describing lines and planes in IR : We can specify them in terms of equations, or in parametric form Both are useful for answering various geometric questions about lines and planes, and it is important to be able to pass back and forth between them : The description of lines and planes in IR by equations A plane in IR is the solution of a single linear equation of the form a x = d () where x = x y z If for example, a =, and d = 4, this becomes x + y + z = 4 () There is an even more geometric way to write () Let x be any particular point on the plane; ie, any particular solution of () Finding particular solutions is easy: Just set all but one variable equal to zero With y z =, () becomes x = 4, so one particular solution is x = 4 You can check that indeed, a x = d = 4 In general, if x is any particular solution of (), then () is equivalent to a (x x ) = () This is the normal vector and base point for of the equation for a plane Here is an illustration showing a patch of a plane You see the base point x and the normal vector a attached to x Also indicated are two points x and y lying in the plane You see that a and x x, the vector running from x to x, are orthogonal, and that this characterizes membership in the plane -

A line in R is the intersection of two planes, as illustrated below: Therefore, we can specify a line as the solution set of a system of two equations specifying planes; ie, two equations of the form (): a x = d a x = d (4) Introduce the matrix A whose first row is a, and whose second row is a ; ie, [ a A = Then by a fundamental formula for vector matrix multiplication, a and so if we introduce the vector d = equation Ax = [ d d [ a x a x,, we have that (4) is equivalent to the matrix Ax = d (5) For example, consider the case A = [ and d = [ (6) Then the matrix equation Ax = d is equivalent to the system of equations x + y + z = x + y + z = (7) -

Its solution set is clearly the line formed by the intersection of the two planes x + y + z = and x + y + z = (8) That summarizes the description in terms of equations What is the parametric form of description? : The parametric description of lines and planes in R The parametric description is what we get when we solve the equations Let s take the equations for a line using A and d from (6) To solve this equation, you [ [ row reduce [A d = which leads to The variable z is non pivotal, and from the second equation we see y + z =, or y = z The first equation then becomes x + ( z) + z =, or x = + z Plugging these results into x = x y, we get + z z = + z z z Defining x = and v =, we have that the solution set is exactly the set of points x(t) where t ranges over the real numbers, and x(t) = x + tv (9) Notice that we have changed the name of the paramter from z to t This is not necessary, but it signals a shift in the way we think about z as a parameter now instead of a variable Every line may be parameterized as in (9) for some vector x, called a base point and some non-zero vector v called a direction vector, and this is a parametric description of the line Here is an illustration of the scheme: As we have seen, one way to find the direction vector v given the system of equations for the line is to solve the system of equations There is an alternative involving the cross product Here is how it works for the line given by (7): -4

A direction vector of this line is some non-zero vector that is orthogonal to the normal vectors to both of the planes specified in (7) The normal vectors to these two planes are a = and a = One way to get a vector orthogonal to both of these vectors is to take their cross product; that is we can use v = a a as the direction vector Computing this we find v = a a = 4 8 4 To find a base point, set z = in (8), which then become the simple system x + y = x + y = This is easily solved, with the result that x =, and y = Hence we have the base point This gives us the parameterization x(t) = x = + t 4 8 = 4 4t + 8t This is not 4t the same parameterization we found before, but it does parameterize the same line The cross product approach is favored by many textbooks, although you still have to solve a system of equations to find the base point The linear algebra approach is probably simpler to apply In any case, keep in mind that finding a parameterization starting from the equations just means solving the equations To find the parametric description of a plane, we again just solve the equation For example, consider the plane given by x + y + z = 4 as above We do not need to bring in any matrices and row reduction here there would be just one row since there is just one equation Instead, we just solve for x in terms of y and z, and find x = 4 y z Plugging this into x = x y, we get z x = 4 y z y = 4 + y + z z -5

We could use y and z as the parameters, but let s change their names to s and t, so we have that the plane is parameterized by x(s, t) = x + sv + tv where x = 4 v = Here is an illustration of the scheme: and v = You see the vector x(/, /) = x +(/)v +(/)v, and you can see how any point in the plane can be reached by starting from x, moving in the direction of v to x + sv, for some s, and from there moving in the direction v to x + sv + tv for some t Two linearly independent direction vectors suffice to reach every point in the plane, and the values of s and t are uniquely determined Recovering equations from a parameterization Sometimes a parameterization is easy to find, and what would be most useful to us would be an equation For example, suppose we are given 4 points p = and we are asked: p = p = Do all of these points lie in the same plane? and p 4 = 4, () To answer this question, consider the plane determined by the first three points It is easy to parameterize this: Use p as the base point, and get the direction vectors from the -6

p p and p p Hence we define x = v = Then the plane is parameterized by and v = () x(s, t) = x + sv + tv = + s + t () s t To find the equation of the plane in the form a (x x ) =, all we need to do is to find the normal vector a, since we already have the base point x A valid choice for this is a = v v Indeed, since this vector is orthogonal to both v and v, we have a (x + sv + tv ) = a x + sa v + ta v = a x for all s and t So when x = x + sv + tv for some s and t, a (x x ) = We summarize: Given a plane in parametric form x + sv + tv, one can get an equation for this plane in the form a (x x ) = by using the same base point x and taking a = v v For example, consider the case in which x, v and v are given by () Then We also compute a = v v = = a x = x + y + z and a x = so the equation a (x x ) = written out in terms of x, y and z is x + y + z =, -7

or, what is the same thing, x + y + z = 6 () This is the equation for the plane passing through the first three points in the list () You can easily check that these points do satisfy the equation For example, with p, x =, y = and z = With these values of x, y and z, () is clearly satisfied We can now easily decide whether p 4 lies in the same plane as the first three points With x = 4, y = and z =, the equation () is satisfied, so it is in the plane That takes care of going from a parameterization to an equation for planes in R and shows one reason for doing so What about lines? Consider a line given in parametric form by x(t) = x + tv where x = We need to find two equations v = a (x x ) and a (x x ) for two planes that both contain this line The base point x is no problem; the base point x from the line will do We just have to find a and a The vectors a and a are orthogonal to their respective planes, and since the lines and the planes have the same base point x in common, these planes will contain the line if and only if a v = and a v = So all we need to do is to find a pair of linearly independent vectors a and a that are orthogonal to v, the direction vector of the line (If they are not linearly independent, the two planes would be the same, and their intersection would be plane, not a line) [ c We can use a trick from R to do this Recall that for any vector c = in R d, we get an orthogonal vector c by swapping the entries and putting in a minus sign: If c = [ c d, then c = [ d c, and as you easily check, c c =, so that c is orthogonal to c To apply this in R, we begin with an example Take v = a = a b -8 Let a be given by

We have chosen to zero out the third component of a components do not enter into the dot product so a v = [ a b [, This means that the third and we are left with a dot product in R We know how to make this zero: Choose [ a = b [ = [ Therefore, we have a = To find a, we apply the same procedure, zeroing out a different entry this time If we choose to zero out the first entry, so that a = [ [ a a, we get a v = Hence b b we take a = b =, and have a = Now that we have found a and a, we can write down the system of equations for the line in the standard algebraic form Computing, we find Also, Hence the system of equations for the line is a x = and a x = a x = x + y and a x = y + z x + y = y + z = (4) There is an alternative method, based on the cross product, for finding a pair of vectors a and a that are orthogonal to v, and linearly independent too Here is how it goes: First pick any vector w that is not a multiple of v The standard choice is w = since if v is a multiple of this vector, everything is obvious anyway Now choose a = w v -9

and then a = a v This works because, by the properties of the cross product, w v is a non-zero vector that is orthogonal to both w and v In particular, it is orthogonal to v, which is what we are after Again by the properties of the cross product a v is a non-zero vector that is orthogonal to both a and v In particular, it is orthogonal to v, which is what we are after So you can also find a and a by computing two cross products For example, with v = and w =, we get a = w v = for a and a, we get the system of equations and a = a v = y + z = x + y z = With these choices This is a different system than we found before, but both systems are equivalent; ie, they have the same line as their solution set 4 Lines intersecting planes The material that we have just covered provides the means to answer a wide range of geometric problems Here is an example: Consider the plane that passes through three points, p, p and p, not all on the same line Consider also a line l that passes through two distinct points x and x Unless the line happens to be parallel to the plane, it will intersect the plane in exactly one point Here is an illustration: We now ask: how do we find the intersection point? -

Let s solve this problem in a concrete example Let p, p and p be given by (), and let x and x be given by x = x = There are several ways to solve our problem; ie, find the point of intersection First solution: In this solution, we use the equation for the plane, and the parametric form of the line We have already computed the equation of the plane above; it is given by (), namely x + y + z = 6 (5) To parameterize the line, use v = x x as the direction vector Calculating, we find v = Therefore x(t) = x + tv = + t + t (6) t Plugging x(t), y(t) and z(t) into (5), we get ( + t) + ( + t) + ( t) = 6 or t = Therefore, x() is in the plane Computing, This is the point we seek x() = 4 Second solution: In this solution, we use the equation for the plane, and the system of equations for the line We have already worked out the equation of the plane in () Just above, we found the parametric form of the line It is the one we used in our example of how to recover the system of equations of a line from the parametric form, hence we know that the system of equations for the line is given by (4) The point we are seeking satisifes both () and (4), so it satisfies the combined system x + y + z = 6 x + y = y + z = - (7)

The corresponding augmented matrix is 6 which row reduces to 6 Now back substitution gives us z =, then y +z =, so y = 4, and then x+y +z = 6, so x = Again we find the point 4 The second solution was efficient once we found the system of equations for the line To do this, we had to find the parameterization first When a line is specified by giving two points on it, a parameterization is easy to write down, but finding the equations takes some work This is also true when we a plane is specified by giving three points on it hence there is some advantage to being able to work entirely with parameterizations We do this in out third solution, which has other advantages as well Third solution: In this solution, we use the parametric descriptions of the plane and the line As we have seen in (), the parametric description of the plane is given by x(s, t) = x + sv + tv where x = v = and v = Let z(u) be a parameterization of the line (The symbols x and t are already taken, so we use z and u to avoid confusion) As we have seen in (6), this is z(u) = + u + u u We are looking for values of s, t, and u so that x(s, t) = z(u) (8) If we find them, we have a point that is on both the plane and the line -

Let s put the terms involving s and t on the left, and everything else on the right This gives us the equation u sv + tv = z(u) x = u (9) u To write this in matrix form, introduce Then since A A = [v, v = [ s = sv t + tv, we have that (9) is equivalent to Now let s solve this for A [ u s = u () t u [ s The corresponding augmented matrix is t u u u This row reduces to u u () u We see that there is a pivot in the final column unless u =, and hence there is a solution only for u = This means that the line crosses the plane at z(), and we find z() = 4 This is the point we seek We can now go further and find the s and t values Here is why you might want to do this Suppose we want to know whether the line hits the triangle with vertices at the points p, p, and p We will use the following fact, which is easily understood from a picture: -

The set of points lying on the triangle with vertices at any three non colinear points p, p, and p in R are given by p + s(p p ) + t(p p ) where s, t and s + t () Since we have chosen x = p, v = (p p ) and v = (p p ) in our parameteri- zation, we have that the point of intersection, 4, lies in the triangle if and only if its parameters s and t satsify () [ s To check this we go on and solve A = 4 We can use our previous work and t just plug u = into () getting From here we see t = and s = The conditions () are not satisfied, and the line passes though the plane outside the triangle 5 Lines reflecting off planes Let l be a line with direction vector v We know how to find the point at which this line will hit any plane that is not parallel to the line If the line describes the path of an incoming light ray, and the plane is a mirror, the light ray will be reflected, and another line will describe the path of the reflected ray How can we find the second line corresponding to the reflected ray? The first step is to find the point x where the incoming line crosses the plane We know how to do this Next, we need to find the direction vector of the outgoing line Here is how to do this: Let a (x x ) be the equation of the plane Form the unit vector u = a a We now make the following claim, which will be applied with this unit vector u, and the direction vector v of the line: Let u be any given unit vector in R Any vector v in R can be uniquely decomposed into a sum v = v + v where v is parallel to u, and v is orthogonal to u In fact, we have the formulas v = (u v)u and v = v (u v)u () -4

Proof: To justify this, notice that if v and v are defined by (), then clearly v = v + v Also, v is defined to be a multiple of u, so certainly it is parallel to u Next, to check that v is orthogonal to u, compute its dot product with u Since u is a unit vector, u u =, and so v u = (v (u v)u) u = v u (u v)u u = v u (u v) = Finally, suppose that v = ṽ + ṽ is another such decomposition Then v + v = ṽ + ṽ so that ṽ v = v ṽ The vector on the left is parallel to u, and the vector on the right is orthogonal to u, so both vectors are orthogonal to themselves But only the zero vector is orthogonal to itself, so ṽ v = and v ṽ = This shows that ṽ = v and v = ṽ, so the fomula () gives the unique decomposition of this type To apply this, let v be the direction vector of the incoming line Let u be a unit vector that is orthogonal to the plane; ie, one of ± a a, and decompose v as v + v When the ray of light reflects off the plane, the component that is orthogonal to the plane changes sign This is the component that is parallel to u, so the new direction vector w is From the formulas for v and v, we have w = v v w = v (u v)u (u v)u = v (u v)u (4) Here is an example: Consider the plane that passes through p, p and p where these vectors are given by () Consider the line that passes through x and x where x = x = Find the parametric description of the line obtained by reflecting this line off this plane -5

In the previous section we have already computed that the point of intersection of the line and the plane is 4, which we take as our base point x We have also found that the equation of the plane is x + y + z = 6, so the vector a is a =, and so we have the unit vector u = direction vector of the line is v = This is all we need u v = / Therefore Finally, we found that the To work out the reflected direction vector, we first work out w = v (u v)u = The reflected line then is x + uw = 6 Distance problems 4 + u 5 = 5 The orthogonal decomposition (4) is also the key to distance problems For example, let p be any point in R, and consider any plane given by an equation a (x x ) = Then the vector p x runs from the base point x in the plane to p, but in general its length is greater than the distance between p and the plane To find the distance, let u be either unit vector orthogonal to the plane, and decompose Here is an illustration: (p x ) = (p x ) + (p x ) -6

Only the parallel component is relevant for computing the distance from the plane to p, as you see above Hence, the distance equals the length of parallel component (p x ) Let s compute the distance from p = to the plane given by x + y + z = We get a particular solution of this equation by taking y = z = and then x =, so we can take x = Then p x = The normal vector to the plane is a =, so we have the unit vector u = 6 Then, using (4), we compute (p x ) u = / 6, so that (p x ) = u = 6 Hence, (p x ) = / 6, and this is the distance from p to the plane Computing the distance from a point to a line is similar: Let l be the line given in parametric form by x + tv, and let p be any given point in R Then the vector p x runs from the base point x on the line to p, but in general its length is greater than the distance between p and the line To find the distance, let u be the unit vector parallel to v, ie, u = v v, and decompose (p x ) = (p x ) + (p x ) -7

This time, the parallel component corresponds to motion along the the line, which has nothing to do with the distance from the line, and it is the length of the orthogonal component, (p x ) For example, consider the line given by x(t) = x + tv where Let p = x = Then we have p x = and v =, and u = v v = (p x ) u = / Hence, from the formula () we have We compute (p x ) = = 4 5 The length of this vector is 4/, and so that is the distance from p to l Problems Consider the plane passing through the three points p = [ p = [ and p = [ and the line passing through z = [ and z = [ (a) Find a parametric representation x(s, t) = x + sv + tv for the plane (b) Find a parametric representation z(u) = z + uw for the line (c) Find an equation for the plane (d) Find a system of equations for the line (e) Find the points, if any, where the line intersects the plane (f) Does the line pass through the triangle with vertices p, p and p? computation (g) Find the distance from p to the line (h) Find the distance from z to the plane Consider the plane passing through the three points p = [ p = [ -8 and p = [ Justify your answer with a

and the line passing through z = [ 4 and z = [ (a) Find a parametric representation x(s, t) = x + sv + tv for the plane (b) Find a parametric representation z(u) = z + uw for the line (c) Find an equation for the plane (d) Find a system of equations for the line (e) Find the points, if any, where the line intersects the plane (f) Does the line pass through the triangle with vertices p, p and p? computation (g) Find the distance from p to the line (h) Find the distance from z to the plane Consider the plane passing through the three points p = [ p = [ 5 and p = [ 4 Justify your answer with a and the line passing through z = [ and z = [ (a) Find a parametric representation x(s, t) = x + sv + tv for the plane (b) Find a parametric representation z(u) = z + uw for the line (c) Find an equation for the plane (d) Find a system of equations for the line (e) Find the points, if any, where the line intersects the plane (f) Does the line pass through the triangle with vertices p, p and p? computation (g) Find the distance from p to the line (h) Find the distance from z to the plane 4 Consider the plane given by Let p = x y + z = 4 [ What is the distance from p to the plane? 5 Consider the plane given by x y + z = [ Let p = 5 What is the distance from p to the plane? 6 Consider the line given by x y + z = 4 Let p = x + y + z = [ What is the distance from p to the line? Justify your answer with a -9

7 Consider the line given by x y + z = y + z = [ Let p = 5 What is the distance from p to the line? 8 Consider the line l given by x y + z = 4 x + y + z = Find a parametric representation of the line obtained by reflecting this line through the plane x + y z = 9 Consider the line l given by x y + z = y + z = Find a parametric representation of the line obtained by reflecting this line through the plane x + y z = [ Consider the vector v = 4 to v Find two linearly independent vectors a and a that are orthogonal [ Consider the vector v = Find two linearly independent vectors a and a that are orthogonal to v -

Section : The distance between two lines in IR : An algebraic point of view Consider two lines in IR given parametrically by x (s) = x +sv and x (t) = x +tv The distance between the points x (s) and x (t) is x (s) x (t) As s and t vary, so does the distance been the points The distance between the two lines is, by definition, the minimum value of x (s) x (t) as s and t range independently over the real numbers Computing this distance is a least squares problem, as we now explain First of all, we will seek the values of s and t that minimize the squared distance x (s) x (t) If we can find s and t so that x (s ) x (t ) x (s) x (t) for all s, t, () then taking square roots of both sides we have x (s ) x (t ) x (s) x (t) for all s, t, () since the square root function is monotone increasing Hence x (s ) x (t ) will be the distance between the lines Now note that x (s) x (t) = sv tv (x x ) [ s If we define A to be the matrix A = [v, v, and b = x x, and finally s =, we t can rewrite this as Therefore, and so if s = [ s t x (s) x (t) = As b x (s) x (t) = As b, is the least squares solution of As = b, then () holds, and the distance between the two lines is As b Example (The distance between two lines) Let x, x, v and v be given by x = [ x = [ v = [ and v = [ Then The normal equations are A = [ and b = [ A t As = A t b -

We compute A t A = [ 9, so that (A t A) = 9 [ [ We then compute A 9 t b =, so that [ s = [ [ = [ t 9 9 Hence s = /, and t = We now compute Hence x (/) x () = [ 4 4 [ x (/) = 5 5 4 and so and x () = [ x (/) x () = 6 = This is the distance between the two lines; for no values of s and t is x (s) x (t) any smaller than this, the value at s = / and t = : A differential calculus point of view Define a function f of two variables s and t by f(s, t) = x (s) x (t) This represents the square of the distance between x (s) and x (t) Finding values of s and t that make f(s, t) as small as possible amounts to computing the square of the distance between the two lines, and hence doing so determines the distance Example (A two variable distance function) Let x, x, v and v be given as in Example Then f(s, t) = (s + t ) + (s) + (s + t + ) = 9s + t + 6st 6s t + 5 In the last subsection, we saw how to compute values s and t so that () holds, or, what is the same, so that Now, fix s = s, and define a function g(t) by From the definition and (), f(s, t ) f(s, t) for all s, t () g(t) = f(s, t) g(t ) g(t) for all t This means that g has a minimum at t = t, and from the calculus of functions of a single variable, we know that this means that the derivative of g is zero at t = t That is: g (t ) = (4) -

In the same way, fix t = t, and define a function h(s) by h(s) = f(s, t ) From the definition and (), h(s ) h(s) for all s Hence h has a minimum at s = s, and so h (s ) = (5) The two equations (4) and (5) can be solved to determine the two unknowns s and t Notice that both equations involve both s and t through the definitions of g and h, so that (4) and (5) forms a system of equations Notice the strategy: We are going to determine the minimum of a function of two variables by considering one variable at a time, thus placing us on the familiar ground of single variable calculus Example Let f(s, t) be the function determined in Example : f(s, t) = 9s + t + 6st 6s t + 5 Then g(t) = t + [6s t + [9s 6s + 5 where we have group the terms by powers of t Differentiating with respect to the variable t, g (t) = 4t + [6s, and so (4) becomes 4t + 6s = (6) Likewise, h(s) = 9s + [6t 6s + [t t + 5 Differentiating with respect to the variable s, h (s) = 8s + [6t 6, and so (4) becomes 8s + 6t = 6 (7) Dividing through by and 6 respectively, (6) and (7) give us the system s + t = s + t = Subtracting the second [ equation [ from the first, we find t =, and then we see s = / s / minimum occurs at =, as we found before t Hence the : Comparing the two points of view For computing the distance between to lines, the purely algebraic approach worked out in Example is very efficient However, the approach worked out in Example, in which we apply the methods of differential calculus one variable at a time has a significant advantage: It is easy to generalize this method to seek minimum and maximum values of a rich variety of functions of several variables If a function involves third or higher powers -

of its variables, let alone transcendental functions of them, one cannot hope to write the minimization problem as a least squares problem But one can still hope to apply the methods of differential calculus one variable at a time Does this mean that the algebraic point of view is superseded by the differential calculus point of view? Not at all! As we shall see, the most powerful way to solve many problems is use both points of view together The next chapter introduces certain vectors and matrices, namely gradients, Hessians and Jacobians, that are defined in terms of derivatives Algebraic analysis of these objects will be the path of progress in much of our work Our first order of business in the next chapter though is to see how far we can go with the one variable at a time strategy* Problems Consider two lines in IR given parametrically by x (s) = x + sv and x (t) = x + tv where x = [ x = [ v = [ and v = [ Compute the distance between these two lines Consider two lines in IR given parametrically by x (s) = x + sv and x (t) = x + tv where x = [ x = [ v = [ and v = [ Compute the distance between these two lines Consider two lines in IR given parametrically by x (s) = x + sv and x (t) = x + tv where x = [ x = [ 5 v = [ 4 and v = [ Compute the distance between these two lines 4 Consider two lines in IR given parametrically by x (s) = x + sv and x (t) = x + tv where x = [ x = [ v = [ 5 and v = [ Compute the distance between these two lines * This is a good place to emphasize that to master this subject, one must approach it with the goal of learning problem solving strategies and not just formulas The formulas are important, but still they are only the foot soldiers needed to advance the strategies, and there are many ways to employ them Choosing one that succeeds is a strategic decision, and unless one develops strategic vision, the largest army of formulas will be of no avail -4

Section : Functions from IR to IR n Curves in IR n In many ways, the simplest sort of multivarible functions are functions from IR to IR n for some n These are functions that have one input variable (one independent variable), and n output variables (n dependent variables) If n = or if n =, we can think of the output variables a giving the coordinates of a point moving in IR or IR, and we can think of the input variables as the time so that the function gives us the location of a moving point at time t Such functions describe curves in IR or IR For this reason, it is natural to use x for the dependent variables, and t for the independent variable, and to write the functions as x(t) Such functions are also called vector valued functions of a real variable, for obvious reasons For example, consider a function x(t) of the real variable t with values in IR n For example, let s consider n =, and x(t) = cos(t) sin(t) /t () Here is a three dimensional plot of the curve traced out by x(t) as t varies from t = to t = : 8 6 4 5 5 Such vector valued functions arise whenever we need to describe the position of a particle as a function of time But more generally, we might have any sort of system that is described by n parameters These could be, for example, the voltages across n points in an electric circuit We can arrange this data into a vector, and if the data is varying with time, as is often the case in applications, we then have a time dependent vector x(t) in IR n When quantities are varying in time, it is often useful to consider their rates of change; ie, derivatives Definition (Derivatives of Vector Valued Functions) Let x(t) be a vector valued function of the variable t We say that x(t) is differentiable at t = t with derivative x (t ) in case lim h h (x(t + h) x(t )) = x (t ) 5-5 5

in the sense that this limit exists for each of the n entries separately A vector valued function is differentiable in some interval (a, b) if it is differentiable for each t in (a, b) There is nothing really new going on here To compute the derivative of x(t), you just differentiate it entry by entry in the usual way [ x(t) Indeed, consider a t dependent vector x(t) = in IR Then, by the rules for y(t) vector subtraction and scalar multiplication, h (x(t + h) x(t)) = ([ [ ) x(t + h) x(t) h y(t + h) y(t) [ (x(t + h) x(t))/h = (y(t + h) y(t))/h Now taking the limits on the right, entry by entry, we see that x (t) = [ x (t) y provided (t) x(t) and y(t) are both differentiable The same reduction to single variable differentiation clearly extends to any number of entries Example (Computing the derivative of a vector valued function of t) Let x(t) be given by () Then for any t, [ sin(t) x (t) = cos(t) /t Because we just differentiate vectors entry by entry without mixing the entries up in any way, familiar rules for differentiating numerically valued functions hold for vector valued functions as well In particular, the derivative of a sum is still the sum of the derivatives, etc: (x(t) + y(t)) = x (t) + y (t) () Things are only slightly more complicated with the product rule because now we have several types of products to consider Here is an example that we shall need soon: a product rule for the dot product Theorem (Differentiating Dot Products) Suppose that v(t) and w(t) are differentiable vector valued functions for t in (a, b) with values in R n Then v(t) w(t) is differentiable for t in (a, b), and d dt v(t) w(t) = v (t) w(t) + v(t) w (t) () Proof: For each i, we have by the usual product rule Summing on i now gives us () d dt v i(t)w i (t) = v i(t)w i (t) + v i (t)w i(t) -6

Velocity and acceleration Let x(t) represent the position of a point particle at time t The first time derivative of position is called the velocity This gives you the rate of change of the position The second time derivative of the position is called the acceleration It gives you the rate of change of the velocity vector Consider x(t), the x coordinate of x(t) This is a garden variety real valued function of a single real variable By Taylor s Theorem, given any t, x(t) = x(t ) + x (t )(t t ) + x (y )(t t ) + O((t t ) ) (4) Doing the same for the other coordinates, and combining the results in vector form, we have x (t) = x(t ) + (t t )x (t ) + (t t ) x (t ) + O((t t ) ) (5) In the vector form, we have written the scalar multiples in front of the vectors, as we usually do You see here what the velocity tells you: If h = t t is small, then where you will be at time t is in leading order of approximation x(t ) + hv(t ) The velocity tells you what increment in your position corresponds to a small increment h in time, namely hv(t ) The straight line function z(t) given by z(t) = x(t ) + (t t )v(t ) is called the tangent line to x(t) at t It gives the best linear approximation to the curve x(t) at t : x (t) = x(t ) + (t t )v(t ) + O((t t ) (6) Example (Taylor approximation for a curve) Let x(t) be given by x(t) = for all t >, and Taking t =, x() = [ / / / x (t) = x (t) = x (t) = [ [ / t / t / t / [ / and x (t) = [ [ / t / t / / t / Then Therefore, when t, x(t) [ / / / + (t ) [ / (t ) + [ / -7

We get the tangent line by just keeping the linear term in this approximation Hence the tangent line at t = is given in parametric form by z(t) = [ / / / + (t ) [ / Here is a plot showing the curve for t, together with the tangent line at t = tangency Observe the Notice that for a vector valued function z of the form z = x + tv, z (t) = for all t The acceleration is zero for such a linear path This is characteristic of linear paths Indeed, if x (t) =, then x(t) = x(t ) + (t t )x(t ) Taylor expansion stops exactly after the linear term The error term in (6) depends on the second derivatives, but if x =, there is no error term In general though, the velocity is not constant, and (5) gives a better approximation than (4) One could go on and consider higher derivatives However, it is rarely useful to do so* The reason is that Newton s second law relates the acceleration of point particle to the forces acting on it Once the physical nature of the force is determined, the accelaration is determined, and the motion can be determined The acceleration and velocity are both vector quantities Henceforth, we will write v(t) in place of x (t) to denote the velocity, and write a(t) in place of x (t) to denote the acceleration * The third derivative x does have a name; it is called the jerk, at least in some references -8

The magnitude of the velocity vector is called the speed We denote it by v(t) That is, As you see from (6), when h is small, v(t) = v(t) v(t) x(t + h) x(t) h so that v(t) is measuring the rate of increase in distance travelled This is what we mean by speed Assuming that v(t), we can define a unit vector valued function T(t) by T(t) = v(t) (7) v(t) Then clearly v(t) = v(t)t(t) (8) The vector T(t) is called the unit tangent vector at time t It gives the instantaneous direction of motion at time t This factorization of the velocity vector into a unit vector giving the direction of motion, and a scalar multiple giving the speed of motion in that direction provides a very good way to think about the motion of point particles, as we shall explain Example (Speed and the unit tangent vector) Let x(t) be given by x(t) = the previous example Then, as we have seen, for all t >, x (t) = that v(t) = + t + t = + t, [ / t / t [ t / t / / t / as in We then easily compute and so T(t) = + t [ / t / t Problems Let x(t) = r (a) Compute v(t) and a(t) (b) Compute v(t) and T(t) [ cos(t) where r > This is a parameterization of the unit circle sin(t) (c) Find the tangent line to this curve at t = π/4 Also find the tangent line at t = π/ Where do these lines intersect? [ t + Let y(t) = t This is a parameterization of the parabola y = (x ) (The reason we are using y instead of x will become clear in problem 5) -9

(a) Compute v(t) = y (t) and a(t) = y (t) (b) Compute v(t) and T(t) (c) Find the tangent line to this curve at t = [ t Let x(t) = t /t (a) Compute v(t) and a(t) (b) Compute v(t) and T(t) (c) Find the tangent line to this curve at t = Also find the tangent line at t = pi/ Do these lines intersect? [ t 4 Let y(t) = / t t (a) Compute v(t) = y (t) and a(t) = y (t) (b) Compute v(t) and T(t) (The reason we are using y instead of x will become clear in problem 5) (c) Find the tangent line to this curve at t = 5 Let x(t) be the curve from problem, and let y(t) be the curve from problem (a) Compute f(t) = x(t) y(t), and then compute f (t) by direct differentiation (b) Compute x (t) y(t) + x(t) y (t) using your results from problems and, and then check that indeed (x(t) y(t)) = x (t) y(t) + x(t) y (t) 6 Let x(t) be the curve from problem, and let y(t) be the curve from problem 4 (a) Compute f(t) = x(t) y(t), and then compute f (t) by direct differentiation (b) Compute x (t) y(t) + x(t) y (t) using your results from problems and 4, and then check that indeed (x(t) y(t)) = x (t) y(t) + x(t) y (t) -