Functional differentiation

Similar documents
Robotics. Islam S. M. Khalil. November 15, German University in Cairo

Mechanical Systems II. Method of Lagrange Multipliers

2.3 Calculus of variations

Principles of Optimal Control Spring 2008

e (x y)2 /4kt φ(y) dy, for t > 0. (4)

THEODORE VORONOV DIFFERENTIAL GEOMETRY. Spring 2009

Feynman Path Integrals in Quantum Mechanics

Chapter 4. Adjoint-state methods

The Path Integral Formulation of Quantum Mechanics

MATH. 4548, Autumn 15, MWF 12:40 p.m. QUIZ 1 September 4, 2015 PRINT NAME A. Derdzinski Show all work. No calculators. The problem is worth 10 points.

Lecture 3 Dynamics 29

Classical Mechanics in Hamiltonian Form

MODULE 12. Topics: Ordinary differential equations

Ostrogradsky theorem (from 1850)

LECTURE NOTES IN CALCULUS OF VARIATIONS AND OPTIMAL CONTROL MSc in Systems and Control. Dr George Halikias

First Variation of a Functional

Week 4: Differentiation for Functions of Several Variables

Optimisation and optimal control 1: Tutorial solutions (calculus of variations) t 3

Math 342 Partial Differential Equations «Viktor Grigoryan

Numerical Analysis: Interpolation Part 1

Calculus of Variations

δ Substituting into the differential equation gives: x i+1 x i δ f(t i,x i ) (3)

There is a more global concept that is related to this circle of ideas that we discuss somewhat informally. Namely, a region R R n with a (smooth)

Chapter 2 The Euler Lagrange Equations and Noether s Theorem

How to Use Calculus Like a Physicist

Maxima and Minima. (a, b) of R if

Pontryagin s Minimum Principle 1

Sketchy Notes on Lagrangian and Hamiltonian Mechanics

Copyrighted Material. L(t, x(t), u(t))dt + K(t f, x f ) (1.2) where L and K are given functions (running cost and terminal cost, respectively),

CHAPTER 2 THE MAXIMUM PRINCIPLE: CONTINUOUS TIME. Chapter2 p. 1/67

Numerical Optimization

Quantum Field Theory Notes. Ryan D. Reece

Path Integral Quantization of the Electromagnetic Field Coupled to A Spinor

Summary of Fourier Transform Properties

Second Order Optimality Conditions for Constrained Nonlinear Programming

A Short Essay on Variational Calculus

MATH 56A SPRING 2008 STOCHASTIC PROCESSES 197

Definition 5.1. A vector field v on a manifold M is map M T M such that for all x M, v(x) T x M.

Fundamentals Physics

Course Summary Math 211

Path Integrals and Quantum Mechanics

Vectors. January 13, 2013

Gaussian integrals and Feynman diagrams. February 28

Vector and Tensor Calculus

THEODORE VORONOV DIFFERENTIABLE MANIFOLDS. Fall Last updated: November 26, (Under construction.)

Math General Topology Fall 2012 Homework 11 Solutions

Special classical solutions: Solitons

Variational Methods & Optimal Control

G : Statistical Mechanics

1 Quantum fields in Minkowski spacetime

On the Euler Lagrange Equation in Calculus of Variations

221A Lecture Notes Path Integral

1 Infinite-Dimensional Vector Spaces

Legendre Transforms, Calculus of Varations, and Mechanics Principles

6 Lecture 6b: the Euler Maclaurin formula

Math 207 Honors Calculus III Final Exam Solutions

1.2 Derivation. d p f = d p f(x(p)) = x fd p x (= f x x p ). (1) Second, g x x p + g p = 0. d p f = f x g 1. The expression f x gx

Chapter 1. Distributions

Classical mechanics of particles and fields

Frequency Response (III) Lecture 26:

for changing independent variables. Most simply for a function f(x) the Legendre transformation f(x) B(s) takes the form B(s) = xs f(x) with s = df

Existence and Uniqueness

Maxwell s equations for electrostatics

CHAPTER II MATHEMATICAL BACKGROUND OF THE BOUNDARY ELEMENT METHOD

Optimal Control. Macroeconomics II SMU. Ömer Özak (SMU) Economic Growth Macroeconomics II 1 / 112

NOTES ON CALCULUS OF VARIATIONS. September 13, 2012

Tangent spaces, normals and extrema

Number-Flux Vector and Stress-Energy Tensor

B = 0. E = 1 c. E = 4πρ

16. Local theory of regular singular points and applications

On Cauchy s theorem and Green s theorem

Optimal Control. with. Aerospace Applications. James M. Longuski. Jose J. Guzman. John E. Prussing

Lecture 4. Magic Numbers

Gauge Fixing and Constrained Dynamics in Numerical Relativity

Path integrals and the classical approximation 1 D. E. Soper 2 University of Oregon 14 November 2011

Multiple integrals: Sufficient conditions for a local minimum, Jacobi and Weierstrass-type conditions

Übungen zur Elektrodynamik (T3)

Calculus of Variations

1 Mathematical preliminaries

2015 Math Camp Calculus Exam Solution

Math 600 Day 1: Review of advanced Calculus

M311 Functions of Several Variables. CHAPTER 1. Continuity CHAPTER 2. The Bolzano Weierstrass Theorem and Compact Sets CHAPTER 3.

3 Modeling and Simulation

Degree Master of Science in Mathematical Modelling and Scientific Computing Mathematical Methods I Thursday, 12th January 2012, 9:30 a.m.- 11:30 a.m.

MATH 819 FALL We considered solutions of this equation on the domain Ū, where

Division of the Humanities and Social Sciences. Supergradients. KC Border Fall 2001 v ::15.45

Vector Spaces. Vector space, ν, over the field of complex numbers, C, is a set of elements a, b,..., satisfying the following axioms.

Math The Laplacian. 1 Green s Identities, Fundamental Solution

Math 342 Partial Differential Equations «Viktor Grigoryan

NOTES ON DIFFERENTIAL FORMS. PART 3: TENSORS

Introduction to Path Integrals

Stochastic Differential Equations

2. The Schrödinger equation for one-particle problems. 5. Atoms and the periodic table of chemical elements

h(x) lim H(x) = lim Since h is nondecreasing then h(x) 0 for all x, and if h is discontinuous at a point x then H(x) > 0. Denote

FEM Convergence for PDEs with Point Sources in 2-D and 3-D

CH.11. VARIATIONAL PRINCIPLES. Multimedia Course on Continuum Mechanics

HOMEWORK 7 SOLUTIONS

Finite-dimensional spaces. C n is the space of n-tuples x = (x 1,..., x n ) of complex numbers. It is a Hilbert space with the inner product

Continuum Probability and Sets of Measure Zero

Logistic Map, Euler & Runge-Kutta Method and Lotka-Volterra Equations

Transcription:

Functional differentiation March 22, 2016 1 Functions vs. functionals What distinguishes a functional such as the action S [x (t] from a function f (x (t, is that f (x (t is a number for each value of t, whereas the value of S [x (t] cannot be computed without knowing the entire function x (t. Thus, functionals are nonlocal. If we think of functions and functionals as maps, a compound function is the composition of two maps giving a third map f : R R x : R R f x : R R A functional, by contrast, maps an entire function space into R, S : F R F {f (x x : R R} In this section we develop the functional derivative, that is, the generalization of differentiation to functionals. 2 Intuitions 2.1 Analogy with the derivative of a function We would like the functional derivative to formalize finding the extremum of an action integral, so it makes sense to review the variation of an action. The usual argument is that we replace x(t by x(t + h(t in the functional S[x(t], then demand that to first order in h(t, δs S[x + h] S[x] 0 Now, suppose S is given by S[x(t] L(x(t, ẋ(tdt Then replacing x by x + h, subtracting S and dropping all higher order terms gives δs L(x + h, ẋ + ḣdt L(x, ẋdt ( L(x, ẋ + ḣ dt L(x, ẋdt ( d dt h + 1 h(t dt

( 0 δs d dt h(t dt and since h (t is arbitrary, we conclude that the Euler-Lagrange equation must hold at each point, d 0 dt By analogy with ordinary differentiation, we would like to replace this statement by the demand that the extrema are given by the vanishing of the first functional derivative of S[x], δs[x(t] δx(t This means that the functional derivative of S should be the Euler-Lagrange expression. It is tempting to define δs[x(t] δx(t 0 lim h 0 S[x + h] S[x] h (t but even the ratio S[x+h] S[x] h(t is not well-defined if h (t has any zeros. What does work is to convert the problem to an ordinary differentiation. 2.2 The Dirac delta Suppose S is given by S[x(t] L(x(t, ẋ(tdt Then replacing x by x + h and subtracting S gives δs L(x + h, ẋ + ḣdt L(x, ẋdt ( L(x, ẋ + ( d dt h + h(t dt ḣ dt L(x, ẋdt Setting δx h(t we may write this as ( δs Now write or simply δs d dt δx(t dt δs ( ( δx(t δx(t d δx(t dt δx(t dt δx(t ( δs δx(t d δx(t dt δx(t dt (1 We might write this much by just using the chain rule. What we need is to evaluate the basic functional derivative, δx(t δx(t To see what this might be, consider the analogous derivative for a countable number of degrees of freedom. Beginning with q j q i δj i 2

we notice that when we sum over the i index holding j 0 fixed, we have i q j0 q i i δ j0 i 1 since i j 0 for only one value of i. We need the continuous version of this relationship. The sum over independent coordinates becomes an integral, i dt, so we demand δx(t δx(t dt 1 This will be true provided we use a Dirac delta function for the derivative: δx(t δx(t δ(t t Substituting this expression into eq.(1 gives the desired result for δs δx(t ( δs δx(t : d dt δ(t t dt d dt Notice how the Dirac delta function enters this calculation. When finding the extrema of S as before, we reach a point where we demand ( 0 δs d h(t dt dt for every function h(t. To complete the argument, suppose there is a time t 0 at which L(x,ẋ d L(x,ẋ dt is nonzero. For concreteness, let L(x,ẋ d L(x,ẋ dt > 0. Then by continuity there must be a neighborhood d L(x,ẋ dt t0 of t 0 (t 1, t 2 on which L(x,ẋ > 0. Choose any function h (t which is positive on (t 1, t 2 and which vanishes outside this interval. Then we have a contradiction, since the integral must be positive. An identical argument holds if the integrand is negative, so we must have d dt 0 t0 Since the point t 0 is arbitrary, we conclude that the expression vanishes everywhere. This argument works fine if we only wish to find the extremum of the action. However if we can develop a formalism such that we get a Dirac delta function, δx (t δx (t δ (t t then the functional derivative will exist for all curves. We now turn to a formal definition. 3 Formal definition of the functional derivative 3.1 Test functions and distributions We begin with some definitions. A test function is a smooth function which vanishes outside of a compact region. 3

A distribution is a limit of test functions. There is an alternative definition as a linear functional taking as set of test functions to the reals. I find the idea of a limit more intuitive, though the definition as functionals has calculational advantages. For example, the Dirac delta function is such a functional, for if f (x is any test function, δ (x x 0 : f (x f (x 0 R That is, for any test function f, the delta function maps f to the real number f (x 0. As a limit of smooth functions, we may define [ ] 1 δ (x x 0 lim (x x σ 0 2πσ 2 e 0 2 2σ 2 This is only one of many representations for δ (x x 0. In the end we only use the linear functional properties. It is straightforward to show that for any test function f, 1 f (x e (x x 0 2 2σ 2 dx f (x 0 2πσ 2 lim σ 0 This is written as f (x δ (x x 0 dx f (x 0 which gives us an explicit calculational form for the linear functional. 3.2 The functional derivative Given a functional of the form f[x(t] g (x (t, ẋ (t,... dt we consider a sequence of 1-paramater variations of f given by replacing x(t by x n (ε, t x(t + εh n (t, t where each h n is a test function, and the sequence of functions h n defines a distribution, lim h n (t, t δ (t t Since we may vary the path by any function h(x, each of these functions εh n is an allowed variation. Then the functional derivative is defined by δf[x(t] d lim δx(t dε f[x n(ε, t ] (2 ε0 The derivative with respect to ε accomplishes the usual variation of the action by using the chain rule. Taking a derivative and setting ε 0 is just a clever way to select the part of the variation linear in ε. Then we take the limit of a carefully chosen sequence of variations h n to extract the variational coefficient from the integral with a Dirac delta function. To see explicitly that this works, we compute: δf [x (t] δx (t lim lim lim d dε f [x n (ε, t ] ε0 dg (ε, x (t, ẋ (t,... dε ε0 d ( dε g x (t + εh n (t, t, ẋ (t + εḣn (t, t,... ε0 4

( lim h n (t, t + dh n (t, t (t dt +... dt ( lim h n (t, t d dt +... dt ( d dt +... δ (t t dt d dt +... A convenient shorthand notation for this procedure is δf[x(t] δ g (x(t, ẋ(t,... dt δx(t δx(t ( δx(t (t δx(t + d δx(t dt δx(t +... dt ( (t d δx(t dt +... δx(t dt ( (t d δx(t dt +... δx(t δ (t t dt (t d dt (t +... The method can be extended to more general forms of functional f. One advantage of the formal definition is that it can be iterated, δ 2 f [x (t] δx (t δx (t δ δf [x (t] δx (t δx (t ( δ δx (t (t d dt (t (t (t + ( (t d δx (t dt (t + δx (t + ( (t d dt (t + δ (t t + ( (t (t (t d dt ( (t d dt δẋ (t (t + δx (t d (t + dt δ (t t Another advantage of treating variations in this more formal way is that we can equally well apply the technique to classical field theory. 4 Field equations as functional derivatives We can vary field actions in the same way, and the results make sense directly. Consider varying the scalar field action S 1 ( α ϕ α ϕ m 2 ϕ 2 d 4 x 2 with respect to the field ϕ. Setting the functional derivative of S to zero, we have δs [ϕ] 0 δϕ (x µ 1 δ ( α 2 δϕ (x µ ϕ α ϕ m 2 ϕ 2 d 4 x 5

( α ϕ δϕ (x µ α δϕ (x ν m2 ϕ δϕ (x µ δϕ (x ν d 4 x ( α α ϕ m 2 ϕ δϕ (x µ δϕ (x ν d4 x ( α α ϕ m 2 ϕ δ 4 (x µ x µ d 4 x ϕ m 2 ϕ and we have the field equation. Exercise: Find the field equation for the complex scalar field by taking the functional derivative of its action. Exercise: Find the field equation for the Dirac field by taking the functional derivative of its action. Exercise: Find the Maxwell equations by taking the functional derivative of its action. With this new tool at our disposal, we turn to quantization. 6