Statistical Mechanics, Math 266: Week 1 Notes

Similar documents
Automorphic Equivalence Within Gapped Phases

Introduction to Quantum Spin Systems

08a. Operators on Hilbert spaces. 1. Boundedness, continuity, operator norms

Finite-dimensional spaces. C n is the space of n-tuples x = (x 1,..., x n ) of complex numbers. It is a Hilbert space with the inner product

Measurable functions are approximately nice, even if look terrible.

The following definition is fundamental.

Lecture Notes Introduction to Ergodic Theory

1 Math 241A-B Homework Problem List for F2015 and W2016

here, this space is in fact infinite-dimensional, so t σ ess. Exercise Let T B(H) be a self-adjoint operator on an infinitedimensional

MATHS 730 FC Lecture Notes March 5, Introduction

Propagation bounds for quantum dynamics and applications 1

THEOREMS, ETC., FOR MATH 515

Real Analysis Math 131AH Rudin, Chapter #1. Dominique Abdi

at time t, in dimension d. The index i varies in a countable set I. We call configuration the family, denoted generically by Φ: U (x i (t) x j (t))

I teach myself... Hilbert spaces

HOPF S DECOMPOSITION AND RECURRENT SEMIGROUPS. Josef Teichmann

Analysis Preliminary Exam Workshop: Hilbert Spaces

The 1+1-dimensional Ising model

Part V. 17 Introduction: What are measures and why measurable sets. Lebesgue Integration Theory

Probability and Measure

1 Stochastic Dynamic Programming

2 Garrett: `A Good Spectral Theorem' 1. von Neumann algebras, density theorem The commutant of a subring S of a ring R is S 0 = fr 2 R : rs = sr; 8s 2

MATH 205C: STATIONARY PHASE LEMMA

1/12/05: sec 3.1 and my article: How good is the Lebesgue measure?, Math. Intelligencer 11(2) (1989),

Representation theory and quantum mechanics tutorial Spin and the hydrogen atom

Introduction and Preliminaries

Lebesgue Measure on R n

3 (Due ). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure?

SPECTRAL PROPERTIES OF THE LAPLACIAN ON BOUNDED DOMAINS

Economics 204 Fall 2011 Problem Set 2 Suggested Solutions

1 Continuity Classes C m (Ω)

SPECTRAL THEORY EVAN JENKINS

Lattice spin models: Crash course

Frustration-free Ground States of Quantum Spin Systems 1

PROBLEMS. (b) (Polarization Identity) Show that in any inner product space

Problem Set 6: Solutions Math 201A: Fall a n x n,

Lecture 19 L 2 -Stochastic integration

are Banach algebras. f(x)g(x) max Example 7.4. Similarly, A = L and A = l with the pointwise multiplication

Functional Analysis I

6.1 Main properties of Shannon entropy. Let X be a random variable taking values x in some alphabet with probabilities.

2) Let X be a compact space. Prove that the space C(X) of continuous real-valued functions is a complete metric space.

Gaussian Random Fields

10.1. The spectrum of an operator. Lemma If A < 1 then I A is invertible with bounded inverse

Analysis-3 lecture schemes

4 Linear operators and linear functionals

Supplementary Notes for W. Rudin: Principles of Mathematical Analysis

Theorem 5.3. Let E/F, E = F (u), be a simple field extension. Then u is algebraic if and only if E/F is finite. In this case, [E : F ] = deg f u.

16 1 Basic Facts from Functional Analysis and Banach Lattices

1 Mathematical preliminaries

Vector Spaces. Vector space, ν, over the field of complex numbers, C, is a set of elements a, b,..., satisfying the following axioms.

A note on the σ-algebra of cylinder sets and all that

Spectral Theory, with an Introduction to Operator Means. William L. Green

Analysis Finite and Infinite Sets The Real Numbers The Cantor Set

Spectral Continuity Properties of Graph Laplacians

THEOREMS, ETC., FOR MATH 516

The Condensate Equation for non-homogeneous Bosons. André F. Verbeure 1. Institute for Theoretical Fysics, K.U.Leuven (Belgium)

A Concise Course on Stochastic Partial Differential Equations

CHAPTER 4. Cluster expansions

Math212a1413 The Lebesgue integral.

Integration on Measure Spaces

+ 2x sin x. f(b i ) f(a i ) < ɛ. i=1. i=1

2 (Bonus). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure?

3 Symmetry Protected Topological Phase

OPERATOR THEORY ON HILBERT SPACE. Class notes. John Petrovic

MAT 570 REAL ANALYSIS LECTURE NOTES. Contents. 1. Sets Functions Countability Axiom of choice Equivalence relations 9

Elementary linear algebra

Introduction to the Mathematics of the XY -Spin Chain

LECTURE 1: SOURCES OF ERRORS MATHEMATICAL TOOLS A PRIORI ERROR ESTIMATES. Sergey Korotov,

Math Solutions to homework 5

Mathematical Analysis of a Generalized Chiral Quark Soliton Model

If Y and Y 0 satisfy (1-2), then Y = Y 0 a.s.

Lebesgue Integration on R n

Laplace s Equation. Chapter Mean Value Formulas

SPRING 2006 PRELIMINARY EXAMINATION SOLUTIONS

Continuous Functions on Metric Spaces

Functional Analysis. Franck Sueur Metric spaces Definitions Completeness Compactness Separability...

CHAPTER 8. Smoothing operators

A PRIMER ON SESQUILINEAR FORMS

Measure Theory on Topological Spaces. Course: Prof. Tony Dorlas 2010 Typset: Cathal Ormond

conventions and notation

BRST and Dirac Cohomology

A Brief Introduction to Functional Analysis

SPACETIME FROM ENTANGLEMENT - journal club notes -

S chauder Theory. x 2. = log( x 1 + x 2 ) + 1 ( x 1 + x 2 ) 2. ( 5) x 1 + x 2 x 1 + x 2. 2 = 2 x 1. x 1 x 2. 1 x 1.

REAL AND COMPLEX ANALYSIS

An introduction to some aspects of functional analysis

BOUNDARY VALUE PROBLEMS ON A HALF SIERPINSKI GASKET

REVIEW OF ESSENTIAL MATH 346 TOPICS

AN EFFECTIVE METRIC ON C(H, K) WITH NORMAL STRUCTURE. Mona Nabiei (Received 23 June, 2015)

9 Radon-Nikodym theorem and conditioning

Metric Spaces and Topology

Frustration-free Ground States of Quantum Spin Systems 1

Notions such as convergent sequence and Cauchy sequence make sense for any metric space. Convergent Sequences are Cauchy

Analysis II: The Implicit and Inverse Function Theorems

Math 118B Solutions. Charles Martin. March 6, d i (x i, y i ) + d i (y i, z i ) = d(x, y) + d(y, z). i=1

Topological vectorspaces

Math The Laplacian. 1 Green s Identities, Fundamental Solution

Exercises to Applied Functional Analysis

MATH & MATH FUNCTIONS OF A REAL VARIABLE EXERCISES FALL 2015 & SPRING Scientia Imperii Decus et Tutamen 1

DEFINABLE OPERATORS ON HILBERT SPACES

Transcription:

Statistical Mechanics, Math 266: Week Notes January 5 and 7, 200 The Microcanonical Ensemble The Microcanonical Ensemble refers to a choice of a probability distribution on state space in which all configurations of a fixed energy E are given equal weight. To be concrete, we illustrate with a simple example. Consider a system of N classical point particles of mass m in a finite box Λ = [0, L] 3 R 3. The phase space of this system is given by Γ = [0, L] 3N R 3N where (q, p) = (q,..., q N, p,..., p N ) Γ and q i [0, L] 3 and p i R 3 for all i N. The energy of the system is given by the Hamiltonian, H : Γ R given by H(q, p) = N i= 2m p i 2 () The Hamiltonian enables us to define the submanifold of constant energy, Γ E : Γ E = {(q, p) Γ H(q, p) = E} (2) Hamiltonian dynamics is given by Hamilton s equations of motion: q i = H(q, p) (3) p i p i = H(q, p) (4) q i The following theorem of Liouville, stated without proof, will be necessary to define a measure on Γ E. Theorem.. Liouville The measure dqdp on Γ is preserved under the Hamiltonian flow solving equations 3 and 4.

What this means precisely is the following. Consider the flow, ϕ t : Γ Γ such that ϕ t (q 0, p 0 ) = (q(t), p(t)) is a solution of equations 3 and 4 when q(0) = q 0 and p(0) = p 0. Then letting B be a Lebesgue measurable set, define Liouville s theorem then says that i.e., B t = {ϕ t (q, p) (q, p) B} vol(b t ) = vol(b) for all t (5) dqdp = B t In our concrete example, this is easy to see, since B dqdp (6) p i (t) = p i (0) (7) q i (t) = q i (0) + m p it (8) is just translation which clearly preserves volume. Another important property is that H, energy, is conserved. H(q(t), p(t)) = H(q(0), p(0)) for all t (9) This is a direct consequence of the form of the Hamilton equations of motion: d N dt H(q(t), p(t)) = H dq i q i dt + H dp i p i dt = i= N i= (0) H q i H p i H p i H q i = 0 () This means that Γ E, the manifold of constant energy E is invariant under the flow, i.e., For all (q, p) Γ E, ϕ t (q, p) Γ E (2) It follows also that the trace measure of dqdp on Γ E will also be preserved under the flow ϕ t. It may be intuitively clear what the trace measure means, but we provide a mathematical definition. Let B Γ E be a nice set and suppose Γ E is a differentiable submanifold of Γ so that we define the normal to each point of Γ E. Consider the normals at each point of B and define { B( E) = (q, p) Γ (q, p) lies on a normal to a point in B Γ } E (3) and H(q, p) [E, E + E] and ν E (B) = vol(b( E)) lim E 0 E (4) 2

It can be shown that this defines a measure of Γ E which will clearly be invariant under the flow ϕ t. In our case, Γ E is compact, and ν E (Γ E ) = which we can easily compute: Γ E = [0, L] 3N S 3N ( 2mE)ν E (Γ E ) = d de vol({(q, p) H(q, p) E}) Ω E (5) d } {L 3N (2mE) 3N 2 ω3n de where ω d is the volume of the unit ball in R d which is given by So ω d = Ω E = ν E (Γ E ) = 2mL 3N π 3N 2 (6) πd/2 Γ( d 2 + ) (7) 3N 3N (2mE) 2 2 Γ( 3N 2 + ) (8) (ν E is an invariant positive measure, but it is not normalized.) The microcanonical probability measure is the normalized version of ν E, µ E (A) = ν E(A) ν E (Γ E ) (9) which represents the equal probability distribution of all configurations (q, p) of fixed energy E. 2 Derivation of the Maxwell Distribution Now assuming the probability distribution µ E on Γ E, what is the resulting distribution of one particle? We want to find the distribution function, ( ) P 2m p2 e = µ E (A(e)) (20) where A(e) = {(q, p) Γ E p 2 2me} (2) Then the probability density function will be f(e) = d ( ) de P 2m p2 e = d d de de ν E(A(e)( E)) (22) = L3 4π2meL 3N 3 ν 3N 3 E (Γ 3N 3 νe 3N (Γ3N E e ) = 4π2me (2m(E e)) 3 2 (N ) (2mE) 3N 2 E ) (23) 3(N ) 2 3(N) 2 ( 4π2meC e (2mE) 3/2 E ) 3N 2 5 2 Γ( N 2 + ) (24) 2 + ) ) 3/2 (25) Γ( 3(N ) ) ( 3N 2 3

where in the last (almost) equality, we have made use of Stirling s formula: Γ(n) = 2πn n /2 exp( n) ( O( n ) ) (26) We shall be interested in taking the thermodynamic limit in which we take N N L 3 = ρ L E L 3 = η E (27) Therefore, in the thermodynamic limit, f(e) = C e ( ) 3/2 3ρ exp( 3ρ e) (28) 2m 2η 2η Recalling now that e = 2 mv2 where v is the speed of the first particle, we can change variables, f(v) = Cv 2 exp ( β 2 ) mv2 (29) where β = 3ρ 2η is to be interpreted as inverse temperature. 3 Explanations and Issues Why does the Maxwell distribution just derived so accurately describe the reality of molecules in a gas? When we measure this distribution for a gas in equilibrium, we sample the system over a long time and by Poincare s recurrence theorem, explores the available portion of phase space over and over again such that the net effect is that the time averages become averages over phase space. To make this more precise, we will use Birkhoff s Ergodic theorem without proof. Theorem 3.. Birkhoff For Γ 0 Γ such that vol(γ 0 ) < and for all f L (Γ 0 ), exists for almost all (q, p) Γ 0 T lim f ϕ t (q, p)dt = f (q,p) T T 0 Here f (q,p) is the time average of f on a trajectory of dynamics with initial condition (q, p). The theorem says that this time average is defined almost everywhere on Γ 0. We can prove furthermore, f (q,p) on a trajectory (q(t), p(t)) is actually independent of t 4

If we assume further that Γ 0 is metrically indecomposable, then f (q,p) = f(q, p)dqdp (30) vol(γ 0 ) Γ 0 independent of (q, p) Γ 0 where the right hand side is the phase space average with respect to the invariant measure. Here, metric decomposability means that there is no non-trivial decomposition Γ 0 = Γ Γ 2 such that Γ Γ 2 = where Γ and Γ 2 are of non-zero measure and are invariant under the flow. So now one can argue that the microcanonical measure gives the correct result because phase space averages computed with it are equal to time averages. This makes sense except that the condition of metric indecomposability would have to be checked, which is usually impossibly difficult or else demonstrably false. In the case at hand, it is the latter, since Γ E is actually metrically decomposable, since the energies of individual particles are conserved. Of course, in an experiment, we would have to pick many different particles to build a histogram of their energy distribution. So implicitly we are also averaging over a larger number of different particles. Whatever the precise explanation may be, the experimental fact is that the assumption of uniform distribution over a configuration of given energy gives the correct prediction under very general circumstances. 5

Statistical Mechanics, Math 266: Week 2 Notes January 2 and 4, 200 The Canonical Gibbs Measure By analysis similar to the derivation of the Maxwell distribution, one can show E for Λ 0 Λ, N 0 N, Λ = η, N Λ = ρ, and N, E, Λ, the phase space distribution for Λ 0, and N 0 are obtained by conditioning µ E,Λ,N on the N 0 particles found in the region Λ 0. In the thermodynamic limit, µ Can Λ 0,N 0,β e βh Λ 0,N 0 () where β = 3ρ 2η as before. (See Minlos for details.) This is a special case of a canonical ensemble measure. Note that the expected value of the energy of the subsystem in Λ 0 is then given by H Λ0,N 0 (q, p)µ Can Λ 0,N 0,β(dqdp) = HΛ0,N 0,β(q, p)e βh Λ 0,N 0,β(qp) dqdp Z Can (Λ 0, N 0, β) Here, Z Can (Λ 0, N 0, β) is the partition function Z Can (Λ 0, N 0, β) = e βh Λ 0,N (q,p) 0 dqdp (3) Therefore (2) H Can = d dβ log ZCan (Λ 0, N 0, β) (4) So we can compute Z Can and derive the energy from this expression. For later use, we also define the free energy F (Λ, N, β) = β log Z Λ,N,β (5) To close out this section, we compute the partition function for a simple model. Let us suppose that we have N classical particles in a box Λ R 3 as before, with Hamiltonian given by H(q, p) = N p 2 i i= j i 2m + U( q i q j ). If U = 0,

then Z Λ,N,β = Λ N ( We can also calculate the expected energy ) 3N p2 β e 2m dp (6) ( ( ) ) 3/2 N 2πm = Λ (7) β H = d 3N log Z = dβ 2β (8) So that we obtain equipartition of energy of 2 k BT per degree of freedom. In the case that U 0, then Λ N is replaced by dq... dq N e βu( qi qj ) 2 The Ising Model Λ Λ i<j N We shall be interested in the Ising Model because it is a simple model. historically significant in the development of Statistical Mechanics. a model that exhibits phenomena and allows us to ask questions that are central in Statistical Mechanics. a useful model for a number of physical phenomena, including some aspects of liquid-gas phase transitions, and adsorbtion of molecules on a surface such as in catalytic converter. The point is that we will be able to understand the effects of interaction between particles in this model in considerable detail. To define the model, we replace the continuous space R d with the lattice Z d, and attach variables to each lattice site For all x Z d, σ x {, +} (9) which can take only two values. One interpretation is that σ x represents the rotation of a magnetic dipole attached to position x. Another useful interpretation is that of a lattice gas where we make the transformation n x = 2 ( + σ x) (0) n x {0, } () 2

so that each n x is considered an occupation variable where a value of 0 denotes that the site x is empty, and a value of denotes that site x is occupied by a gas molecule. One could think of a layer of gas molecules attaching to the surface of a catalytic material. In both cases, the model is only a caricature of the real stuff, but we will see that it is a useful caricature. The phase space Γ is replaced by the configuration space Ω Λ = {, +} Λ (2) = { all functions σ : Λ {, }} (3) where σ = (σ x ) x Λ is called a configuration. The minimal description of the physics that we need to do Statistical Mechanics is the energy, i.e., the Hamiltonian defined on Ω Λ which assigns to each configuration its energy. (There are no Hamilton equations of motion here.) H Ising Λ = J σ x σ y + h σ x (4) x Λ x,y Λ x y = One could generalize to J(x, y) and h(x) or even random variables J xy but for now we restrict to constant J and h. J is a coupling constant and h gives the strength of an external magnetic field. We could again show from the microcanonical measure which assigns equal weight to all configurations of a given energy. With some effort we could then prove that restricted to the variables in a subvolume, and after taking Λ, we get a canonical distribution. But we don t have to work with the subset of configuration of a given energy. In fact, µ Λ,β (σ Λ ) = e βhλ(σλ ) Z(Λ, β) (5) defines a probability measure in which lower energy configurations have higher probability. But counterbalancing this is the fact that there are many more configurations of energy Λ than there are of energy = min σ H(σ). This will eventually lead us to introduce the concept of entropy and the competition between energy and entropy is at the core of the most interesting phenomena in Statistical Mechanics. The Ising model is named after Ising, who studied it for his PhD thesis. It was his adviser, Lenz, who introduced it. It took a while to understand it, and Ising solved the one dimensional case in his dissertation. We will turn to that solution in a minute. It turned out to be less interesting than they had hoped, but Ising didn t realize that the dimension plays a crucial role. What do we mean by solution? Here, we mean calculating Z, log Z, or lim Λ Z d Λ log Z. Let s first explain why this calculation indeed allows one to answer the most important questions. The free energy density in the thermodynamic limit is given by f(β, h) = lim log Z(Λ, β, h) (6) Λ Z d β Λ 3

For a finite system Λ Z d, the free energy is We already noted that F Λ (β, h) = log Z(Λ, β, h) (7) β H Λ = β βf Λ(β, h) (8) Another important quantity is the magnetization (related to the particle number in the lattice gas interpretation) σ M Λ = x Λ σ x)e βhλ(σ) Z Λ (β, h) (9) = β h log Z Λ(β, h) (20) = h F Λ(β, h) (2) For any observable A Λ, one could add a term λa Λ to H Λ and define the corresponding F Λ (β, λ) so that A Λ = λ F Λ(β, λ) (22) Moreover, it doesn t stop with the mean. We can also calculate variance and, in principle, any moment we like. For example, One can show that the limits V (A Λ ) = (A Λ A Λ ) 2 (23) = A 2 Λ A Λ 2 (24) and A 2 Λ = 2 λ 2 F Λ(β, λ) (25) lim Λ Z d Λ F Λ(β, h) (26) H Λ lim = e(β, h) = βf(β, h) Λ Z d Λ β (27) M Λ lim = m(β, h)[= f(β, h)] Λ Z d Λ h (28) all exist, and where the energy density, e, was previously denoted η. Here interchanging the derivative and the thermodynamic limit is an interchange of limits and requires further justification. This interchange can be shown to be true under quite general assumptions, but this does not mean that all such limits are interchangeable. For example, m(β, h) is, in general, not a continuous function of h and lim = µ 0 (29) h 0 ± 4

while m(β, 0) = 0. This is the signature of a phase transition. In fact, this is what Ising hoped to show. To this end, he calculated f and m for the onedimensional Ising model. 3 More on the Ising Model Recall that Hamiltonian of the the Ising model is H Λ = J σ x σ y (30) x,y Λ x y = where we restrict to the situation where J > 0. This corresponds to the ferromagnetic Ising model. We have previously considered the thermodynamic limit as a limit in which a clear picture of the asymptotic behavior emerges. But there is more. In order to give a mathematically rigorous description of non-uniformity of the thermodynamic limit, the hallmark of phase transitions, we introduce the mathematical formulation of -systems. We first focus on the Ising model and generalize later. The set of configurations, Ω, is given by Ω = x Z{, +} (3) and we recall that Ω is compact with the Tychonoff topology. There is no difficulty with considering infinite configurations σ Ω; in d =, σ is a biinfinite sequence. In general, it just means that we are given a value σ x {, +} for all x Z d. It does not make sense in general to talk about probabilities of infinite configurations not even ratios of probabilities, since typically µ β (σ) = 0. The solutions is to limit the kind of events under consideration to the cylinder sets: A X (σ X ) = {η Ω η x = σ x for all x X} (32) and X Z d is a finite set. Then we can make sense of µ(a X ) and the ratio µ(a X (σ X )) µ(a X (σ X )) = e β H X (σ,σ) (33) We can make this a bit more elegant by defining a fuction { if σ AX (σ f A (σ) = X ) (34) 0 if σ / A X (σ X and considering linear combinations and uniform limits of such functions. This leads to the algebra of functions C(Ω), since such functions are continuous on Ω with the Tychonoff topology. Instead of defining a probability measure directly, we define the expectation of all such functions µ(f) = σ Ω f(σ)µ(σ) (35) 5

By the Riesz Representation Theorem for measures, this is actually equivalent to defining a Borel probability measure on Ω. References: Minlos Ruelle, Statistical Mechanics: Rigorous Results. London: Imperial College Press 999 H-O Georgii, Gibbs Measures and Phase Transitions. De Gruyter Studies in Mathematics Vol. 9. Berlin: de Gruyter 988. 4 Explicit Calculation of the Thermodynamic Limit of the -Dimensional Ising Model In the following section, we shall use the transfer matrix method which also appears in the analysis of transition matrices of Markov chains. For the Ising model in one dimension, we have where we have Λ = [ L, L] Z (36) H(σ) = J Z Λ (β) = = L x= L σ L,...,σ L =± σ L,...,σ L =± T = σ x σ x+ (37) e βj P L x= L σxσx+ (38) T σ L σ L+ T σ L+ σ L+2 T σl σ L (39) ( ) T++ T + = T + T ( ) e βj e βj e βj It follows then that in terms of the transfer matrix, T, ( ( ), T ) 2L e βj (40) (4) which can be simplified further, since T is clearly a diagonalizable matrix. The eigenvalues can clearly be seen to be: Thus, ( λ+ 0 T = S 0 λ Z Λ (β) = ( ), S λ + = 2 cosh(βj) (42) λ = 2 sinh(βj) (43) ) S (44) ( λ 2L + 0 0 λ 2L ) ( = 2(2 cosh(βj)) ) 2L (45) 6

Then by taking the thermodynamic limit Λ Z, we obtain the free energy density, f(β) = log(2 cosh(βj)) (46) β in a similar way, we can compute the expectation σ L,...,σ 0=± e βh(σ) σ µ Λ (A [,n] )(σ,..., σ n ) = n+,...,σ L =± Z Λ (β) (47) where we hold σ,..., σ n fixed. Expanding this now using the transfer matrix method, we obtain for the numerator (T L ) σ L σ T σσ 2 T σ2σ 3... T σn σ n (T L n ) σnσ L σ L σ L =± = e βh [,n](σ,...,σ n) ( ( ), T ) L e σ e σn, T L n where we use the notation ( e + = 0) ( ) 0 e = And so the final result is µ [ L,L] (A [,n] (σ,..., σ n )) = e βh(σ,...,σn) λ L +σ+ L n 2λ+ 2L = e βh(σ,...,σn) 2λ n + (48) (49) 7

Statistical Mechanics, Math 266: Week 3 Notes January 9 and 2, 200 Phase Transitions and Spontaneous Symmetry Breaking Consider the d-dimensional Ising model: Λ Z d e.g., Λ = [, L] d () H Λ = J σ x σ y (2) x y = x,y Λ As before, we will assume that the model is ferromagnetic, so J > 0. The Hamiltonian H Λ exhibits a spin flip symmetry which takes σ x σ x Many fundamental models have symmetries and many interesting phase transitions are accompanied by symmetry breaking. More precisely, let F : Ω Ω be defined by F (η) = η (3) Clearly H Λ F = H Λ. It follows that the equilibrium state ω Λ (f) = η f(η)e βhλ(η) η e βhλ(η) (4) is also F -symmetric, meaning ω Λ (f F ) = ω Λ (f) for all f C(Ω Λ ) (5) In particular, we have for all x Λ, ω Λ (σ x ) = ω Λ (σ x F ) = ω Λ (σ x ) (6) and hence ω Λ (σ x ) = 0 for all x Λ (7) Taking the thermodynamic limit does not change this, lim ω Λ (σ x ) = 0 (8) Λ Z d

All of the thermodynamics is contained in the function f(β), the free energy density βf(β) = lim Λ Z d Λ log Z Λ(β) (9) Z Λ (β) = (0) η Ω Λ e βhλ(η) It is easy to see that boundary conditions do not affect f, so in the thermodynamic limit, we obtain the same thermodynamics. Now define the boundary of Λ, Λ = {x Λ there exists y Z d Λ c and x y = } () Then consider b Λ C(Ω Λ ) and suppose that b Λ sup B Λ, so that b Λ is uniformly bounded with B some fixed constant. For some sequence of boundary terms, we may find that ω b = lim Λ ω b Λ exists (2) ω bλ Λ (f) = Z bλ Λ (β) η Ω Λ f(η)e βh b Λ Λ (η) (3) where H bλ Λ = H Λ + b Λ (4) Then, e Bβ Λ Z Λ Z bλ Λ Z Λe Bβ Λ (5) So that as long as we have Λ Λ 0, we will have that f b (β) = f(β). It is therefore reasonable to assume that ω b = lim Λ ω bλ Λ. If this limit exists, it will also describe the equilibrium of the thermodynamic system. (We will make this precise and rigorous when we study characteristics of equilibrium later.) Under quite general conditions, one can show that for some β c > 0, ω b is independent of b for all 0 β β c. But it often happens that there is some dependence on the boundary condition, b, if β is large enough. Before doing anything more general, we will show that this happens for the d-dimensional Ising model. 2 The Peierls Argument We will consider the particular boundary term leading to what is called + boundary conditions, b Λ = J σ x (6) x Λ y Λ c, x y = as if the spin at y Λ c are all fixed to be +. boundary conditions are completely analogous. Let s assume that the following limit exists: ω + = lim Λ ω + Λ (7) = lim Λ ω bλ Λ (8) 2

Note that if we were free to interchange limits, it would be rather trivial to show that lim β m(β) = since lim β ω + Λ (σ x) = for all finite Λ Z d, for all d. Theorem 2.. Let J > 0 and d = 2. Then,. There exists β > 0 such that for all β > β, Λ ω + Λ (σ x) = m Λ (β) > 0 (9) x Λ and lim β m Λ (β) = uniformly in Λ. 2. There exists β 2 > 0 such that for all β > β 2 and for all x Λ, and lim β m x (β) = uniformly in x. In fact, we obtain bounds of the form For sufficiently large β. ω + Λ (σ x) = m x (β) > 0 (20) 0 m x (β) 26e 8Jβ (2) 0 m Λ (β) 26e 8Jβ (22) Remark 2.. Onsager obtained an exact solution of the free energy density if the 2-dimensional Ising model frem which it follows that β c = log(+ 2) 2J. To prove the theorem we will use the contour description of Ω + Λ = {η Ω Λ (Λ c ) η Λ c= +} (23) + + + + + + + + + + + + - - - - + + + + + + + + + + + + Figure : Configuration space as described by contours. There is a one-to-one correspondence between configurations and configurations of contours. σ Ω + Λ {γ,..., γ n } = Γ(σ) (24) 3

+ + + - γ Figure 2: A dual edge is in γ if the corresponding edge has a + pair. Here, Γ(σ) is a configuration of closed, non-intersecting paths (where we ignore corner intersection) in the dual lattice. We define the support of a configuration, supp Γ = n supp γ i = {(x, y) Z 2 σ x σ y = } (25) i= and this allows us to define the length of a configuration of contours as l(γ) = supp Γ = n l(γ i ) (26) Now, we wish to rewrite the Hamiltonian of the Ising model in terms of contours. H + Λ (σ) = J{#{(x, y) Z2 σ x σ y = +}} + J{#{(x, y) Z 2 σ x σ y = }} (27) = J B(Λ) + 2Jl(Γ) (28) where B(Λ) is equal to the number of edges in Λ. Here, we can think of energy as being proportional to the length of the contours. Since the configurations Γ(σ) that we consider consist of compatible, closed, non-intersecting paths, we define V (γ) to be the vertices enclosed by a given contour γ. Lemma 2.. For all γ such that l(γ) <, i= V (γ) 6 l(γ)2 (29) Proof. Left to reader. See homework. Lemma 2.2. Let Λ Z 2 and l = 4, 6, 8,... Define M Λ (l) = # of distinct simple contours of length l within Λ (with + boundary conditions). Then M Λ (l) 3 l Λ (30) Proof. Left to reader. See homework. 4

We know what P + Λ (σ) is, and this allows us to make sense of P(Γ). Now, for a given contour γ, we define P + Λ (γ) to be the probability that γ occurs. Specifically, it is the event that contains all configurations Γ that have γ in it. Explicitly, P + Λ (γ) = Lemma 2.3 (Peierls Estimate). σ Ω + Λ γ Γ(σ) e βh+ Λ (σ) (3) σ Ω + e βh+ Λ (σ) Λ P + Λ (γ) e 2Jβl(γ) (32) Proof. For all σ such that γ Γ(σ), for some fixed γ define σ as the unique configuration such that Γ(σ ) = Γ(σ)\{γ}. Explicitly, σ is obtained by flipping all spins located at x V (γ). Recall that H + Λ (σ) = JB(Λ) + 2Jl(γ), and therefore H + Λ (σ) H+ Λ (σ ) = 2Jl(Γ) (33) and P + Λ (γ) = σ Ω + Λ γ Γ(σ) e βh+ Λ (σ) e βh+ Λ (σ) (34) σ Ω + Λ σ Ω + e βh+ Λ (σ) Λ γ Γ(σ) σ Ω + e βh+ Λ (σ ) Λ γ Γ(σ) (35) e 2βJl(γ) (36) Proof of Theorem 2.. We will estimate 0 ω + Λ (σ x) = ω + Λ ( σ x) (37) Observe that σ x takes the values 0 and 2. If σ x = 2, then there exists a γ Γ(η) such that x V (γ). Denote by γ (σ) the first contour you meet 5

starting at x. Then σ,γ (σ)=γ e βh+ Λ (σ) ω + Λ (σ x) 2 γ,x V (Γ) (38) σ e βh+ Λ (σ) 2 = 2 2 γ,x V (γ) γ,x V (γ) γ,x V (γ) σ,γ Γ(σ) e βh+ Λ (σ) σ e βh+ Λ (σ) (39) P + Λ (γ) (40) e 2βJl(γ) (4) where (39) comes from the fact that ω + Λ vanishes if it is enclosed by an even number of contours, and (4) follows from Lemma 2.3. From here, Parts and 2 of Theorem 2. proceed only slightly differently, and so we only present the proof of Part 2. We rewrite the inequality (4) in a more suggestive way. ω + Λ (σ x) 2 e 2βJl (42) 2 l=4,6,8,... l=4,6,8,... γ,x V (γ) l(γ)=l l 2 3 l e 2βJl (43) 6(3e 2βJ ) 2 where 3e 2βJ 2 (44) Here, we have used the observation that a contour of length l must be contained within a square box of size l centered at x, along with lemma 2.2 and the fact that k 2 r k = 2r2 (2 3 2 r + 2 r2 ) ( r) 3 (45) k=2 The lower bound on β now follows. Remark 2.2. Clearly ω + Λ (σ x) as β, and with the same estimates for the boundary condition, we have that ω Λ (σ x) as β. Hence, if ω ± Λ ω± as Λ Z 2 (46) clearly, ω + ω. A similar argument works for d 2, and these same arguments (using Peierls Estimate) can be generalized to other models with somewhat similar structure. 3 The Griffiths Inequalities The goal for us is to show the existence of the limiting Gibbs states, but the Griffiths inequalities have many other applications. Again, we are considering 6

Ising systems on Z d. The algebra of observables for a finite volume Λ Z d is C(Ω Λ ). Consider the special observables σ A = x A σ x for all A Λ (47) σ = (48) We make the observation that the set {σ A A Λ} is a basis for C(Ω Λ ) because δ ηx=ε = 2 (+εσ x) forms a basis upon taking products. The ferromagnetic Ising model can be generalized to a general class of ferromagnetic models with local Hamiltonians of the form H Λ = A J A σ A, J A 0 (49) Note that ω Λ (σ A ) = ω Λ (σ A σ B ) ω Λ (σ A )ω Λ (σ B ) (50) J B where we have set β = in this equation. Theorem 3. (Griffiths Inequalities). Let ω Λ be the Gibbs state at β with ferromagnetic Hamiltonian H Λ. Then. ω Λ (σ A ) 0 for all A Λ. 2. ω Λ (σ A σ B ) ω Λ (σ A )ω Λ (σ b ) 0 for all A, B Λ. Proof. Proof of. ω Λ (σ A ) = Z Λ = Z Λ σ A (η)e βhλ(η) (5) η Ω Λ β n σ A (η)( J B σ B ) n (52) n! n=0 η Ω Λ B Λ Clearly σ A σ B = σ C with C = A B, and where A B = (A B) \ (A B) is the symmetric difference of A and B. Now, we group all terms with the same C. ω Λ (σ A ) = a(c) σ C (η) (53) C η Ω Λ If C, η Ω Λ σ C (η) = 0 since if x C then the sum over η with η x = ± cancel each other out. In the case that C =, η Ω Λ σ C = 2 Λ. So ω Λ (σ A ) = a( )2 Λ 0. Proof of 2. Note that for, we did not really use the structure of Z d, and the argument works on any finite set Λ. Now we will consider Λ = Λ Λ, two disjoint copies of Λ. Equivalently, we can consider a system with two copies of the algebra Ã Λ = C(Ω Λ ) C(Ω Λ ) (54) 7

where each copy of C(Ω Λ ) is generated by the functions σ x and τ x respectively. The configuration space is Ω Λ = Ω Λ Ω Λ = {(η, ξ) η x, ξ x {, +}}. Similarly, we have σ A and τ A. Define If f(η, ξ) = f (η)f 2 (ξ), we have Now consider rotated variables H Λ (η, ξ) = H Λ (η) + H Λ (ξ) (55) Z Λ = e βhλ(η) e βhλ(ξ) (56) ξ Ω Λ η Ω Λ = (Z Λ ) 2 (57) ω Λ (f) = ω Λ (f )ω Λ (f 2 ) (58) s x = 2 (σ x + τ x ) (59) t x = 2 (σ x τ x ) (60) which take values 2, 0, 2 on double configurations. Note that and for A Λ, ± A = σ A ± τ A = where σ A = x A σ x, and τ A is completely analogous. Lemma 3.. with some K B 0 Proof. Just calculate σ x = 2 (s x + t x ) (6) τ x = 2 (s x t x ) (62) ( 2 ) A {(s + t) A ± (s t) A } (63) ± A = B A K B s A\B t B (64) (s + t) A = x A(s x + t x ) (65) = B A s A\B t B (66) and (s t) A = B A( ) B s A\B t B (67) The lemma follows. 8

Now, we can proceed with the proof of 2. ω Λ (σ A σ B ) ω Λ (σ A )ω Λ (σ B ) (68) = ω Λ (σ A σ B ) ω Λ (σ A τ B ) (69) = ω Λ (σ A (σ B τ B )) (70) ( ) A + B = 2 ω Λ (s + t) A {(s + t) B + (s t) B } (7) = ω Λ ( C B K C (s + t) A s B\C t C ) (72) = ω Λ ( D K D s A B\D t D ) (73) The variables s x and t x have the following properties: s x t x = 0 and either s x or t x = 0 for a given configuration σ x, τ x (74) t x and s x are odd functions of σ x and τ x, so all their odd powers are odd and all their even porwers are of course greater than or equal to 0. So { s n = 0 if n odd. x = (75) > 0 if n even. σ x,τ x and the same result holds for t n x. The Hamiltonian can be rewritten, H Λ = A Λ K A σ A + K A τ A (76) = A Λ K A ( + ( ) C )s A\C t C (77) C A and is again a Hamiltonian with coefficients greater than or equal to 0 in the monomial basis. Although the polynomials in s and t are not independent variables, the same argument as in applies. H Λ = K C,D s C t D (78) Indeed, C,D Λ ω Λ = ZΛ n=0 = ZΛ n=0 β n n! β n n! {σ x,τ x} {σ x,τ x} C,D s A t B ( C,D A K C,D s C t D ) n (79) K ABCD s A s C t B t D (80) H Λ = A Λ J A σ A + J A τ A (8) = 2 A Λ C A J A ( + ( ) C )s A\C t C (82) 9

Therefore we can apply to H Λ and finish the proof of 2. Recall the observation J B ω Λ (σ A ) = ω Λ (σ A σ B ) ω Λ (σ A )ω Λ (σ B ) (83) J B ω Λ (σ A ) 0 for ferro- therefore the second Griffiths inequality implies that magnetic Ising models. 4 The Thermodynamic Limit of Ising Equilibrium States The set of states on C(Ω Z d) is weak-* compact. From this we deduce that at each fixed βj, the set of finite volume states {ω Λ } Λ Z d has at least one limit point (extend them to states on all of Z d in more or less any way you like). And more generally the same is true for sequences with other boundary conditions b Λ. The Peierls argument shows that if ω + and ω are such limit points of {ω + Λ } and {ω Λ } respectively, then for all β large enough, they will be distinct, since ω + (σ 0 ) = ω (σ 0 ) 0 (84) Later, in a more general context, we will show that for small β the limit points are unique independent of b Λ. It is nevertheless still an interesting question whether the sequence ω + Λ itself converges. Theorem 4... Let {ωλ 0 } be the sequence of β Gibbs states in finite volume Λ Z d of the Ising model with free boundary conditions. Then ω 0 Λ(σ A ) ω 0 (σ A ) for all finite subsets A Z d (85) 2. If {ω + Λ } is a sequence corresponding to + boundary conditions, then ω + Λ (σ A) ω + (σ A ) (86) i.e., we have weak* convergence in both cases and they are monotone increasing and decreasing, respectively, on the basis functions σ A. Proof. Proof of. ωλ 0 can be regarded as the Gibbs state for the ferromagnetic Ising model H Λ = X J Λ Xσ X (87) with { JX Λ J if X = (x, y), x, y Λ, x y = = 0 otherwise (88) 0

with J > 0. Hence, JX Λ is monotonic increasing by the second Griffiths inequality. Proof of 2. Define JX Λ = J x if X = x, y, x, y Λ, x y = + if X = {x}, x Λ c 0 otherwise (89) It is not hard to see that the infinite coupling constant does not pose a problem. Note that the Griffiths inequalities also show that a variety of other Ising models with higher dimensionality and anisotropies also have a non-vanishing magnetism at sufficiently low temperature by comparing with the two-dimensional translation invariant model.

Statistical Mechanics, Math 266: Week 4 Notes January 26 and 28, 200 Quantum Statistical Mechanics The canonical formalism of Quantum Statistical Mechanics has the same structure as in the classical case, but the mathematical objects playing the role of the probability measure and the Hamiltonian are of a different kind. We will first set up the mathematical structure, then make it more explicit in the context of Quantum Spin Systems. Later, we will also consider systems of quantum particles in the continuum. A finite quantum system, analogous to a finite number of classical particles in a finite volume Λ R 3, or the Ising model with a finite number of spins, is described by a separable Hilbert space of states H a densely defined self-adjoint operator, H on H, which plays the role of the Hamiltonian observables described by bounded operators on H, usually a norm-closed *-subalgebra of B(H) Example.. N non-interacting, spinless, structureless, distinguishable point particles in a box Λ R d : H = L 2 (Λ N, dx) () H 0 = N i 2m (2) i= D(H) = H 2 (3) where i is the d-dimensional Laplacian with suitable boundary conditions, and H 2 is the Sobolev Space of twice weakly differentiable functions in L 2. Example.2. N spinless, structureless point particles interacting via a pair potential V in Λ R d. H = H 0 + V (x i x j ) (4) i<j N

Example.3. N non-interacting spin 2 Fermions. where H 0 is the Hamiltonian defined in Example.. H = [L 2 (Λ) C 2 ] N (5) H = H 0 (6) Example.4. Two-state quantum spins or qubits on labelled sites x Λ Z d H = (C 2 ) Λ (7) One typical model is the Heisenberg model with the Hamiltonian H = J S x S y (8) x,y Λ x y = 2 The Canonical Formalism for Quantum Systems We will assume (and not encounter exceptions to this assumption) that the finite system Hamiltonian H Λ is such that e βhλ is trace class for all β > 0. The index Λ indicates that typically we will consider again sequences of systems indexed by finite volume Λ which will be taken to R d, Z d, or whatever. For self-adjoint H, the condition that e βh is equivalent to the statement that its spectrum consists entirely of eigenvalues of finite multiplicity, which we will often enumerate as follows: λ 0 λ λ 2... (9) repeated according to their multiplicity, and the assumption that e βλn < + (0) n 0 If dim H Λ < +, this is of course automatically satisfied. For systems of particles in finite volume, there are general theorems about operators of the form H = N i + V (x i x j ) () 2m i= i<j N that guarantee this property for most systems of interest. Since e βh is assumed to be trace class, we can define ρ β = Z e βh (2) where Z = Tr e βh = n 0 e βλn. ρ β is then a positive definite operator of trace class and with trace : Tr ρ β = (3) 2

This is what is called a density matrix. Density matrices in quantum mechanics play the role of probability densities for classical systems. We illustrate their interpretation: Denote by P [a,b], the projection onto the eigenvectors of H with eigenvalues λ i [a, b]. Then, Tr(ρ β P [a,b] ) = P( system has energy, E [a, b]) (4) i.e., upon observation, the energy value is measured to be in the interval [a, b]. More generally, observables in quantum mechanics are represented by bounded, and sometimes unbounded operators. We will usually assume that our observables come from B(H). This is the object that generalizes a random variable in classical probability and functions on phase space for a system of classical particles, or functions f C(Ω) for a classical spin system such as the Ising model. The mean of the observable A in the canonical Gibbs state at inverse temperature β is given by ω β (A) = Tr(ρ β A) (5) to be compared with f β = fdµ β (6) in the classical case. Similarly, the variance of A is given by ω β ([A ω β (A)] 2 ) = ω β (A 2 ) ω β (A) 2 (7) and the correlation (covariance) between A and B is given by ω β (AB) ω β (A)ω β (B) (8) In general, A A B(H), a norm-closed, unital *-subalgebra of B(H), a.k.a., a C -algebra. A map ω : A C is called a state if the system (a state on A) if it is linear, positive, and normalized:. ω(a + cb) = ω(a) + cω(b) for all c C, A, B A. 2. ω(a A) 0 for all A A. 3. ω() =. Linear functionals with the positivity and normalized properties above have further nice properties:. ω is continuous. i.e., ω is a bounded linear functional with ω = sup A = A A 2. The Cauchy-Schwarz inequality holds: 3. ω(a BA) ω(a A) B. ω(a) = (9) ω(a B) ω(a A)ω(B B) (20) 3

3 The Free Energy Functional and the Variational Principle Entropy plays an essential role in Thermodynamics and Statistical Mechanics as well as in Information Theory and in Large Deviation Theory in probability. It was Boltzmann who realized the precise connection between thermodynamic entropy and a mathematical formula involving physical states. Von Neumann introduced quantum entropy, for a density matrix ρ. One may observe that S(ρ) = Tr ρ log ρ (2) 0 S(ρ) + (22) There is much to say about this function, and one could devote an entire course to its properties and applications. Here, we will use it to give another interpretation of the canonical formalism. Let s consider a finite quantum system with Hamiltonian H and define F β (ρ) = E(ρ) S(ρ) (23) β where E(ρ) = Tr ρh. F β is the free energy functional. At this point, it is not clear what it has to do with the previously defined quantity β log Z β, but here it is: F β (ρ β ) = inf F β(ρ) = ρ β log Tr e βh (24) with ρ β = Z e βh Z β = Tr e βh Minimizing the free energy functional is equivalent to maximizing the entropy given the expectation of the energy. For a proof of (24) we need Klein s inequality. Lemma 3. (Klein s Inequality). Let A and B be two non-negative definite matrices satisfying 0 A, B and such that ker B ker A. Then Tr A(log A log B) Tr(A B) + 2 Tr(A B)2 (25) Proof. The function f(x) = x log x, x > 0, continuously extended such that f(0) = 0, is easily seen to be concave. In fact it is C 2 ((0, )) with f (x) = x (26) 4

By the Taylor Remainder Theorem and the expression for f, it follows that for all x and y such that 0 x < y, there exists a ξ such that x ξ y and f(y) f(x) (y x)f (y) = 2 (x y)2 f (ξ) 2 (x y)2 (27) As A and B are non-negative definite, they are diagonalizable. Denote their eigenvalues by a i and b i, and the corresponding orthonormal eigenvectors by ϕ i and ψ i, respectively. From the assumptions it follows that 0 a i, b i. Using the spectral decompositions of A and B, i.e., A = i B = i ϕ i ϕ i = i i a i ϕ i ϕ i (28) b i ψ i ψ i (29) ψ i ψ i = (30) we see that Tr A(log A log B) Tr(A B) Tr(A B)2 2 = [ Tr ψ i ψ i ϕ j ϕ j f(a) + f(b) + (A B)f (B) ] 2 (A2 + B 2 2AB) ij = Tr ψ i ψ i ϕ j ϕ j [ f(a j ) + f(b i ) + (a j b i )f (b i ) 2 ] (a j b i ) 2 ij 0 where the last inequality follows from applying (27) term by term. Now to prove the variational principle, we can apply Lemma 3. with A = ρ, where ρ is an arbitrary density matrix, and B = ρ β. Note than ker B = {0}. This gives ( ) e βh β(f β F (β)) = Tr ρ log ρ Tr ρ log (3) Z(β) 2 Tr(ρ ρ β) 2 0 (32) If the RHS vanishes, we have ρ = ρ β. Hence the minimum of F β is uniquely attained for ρ = ρ β. 4 Quantum Spin Systems Quantum spin systems are the simplest examples of non-trivial interacting quantum systems with many degrees of freedom. They are defined on a finite set 5

Λ; typically Λ Z d. Usually we have the same kind of spin in each x Λ, but it is sometimes useful to consider a more general situation where for all x Λ we have a finite-dimensional Hilbert space H x of dimension n x ; and where H x C nx and H Λ = x Λ H x. In statistical mechanics we are typically interested in sequences of finite systems indexed by a sequence of sets Λ. For finite Λ, the Hamiltonian is a Hermitian matrix H Λ and the observables are represented by matrices, regarded as linear transformations of H Λ : A Λ = B(H Λ ). If Λ Λ 2, A Λ A Λ2 by the embedding provided by A Λ A A A Λ A Λ2\Λ A Λ2 (33) Often we consider sequences of finite Λ Z d, or another countable set Γ. A natural way to describe the Hamiltonian H Λ in such a case is by considering a potential or interaction of the following form: Φ : { finite subsets of Γ} Λ Γ A Λ. where the union is defined as an inductive limit, and for all X, Φ(X) A X, Φ(X) = Φ(X), and H Λ = X Λ Φ(X). Boundary terms could also be added separately. e.g., in the Heisenberg model, n x = 2 for all x Z d = Γ (34) Φ(X) = 0 unless X = {x, y}, x y = (35) Φ({x, y}) = J σ x σ y (36) In this case, observe A X A X+a for all a Z d and Φ(X +a) = τ a (Φ(X)) where τ a is the translation isomorphism mapping A X to A X+a. τ a is an automorphism of -algebras: τ a (A λ B) = τ a (A) + λτ a (B) (37) τ a (AB) = τ a (A)τ a (B) (38) τ a (A ) = τ a (A) (39) τ a () = (40) We will encounter other important examples of automorphisms shortly. (They describe the dynamics as well as the symmetries of the system.) As in the classical case, the dynamics of the system, i.e., its time evolution, is determined by the Hamiltonian H: i d dt ψ t = Hψ t (Schrödinger) d dt A t = i[h, A] for A A B(H) (Heisenberg) where for convenience, we have set =. The two equations are equivalent, in the sense that in moth cases, the solutions can be expressed in the one-parameter group of unitaries U t B(H) generated by H as follows: U t = e ith (4) and then, ψ t = U t ψ 0, A t = U t A 0 U t (42) 6

Note that the map A α (Λ) t (A) = e ithλ Ae ithλ where A B(H Λ ) is an automorphism of A Λ = B(H ). For models such as the Heisenberg model it is reasonable to ask whether it can be extended to an automorphism of Λ A Λ, which we will call A loc : the algebra of local observables. The answer is no, but almost yes. What we need to do is to complete A loc, in the sense of metric spaces, to obtain a C algebra A, which we now call the algebra of quasi-local observables. It contains all norm-limits of Cauchy sequences of local observables. It can be shown that there exist a one-parameter group of automorphisms α t on A such that for all A A loc lim α (Λ) Λ Z d t (A) = α t (43) in the norm topology. One says that αt Λ α t strongly. Note that there is no good way to define a Hamiltonian in the limit On the other hand, it is straightforward to define lim H Λ =? (44) Λ Z d δ(a) = lim Λ Z d [H Λ, A] (45) for all A A loc. This does not automatically imply that one can define e itδ (A) since δ is an unbounded operator. This is one reason why we need to define A on the completion (or closure) of A loc. 5 Symmetries and Symmetry Breaking (in the Heisenberg Model) In finite volume, we have a unique Gibbs state e βhλ ; boundary conditions can be used to modify the ground states, i.e., lim β ρ β. Just like in the Ising model, we will ask whether we can also obtain different limits as Λ Z d. It is instructive to look at ground states first and to look at the role of symmetries. Since boundary conditions (b Λ ) affect symmetries, they are more obvious in infinite system objects such as Φ or δ, rather than in the local Hamiltonians 7

themselves. Let s focus on the spin 2 Heisenberg model: 0 0 0 0 0 0 0 0 0 σ σ = 0 0 0 0 0 0 + 0 0 0 0 0 0 + 0 0 0 0 0 0 (46) 0 0 0 0 0 0 0 0 0 0 0 0 = 0 2 0 0 2 0 (47) 0 0 0 0 0 0 0 0 0 = 2 0 0 0 0 0 0 0 0 0 0 0 0 (48) 0 0 0 0 0 0 = 2t (49) where t(u v) = v u. Clearly [U U, t] = 0, and this implies that σ σ is SU(2) invariant. More specifically, for all U SU(2), π (Λ) U = x Λ U (50) so that π (Λ) U is a unitary representation of SU(2) on H Λ and it commutes with the Hamiltonian H Λ : [H Λ, π (Λ) U ] = 0 (5) for all U SU(2). The corresponding adjoint representation on A Λ = B(H Λ ) is given by ρ (Λ) U (A) = π(λ) U Aπ(Λ) U (52) for all A A Λ. ρ (Λ) U is a representation of SU(2) by -automorphisms of A Λ, and it commutes with the dynamics: ρ (Λ) U α(λ) t = α (Λ) t ρ (Λ) U (53) for all U SU(2) and for all t R. As a consequence of the SU(2) invariance of the dynamics, we also get the invariance of the finite volume Gibbs state: satisfies ω (Λ) β (A) = Tr e βhλ A (54) Tr e βhλ ω (Λ) β ρ (Λ) U = ω(λ) β (55) A major question in the statistical mechanics of the Heisenberg model is again the question of spontaneous symmetry breaking. As was the case in the Ising model, the symmetry will be broken only in sufficiently high dimension and at sufficiently low temperatures (large β). This topic will occupy the next several lectures. 8

Statistical Mechanics, Math 266: Week 5 Notes February 2, 200 Quantum Spin Systems: Existence of the Dynamics and Lieb-Robinson Bounds Consider a relatively general class of systems (to keep notations simple, not the most general one can handle with the same arguments). Λ Z d, for all x Z d, H x C n () H Λ = x Λ H x, A Λ = B(H Λ ), (2) A Λ A Λ2, if Λ Λ 2 (3) A loc = Λ A Λ, A = A loc (4) H Λ = X Λ Φ(X) (5) For all X, Φ(X) = Φ(X) A X (6) α (Λ) t (A) = e ithλ Ae ithλ (7) Claim.. Under suitable conditions on Φ, there exist α t on A, strongly continuous one-parameter group of automorphisms on A, such that for all A A loc lim α (Λ) Λ Z d t (A) = α t (A) (8) This limit will be a problem if there are more and more terms in S x (Λ) = {X Λ x X, Φ(X) 0} (9) and these elements Φ(X) are not decreasing in size fast enough as X increases. The limit (8) typically should not be expected to be trivial. supp α (Λ) t (A) = Λ (0) Very often, Φ is of finite range. This means that Φ(X) = 0 if diam X > R for some given range R, and where diam X = max{d(x, y) x, y X} and

d(x, y) = x y = d i= x i y i in Z d. (Other definitions are possible). But in some cases pair interactions (or even higher-order interactions) that act at long distance need to be considered. e.g., magnetic dipole-dipole interaction decays as r. 6 Define F (r) = for r 0 and define a norm on the interactions Φ by (+r) d+ In particular, Φ = max x,y Z d X x,y X has to decay at least as ( x y +) d+. Remark.. X Z d x,y X Φ(X) F ( x y ) () Φ(X) (2) F ( x ) = C (3) ( x + ) d+ x Z d x z Z d F ( x z )F ( z y ) C µ F ( x y ) 2 d+ CF ( x y ) (4) where C F ( x z )F ( z y ) µ = sup x,y z Z d F ( x y ) The proof of the remark is left as a homework assignment. For all µ > 0 we define F µ (r) = e µr F (r) (5) and observe that the inequalities (3) and (4) are also satisfied when we replace F with F µ. But Φ µ defined with F µ instead of F is of course a stronger norm with increasing µ. Theorem. (Lieb-Robinson Bound). Suppose µ > 0 such that Φ µ < +. Then there exists numbers C and v such that for all Λ, for all X,Y Λ, and for all A A X, B A Y, and for all t R, we have [α (Λ) t (A), B] 2 A B F µ (d(x, y))e C v t (6) µ x X y Y where F (d(x, y)) min( X, Y )Ce µd(x Y ) (7) x X y Y and C = Φ µ, v = 2 C µ Φ µ 2

Proof. Let f(t) be the quantity [α (Λ) t, B]. Then f (t) = [i[h Λ, α (Λ) t (A)], B] (8) Define S Λ (X) = {Z Λ Z X, Φ(Z) 0} so that f (t) = i[ = i Z S Λ(X) Z S Λ(X) [α (Λ) t [[α (Λ) t eh(t) (Φ(X)), α (Λ) t (A)], B] (9) (A), B], α (Λ) t { }} { = i[ (Φ(Z)), f(t)] i Z S Λ(X) α (Λ) t where (2) uses the Jacobi identity. (Φ(X))] [[B, α (Λ) t Z S Λ(X) [α (Λ) t (Φ(Z))], α (Λ) t (A)] (20) (A), [α (Λ) t (Φ(Z)), B]] (2) Lemma.. For all t R, let H(t) = H(t) B(H Λ ) and w(t) B(H Λ ). Then the unique solution f(t) with initial condition f(0) of is given by where α t is the map solving i.e., g(t) = α t (g(0)). Using the lemma, f (t) = i[ H(t), f(t)] + w(t) (22) f(t) = α t (f(0) + t 0 α s (w(s))ds) (23) g (t) = i[ H(t), g(t)] (24) t t f(t) = f(0) + α s (w(s))ds f(0) + ds w(s) (25) 0 0 t [A, B] + 2 A ds [α s (Λ) (Φ(Z)), B] (26) Z S Λ(X) 0 Now define It follows now that C B (X, t) = sup A A X A 0 [α (Λ) t (A), B] A (27) f(t) A C B (X, t) (28) 3

and C B (X, t) C B (X, 0) + 2 Φ(Z) t Z S 0 Λ(X) dsc B (Z, s) (29) This inequality can be iterated. Note that C B (Z, 0) 2 B δ Z (Z ) for B A Z (30) where δ Z (Z ) = { if Z Z 0 if Z Z = (3) C B (X, t) C B (X, 0) + 2 2 B δ Y (X) + 2 + 2 B 2 B Z S Λ(X) Z S Λ(X) Z S Λ(X) Φ(Z ) Φ(Z ) t 0 ds C B (Z, 0) + 2 Z 2 S Λ(Z ) s Φ(Z 2 ) 0 (32) 2t B δ Y (Z ) Φ(Z ) + (33) Z 2 S Λ(Z ) Φ(Z 2 ) δ Y (Z 2 ) t 0 ds s 0 ds 2 (34) (2t) n a n (35) n! n=0 where a n = Z S Λ(X) Z δ n S Λ(Z n ) Y (Z n ) n i= Φ(Z i). Now, we estimate these coefficients: a = δ Y (Z) Φ(Z) (36) Z S Λ(X) y Y Z S Λ(X) y Z Φ(Z) (37) F µ (d(x, y)) Φ(Z) (38) F µ (d(x, y)) y Y x X Z x,y Z Φ µ F µ (d(x, y)) (39) y Y x X (40) ds 2 C B (Z 2, s 2 ) 4

a 2 = Z S Λ(X) Z 2 S Λ(Z ) y Y Z S Λ(X) Φ µ δ Y (Z 2 ) Φ(Z ) Φ(Z 2 ) (4) Φ(Z ) Φ(Z 2 ) (42) z Z Z 2 z,y Z 2 Φ(Z ) F µ (d(x, z)) (43) F µ (d(z, y)) F µ (d(x, z)) y Y z Λ x X Z x,z Z C µ Φ 2 µ F µ (d(x, y)) (44) y Y x X Proceeding in this way, one finds that for all n, a n Φ n µcµ n F µ (d(x, y)) (45) and this implies that v = 2C Φ µ. x X y Y 5

Statistical Mechanics, Math 266: Week 6 Notes February 9 and, 200 The Existence of Infinite System Dynamics Corollary.. For X Λ, define for δ > 0 X(t, δ) = {x Λ d(x, X) v t + δ} Then there exists C such that for all A A X and for all t R, there exists A t (δ) A X(t,δ) such that α (Λ) t (A) A t (δ) C A e µδ Proof. The proof follows from the Lieb-Robinson bound and the following lemma: Lemma.. Let X Λ and define X c = Λ \ X. If A A Λ satisfies [A, B] ε B for all B A X c. Then there exists A ε A X such that A A ε ε. Theorem. (Existence of Infinite System Dynamics). Under the conditions described above, which imply Lieb-Robinson bounds for α (Λ) t uniformly in Λ, there exists a strongly continuous one parameter group of automorphisms, α t on A = A loc, such that for all A A loc, lim α (Λ) Λ Z d t (A) α t (A) = 0 The convergence is uniform on compact sets in t. Proof.. The first step in the proof is to establish that for any increasing, absorbing sequence of finite sets, Λ n Z d, for fixed A and t, (α (Λn) t ) n is a Cauchy sequence in A. Let Λ n Λ m, then α (Λn) t (A) α (Λm) t (A) = α s (Λn) (A) = = t 0 t 0 α (Λm) t s ds d ds (α(λn) s ds iα (Λn) s t 0 α (Λm) t s (A) ([H Λn, α (Λm) t s (A)]) iα (Λn) s (α (Λm) t s ([H Λm, A]))

Hence α (Λn) t (A) α (Λm) t (A) t 0 t since F µ is summable over Z d. 0 ds [H Λn H Λm, α (Λm) t s (A)] ds C A e µv s C A e µv s Y Λ n Y (Λ n\λ m) Y Λ n x X z Y C A e µv s Φ µ C A e µv s Φ µ [Φ(Y ), α s (Λm) (A)] Φ(Y ) F µ (d(x, z)) Φ(Y ) F µ (d(x, z)) y Λ n\λ m Y x X z Y y Y y Λ n\λ m z Z d x X y Λ n\λ m F µ (d(x, y)) 0 2. The limit α t (A) is independent of the chosen sequence of (Λ n ). F µ (d(x, z))f µ (d(y, z)) 3. α t (A) defines an automorphism of A, α t satisfies the group property, and α t (A) is continuous in t for all A. 2 Positive Semigroups on A In this section, we will not use that the Hilbert space H is finite-dimensional. The results, and the proofs given, are valid for arbitrary Hilbert spaces. In fact, A can be replaced by an arbitrary C -algebra A, i.e., a subalgebra of A that is closed under taking and adjoints, and closed in the operator norm topology. It is convenient to assume that l A. For concreteness, you may still want to think of A as M n (C), but the finite-dimensionality will not be used. Let X A. Define L X B(A), by L X (A) = X AX 2 (X XA + AX X) () Clearly, as L X (A) 2 X 2 A, L X is a bounded linear transformation on the Banach space A. Therefore, we can define [ γ t (A) = e tl X (A) = id + L X + ] 2! (L X) 2 + (A) The family of transformations, (γ t ) t 0, is a semigroup and L X is called the generator of γ t. If the semigroup is differentiable in the sense that d dt γ t(a) 2

exists for all A A, then the generator is the linear map L defined by the derivative of the semigroup in t = 0: L(A) = d dt γ t(a) t=0 In fact, for semigroups of this kind one can show that continuity in t = 0 implies differentiability. We will show that, if the generator is of the form given in (), then the semigroup has the following following properties: γ t (l) = l, and γ t (A) is positive definite if A is. A map γ t with this property is called a positive map. We will prove these properties below. From these properties it follows that, for all states ω, and all t 0, there is a state ω t given by ω t (A) = ω(γ t (A)) Although γ t is a well-defined bounded linear transformation on A for all t R, the properties that make it useful only hold for t 0. E.g., the norm of the transformation γ t diverges as t. Even more importantly, its positivity only holds for t 0. So, although we have curves ω t in the space of the linear functionals defined for all t R, we will only use t 0, as ω t may cease to be state for t < 0. In infinite-dimensional situations one often considers γ t with an unbounded generator L, in which case e tl is often not defined for t < 0. Proposition 2.. Let X A and let L : A A, be defined by L(A) = X AX 2 (X XA + AX X) Then, the following properties hold: (i) For all A A, L(A ) = L(A), and γ t (A ) = γ t (A), for all t 0. (ii) For all A A, L(A A) L(A )A + A L(A). (iii) The semigroup γ t with generator L is unit preserving (also called unital), i.e., γ t (l) = l, for all t 0, (iv) The semigroup γ t with generator L is positive, and satisfies, for all A A γ t (A A) γ t (A )γ t (A) = γ t (A) γ t (A) 0, for all t 0, (v) For every t 0, γ t is a contraction, i.e., for all A A γ t (A) A Proof. (i) This follows directly from the definition of L and the properties of. (ii) This follows from the easy-to-verify identity L(A A) L(A )A A L(A) = (XA AX) (XA AX) (iii) This follows immediately form L(l) = 0. (iv) We will first prove that γ t (A A) 0, for all A A, and then use that result to get the stronger property claimed in (iv). 3