Density Operators and Ensembles

qitd422 Density Operators and Ensembles Robert B. Griffiths Version of 30 January 2014 Contents 1 Density Operators 1 1.1 Introduction.............................................. 1 1.2 Partial trace.............................................. 1 1.3 Interpretation and use of density operators............................ 2 2 Classical and quantum correlations 3 3 Ensembles 4 3.1 Definition............................................... 4 3.2 Density operators of ensembles................................... 5 4 Purification of a density operator 5 5 Time development of density operators 5 Reference: CQT = Consistent Quantum Theory by Griffiths (Cambridge, 2002) 1 Density Operators 1.1 Introduction Notation. What will here be called density operators are often (as in CQT) referred to as density matrices. Alas, a density matrix is not always a matrix, and has nothing whatsoever to do with density in the usual meaning of that term. Thus the term density operator, which only suffers from the second of these defects, is to be preferred. Mathematical definition: any operator ρ on the Hilbert space H which is positive (semidefinite) and has trace 1 is a density operator. Positive: ρ = ρ is Hermitian, and none of its eigenvalues is negative. If one eigenvalue is 1 and the rest are zero, ρ = ψ ψ for some normalized ψ, and is called a pure state. Otherwise it represents a mixed state. The density operator for a single qubit can always be written as a matrix in the standard basis in the following form ρ = 1 ( ) I +r σ = 1 ( ) 1+z x iy, (1) 2 2 x+iy 1 z where r = (x,y,z) is a real three-dimensional vector of length 1, σ = (σ x,σ y,σ z ), the σ s are Pauli matrices, and r σ = xσ x +yσ y +zσ z. Pure states correspond to r = 1, the surface of the Bloch sphere; mixed states to r < 1, the interior of the Bloch sphere (or Bloch ball, if one is a mathematician). 1.2 Partial trace Compare notes Measurements, Sec. 3.3 Let ψ be a pre-probability on H a H b, and suppose we are only interested in calculating probabilities for the subsystem b, i.e., those associated with some decomposition of the identity I = I a k Q k (2) 1

where the Q k are projectors on H b and sum to I b. Or perhaps we are interested in the average B of an operator acting on H b. These can be calculated using the reduced density operator ) ρ b := Tr a ( ψ ψ Tr a is the partial trace over H a. Applied to any operator on H a H b it produces some operator on H b. In particular Tr a (A B) = Tr a (A)B, (4) where on the right side Tr a is the ordinary trace of the operator A on the Hilbert space H a. This formula, together with the requirement that Tr a be linear, defines what Tr a does to a general operator on H a H b, since any operator can be written as a sum of products of operators. Tr b, the partial trace over H b is defined in an analogous fashion. Exercise. Show that for a general state (3) ψ = jk c jk a j b k, (5) where { a j } and { b k } are orthonormal bases of H a and H b, (3) yields ρ b = jkk c jk c jk bk b k. (6) Let R be any operator on H a H b and a j b k R a j b k its matrix elements using orthonormal bases { a j } and { b k }. Then Tr a (R) is defined to be the operator whose matrix elements are given by b k Tr a (R) b k = j a j b k R a j b k (7) Exercise. What is the corresponding expression for matrix elements of Tr b (R)? Let {Q k } be a decomposition of the identity I b. For a given pre-probability ψ on H a H b, one can calculate the corresponding probability distribution using either ψ or the reduced density operator ρ b : Pr(Q k ) = ψ I a Q k ψ = Tr b (Q k ρ b ) (8) Exercise. Check that the second equality is correct when ρ b is given by (6) Let C = C be an observable on system b with spectral decomposition C = k c k Q k. (9) Its average can be conveniently written using ρ b : C = ψ I a C ψ = k c k Pr(Q k ) = Tr b (Cρ b ) (10) 1.3 Interpretation and use of density operators Equations (8) and (10) illustrates the principal role of a density operator in quantum mechanics: it serves as a pre-probability from which a probability distribution can be generated by employing a quantum sample space. Thus it is in some sense a quantum analog of a probability distribution in classical statistical mechanics. One should not take the attitude that a density operator represents a specific property of a quantum system; instead, it is used to assign a probability to a set of mutually-exclusive properties. Sometimes, as discussed above, a density operator is obtained by partial trace over a pure quantum state. But sometimes it is simply a guess, or is obtained by fitting parameters to experimental data. 2

The density operator ρ = e βh /Tr(e βh ) (11) used in quantum statistical mechanics for a system in thermal equilibrium, where H is its Hamiltonian and β = 1/k B T the inverse temperature, belongs to this category. The polarization of a beam of spin-half particles used in a scattering experiment can be conveniently expressed using the r in (1), with the parameters x, y, and z determined by experiment. 2 Classical and quantum correlations Classical correlations. Charlie prepares red R and green G slips of paper, inserts them in two opaque envelopes. One goes to Alice in Atlanta, the other to Bob in Boston. Probability that Alice has R or G is 1/2, but if Bob opens his envelope and sees R, he can conclude that Alice s envelope contains G. This is an example of statistical inference based upon the use of a conditional probability: Pr(A=G) and Pr(A=G B=R) are two very different things No mysterious long-range influences, wave function collapse or the like in the classical world. Or the quantum world. Probabilistic model based on Pr(A,B), where A and B can be R or G. We have Marginal probabilities Pr(R,G) = Pr(A=R,B=G) = 1/2 = Pr(G,R); Pr(R,R) = 0 = Pr(G,G). (12) Pr(A=R) = B Pr(A=R, B) = 1/2 = Pr(A=G), (13) and same thing for Pr(B= ). These do not reveal the correlation. The conditional probability Pr(B A) = Pr(A, B)/ Pr(A) reveals the correlations, e.g., Pr(B=R A=R) = 0, Pr(B=G A=R) = 1. (14) Statistical correlations indicate the presence of information: by examining A (and knowing the joint probability distribution) one finds out something about B; in this sense A contains information about B. In the case of a quantum bipartite system H a H b the relation between an entangled ψ used as a pre-probability and the two reduced density operators ρ a and ρ b obtained from it by taking a partial trace is analogous to the relationship of a classical joint probability distribution Pr(A, B) and the marginal distributions Pr(A) and Pr(B). Thusthereduceddensityoperatorρ b canbeusedtofindtheprobabilitydistributionforadecomposition of the identity {Q k } for system b, and ρ a for a decomposition {P j } for system a. However, neither of these density operators contain information about the correlations between systems a and b, whereas the state ψ from which they are obtained will in general contain additional information. As an example, in the case of two qubits in the famous EPR-Bohm singlet state taking partial traces yields the two density operators ψ = ( 01 10 )/ 2 (15) ρ a = I a /2, ρ b = I b /2. (16) These tell us that the given any orthonormal basis for a the probability of each of the two states is 1/2, and the same for any orthonormal basis for b. On the other hand, from (15) one can show that Pr([0] a,[0] b ) = Pr([1] a,[1] b ) = 0, Pr([0] a,[1] b ) = Pr([1] a,[0] b ) = 1/2. (17) Thus, for example if S z = +1/2 for particle a, then S z = 1/2 for particle b. This information is not available in either of the reduced density operators ρ a or ρ b. Exercise. Find a ψ for two qubits such that the reduced density operators are the same as in (16), but S az = S bz, i.e., the z component of spin is the same for the two particles. 3

3 Ensembles 3.1 Definition See notes Probabilities, Sec. 1.4, for the general idea of an ensemble in probability theory. The term is employed in a similar way in quantum mechanics, but often in a way that causes a lot of confusion because a sample space has not been properly defined. Hence it is best, when possible, to avoid using this term for discussions of quantum probabilities. There is, however, a usage in quantum theory which is precisely defined and of some use. Let {p k } be a set of probabilities summing to 1, and { ψ k } a set of normalized states, generally not orthogonal to each other. The ensemble E is then the set of pairs E = {p k, ψ k }, (18) and the intuitive ideas is that the state of the quantum system is ψ k with probability p k. Classical counterpart. Think of a slip of colored paper in an opaque envelope, or a die that has been rolled but is concealed under a cup. One may be able to assign probabilities to the different possibilities even though one does not know the actual state of affairs. In place of {p k, ψ k } it is sometimes more convenient to employ a collection of un-normalized kets: E = { ˇψ k }, ˇψ k = p k ψ k, p k = ˇψ k ˇψ k. (19) It is best to think of such an ensemble not as a set of isolated possibilities existing by themselves, since the { ψ k } need not be orthogonal to each other, and in that case do not represent physically distinct states. Instead, think of them as labeled or tagged states, meaning that with probability p k there exists some distinctive state-of-affairs external to the system described by one of the ψ k, and with which it is correlated. One way to construct an ensemble is to use the expansion of a bipartite pure state ψ in the form ψ = j a j β j, (20) where the a j are an orthonormal basis of H a. Now think of the { β j }, which are not normalized, as forming an ensemble in the sense of (19). Note that the states { a j β j } are orthogonal to each other, whereas (in general) the { β j } are not. Hence one should not think of the β j as distinct states in the absence of the corresponding a j which serve as tags to distinguish them. Another way to generate an ensemble. Charlie has prepared the state ψ k using an appropriate apparatus and a random number generator that prints out k with probability p k. When the printout is 3, Charlie prepares ψ 3 and ships it to Alice in a closed box. Even if ψ 3 is not orthogonal to ψ 2, it is correlated with the record in Charlie s notebook. If we do not know what is written in Charlie s notebook (the tag), it is appropriate to describe the situation by means of an ensemble. Ensembles can be useful conceptual tools in quantum information, but will cause confusion if one forgets that the states have labels or tags that render them distinctive. The phases of states in an ensemble are irrelevant, and thus one sometimes writes E = {p k,ρ k } or {ˇρ k }; ρ k = ψ k ψ k, ˇρ k = ˇψ k ˇψ k = p k ρ k, (21) where the ρ k are pure-state density operators (see below). They could also represent mixed states, thus providing a more general notion of ensemble. It is also sometimes useful to generalize the notion of a quantum ensemble to one in which in place of pure states ρ k = ψ k ψ k the individual ρ k are themselves mixed state density operators. The first expression in (21) makes sense if one sets ˇρ k = p k ρ k. 4

3.2 Density operators of ensembles Given an ensemble (18) or (19), one can always calculate a corresponding density operator ρ = k ˇψ k ˇψ k = k p k ψ k ψ k. (22) Exercise. How do you know that the operator defined in (22) is positive with trace 1? Notice that in the case of the ensemble { β j } corresponding to the expansion (20), the associated density operator in the sense of (22) is the same as the reduced density operator ρ b given in (3). This suggests how one is to interpret the density operator associated with an ensemble: it is a pre-probability appropriate for generating probabilities of the system if one ignores the tags. To use the classical analogy of red and green slips of paper: the density operator for system b is like the marginal probability distribution for the color of the slip of paper in Bob s envelope when no account is taken of its correlation with the color of the slip in Alice s envelope. By contrast, the ensemble with its tags contains additional information not present in the density operator, namely how the quantum state is correlated with the tag. Therefore, one should not confuse the description given by an ensemble with the description provided by a density operator. Either description can serve as a pre-probability, but the latter is a more coarsegrained description than the former, with less information. Coarse-grained descriptions are often more useful than detailed descriptions, as anyone knows who has tried to look up a subject on the Internet. However, their limitations should be recognized. Attempting to use its density operator to discuss how a quantum system is correlated with its environment is to employ exactly the wrong tool for addressing this question. To be sure, it may be an aid to the imagination to suppose that a density operator has been generated from some ensemble via (22); often an ensemble allows one to think about a problem in terms which are more physically intuitive: Let s just suppose that what we have here is with probability 1/3 the state ψ and with probability 1/3... There is no harm in reasoning this way, provided one keeps in mind that the same density operator could have been generated from many different ensembles. 4 Purification of a density operator Sometimes it is convenient to imagine that a density operator is the result of applying a partial trace to an entangled ket Ψ defined on a tensor product of the Hilbert space of interest and a mythical reference system. In the notation of Sec. 1.2, think of H b as the system of interest, and H a as the reference system. This trick is frequently employed in quantum information theory, and is known as purification. If Ψ is expanded in an orthonormal basis of the reference system, as in (20), the result is an ensemble {p k, β k }, with the original density operator equal to the density operator of this ensemble in the sense of (22). Sometimes one thinks of this ensemble as arising by carrying out some measurement on the mythical reference system. Thinking about things in this way in no way commits one to affirming the actual existence of the reference system and the entangled state, or that measurements are actually carried out on the reference system. These are just convenient myths to aid one s thinking. No harm is done if one remembers that measurements on distant systems (mythical or not) do not have any influence on the system of interest, by what is sometimes referred to as the principle of Einstein locality. 5 Time development of density operators If the density operator ρ refers to an isolated system, its unitary time development is give by ρ(t) = T(t,t 0 )ρ(t 0 )T(t 0,t) = T(t,t 0 )ρ(t 0 )T (t,t 0 ). (23) Since T(t,t 0 ) satisfies the Schrödinger equation i T(t,t 0 )/ t = HT(t,t 0 ), (24) 5

differentiating (23) with respect to t shows that ρ(t) satisfies the equation i dρ(t)/dt = [H,ρ], (25) which looks very much like Schrödinger s equation for a ket, except that on the right side there is a commutator in place of H ψ. Exercise. Carry out the steps leading from (23) to (25). Exercise. Suppose that ρ(t) = ψ(t) ψ(t). Show that (25) is a consequence of i (d/dt) ψ(t) = H ψ(t). One way to derive (23) is to suppose that ρ(t) comes from an ensemble {p k, ψ k (t) }, and that for each k ψ k t = T(t,t 0 ) ψ k 0 (26) is a solution of Schrödinger s equation. It then follows that ρ(t) = p k ψt ψ k t k = ) p k (T(t,t 0 ) ψ0 ψ k 0 T(t k 0,t) = T(t,t 0 )ρ(t 0 )T(t 0,t). (27) k k Suppose a system consists of two subsystems a and b that do not interact with each. That is, the total Hamiltonian H on H a H b is the sum H = H a +H b = H a I b +I a H b of separate operators for the two subsystems. Then the unitary time development operator factors, T(t,t ) = T a (t,t ) T b (t,t ). (28) Note that this is true if H a and H b are time-independent, but also if they depend on t. Exercise. Show that (28) is a consequence of H = H a +H b. As a consequence of (28), if Ψ(t) is an entangled ket on H a H b whose unitary time development is given by T(t,t 0 ) Ψ 0, the reduced density operator ) ρ b (t) = Tr a ( Ψ(t) Ψ(t) (29) satisfies (23) with T(t,t 0 ) replaced by T b (t,t 0 ), so once again we arrive at (25), with ρ replaced by ρ b and H by H b. Exercise. Verify that ρ b (t) as defined in (29) satisfies (23) with T(t,t 0 ) replaced by T b (t,t 0 ). If an isolated composite system consists of subsystems which interact with each other, it is still possible to assign density operators as a function of time to the subsystems using (29) and its analogs. However, these reduced density operators no longer satisfy any simple time development equation, even if unitary time development applies to the system as a whole. In some cases it is useful to introduce approximate differential or differential-integral equations for the time development of the density operator of a subsystem, but this lies outside the scope of these notes. It is worth keeping in mind that a density operator as a function of time provides only limited information about the stochastic time development of a quantum system. It is analogous to the singletime probability for a classical system discussed in CQT Sec. 9.2, and lacks information about correlations between different times. More refined stochastic descriptions can be constructed, but one needs to use histories, and ρ(t) is not the appropriate tool for assigning probabilities to any but the simplest (two-time) histories. 6