INFORMATION THEORY AND STATISTICAL MECHANICS REVISITED

arXiv:1604.08739v1 [math-ph] 29 Apr 2016

JIAN ZHOU

Abstract. We derive Bose-Einstein statistics and Fermi-Dirac statistics by the Principle of Maximum Entropy applied to two families of entropy functions different from the Boltzmann-Gibbs-Shannon entropy. These entropy functions are identified with special cases of a modified Naudts φ-entropy.

In a pioneering work sixty years ago, Jaynes [4] initiated the study of statistical physics by information theory. He expounded the Principle of Maximum Entropy and applied it to the Boltzmann-Gibbs-Shannon entropy to derive Boltzmann-Gibbs statistics. Many different entropy functionals have been introduced, studied and used in many areas where statistics has been applied. But in physics, only the Boltzmann-Gibbs-Shannon entropy has been considered as physical. Tsallis [7] has proposed the q-entropy as another kind of physical entropy functional. Naudts [6] introduced φ-exponentials, φ-logarithms and φ-entropy as a further generalization related to the q-entropy. On [6, p. 94] he wrote:

One of the conclusions is that the q-deformed exponential family occurs in a natural way within the context of classical mechanics. The more abstract generalisations discussed in the final chapters may seem less important from a physics point of view. But they have been helpful in elucidating the structure of the theory of generalised exponential families.

We will show that such generalizations are indeed of physical importance, in particular, in understanding the Bose-Einstein statistics and the Fermi-Dirac statistics by the Principle of Maximum Entropy.

We will first consider the subjective statistical physics of free bosons and free fermions. More precisely, we will derive Bose-Einstein statistics and Fermi-Dirac statistics from the Principle of Maximum Entropy, not for the Boltzmann-Gibbs-Shannon entropy as Jaynes did for the Boltzmann-Gibbs statistics, but instead for two different entropy functions. We quote here Tsallis [7, p.
4]: Indeed, the physically important entropy - a crucial concept - is not thought as being an universal functional that is given once for ever, but it rather is a delicate and powerful concept to be carefully constructed for classes of
systems.

We will actually consider two different families interpolating the Bose-Einstein and Fermi-Dirac statistics. They give us two families of entropy functions. Next we will present a unified understanding of the Boltzmann-Gibbs weight function, the Bose-Einstein weight function and the Fermi-Dirac weight function from the point of view of natural parameters of exponential families. Finally, the three entropy functions and the three weight functions are unified in terms of special cases of the generalized logarithmic functions and generalized exponential functions developed by Naudts, respectively. Some modifications are introduced for this purpose. We also briefly treat the case of fractional exclusion statistics [3, 8]. In a subsequent work we will treat the case of general statistics interpolating the Bose-Einstein and Fermi-Dirac statistics.

1. Unified Derivation of Boltzmann-Gibbs Statistics, Bose-Einstein Statistics and Fermi-Dirac Statistics from the Principle of Maximum Entropy

In this section we will introduce the statistical manifolds that describe a single particle in finitely many states. Then we will use suitable entropy functions on these manifolds and the Principle of Maximum Entropy to give a unified derivation of three important physical statistics that describe noninteracting particles.

1.1. The statistical manifold. Suppose that one is performing a test with one observable $E$ with finitely many outcomes $\{E_1, \dots, E_n\}$, $E_1 < \cdots < E_n$. Suppose that each outcome has a positive probability $p_i$ of appearance:

(1)  $p(E = E_i) = p_i, \quad p_i > 0, \quad i = 1, \dots, n;$

these probabilities are required to sum up to one:

(2)  $p_1 + \cdots + p_n = 1.$

Such a distribution is called a categorical distribution in statistics. Putting all the possible probability distributions together, one gets an open $(n-1)$-simplex:

(3)  $P_n = \{(p_1, \dots, p_n) \in \mathbb{R}^n \mid p_1 + \cdots + p_n = 1,\ p_i > 0,\ i = 1, \dots, n\}.$

It is an open $(n-1)$-dimensional manifold. We will take $p_1, \dots, p_{n-1}$ as coordinates on $P_n$, and express $p_n$ as a function in these coordinates:

(4)  $p_n = 1 - p_1 - \cdots - p_{n-1}.$
As in Jaynes [4] one actually works with some submanifolds of $P_n$, denoted by $P_n(E_1, \dots, E_n; E)$ and defined by the following constraint:

(5)  $p_1 E_1 + \cdots + p_n E_n = E,$

where $E$ satisfies

(6)  $E_1 = \min\{E_1, \dots, E_n\} \le E \le \max\{E_1, \dots, E_n\} = E_n.$

1.2. The entropy functions. Recall that the Boltzmann-Gibbs-Shannon entropy function is defined by:

(7)  $H_{BGS} = -\sum_{i=1}^{n} p_i \log p_i.$

We now introduce the following two entropy functions:

(8)  $H_{BE} = \sum_{i=1}^{n} \big( (p_i + 1) \ln(p_i + 1) - p_i \ln p_i \big),$

(9)  $H_{FD} = -\sum_{i=1}^{n} \big( (1 - p_i) \ln(1 - p_i) + p_i \ln p_i \big).$

The motivation for the introduction of these functions will be elaborated elsewhere. Here let us just say that $H_{BE}$ can be thought of as a discrete version of [9, (4)]. We also introduce a family of entropy functions:

(10)  $H_\epsilon = \sum_{i=1}^{n} \Big( \frac{1}{\epsilon} (1 + \epsilon p_i) \ln(1 + \epsilon p_i) - p_i \ln p_i \Big).$

Then we have

(11)  $H_1 = H_{BE}, \quad H_0 = H_{BGS} + 1, \quad H_{-1} = H_{FD},$

where $H_0$ is understood as the limit $\epsilon \to 0$.

1.3. Principle of Maximum Entropy. We now show that applying the Principle of Maximum Entropy to the entropy functions of the last subsection on the statistical manifold $P_n(E_1, \dots, E_n; E)$ gives us a unified derivation of the Boltzmann-Gibbs statistics, Bose-Einstein statistics, Fermi-Dirac statistics, and more generally, Acharya-Swamy statistics [1].

Theorem 1.1. On $P_n(E_1, \dots, E_n; E)$, the entropy function $H_\epsilon$ achieves its maximum at the points:

(12)  $p_i(\epsilon) = \frac{1}{e^{a + b E_i} - \epsilon}, \quad i = 1, \dots, n,$

for some constants $a$ and $b$.
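Before the proof, the statement of Theorem 1.1 can be sanity-checked numerically. The following is a minimal sketch in plain Python and is not part of the original derivation: the energy levels, the value of b, the step size and every function name are illustrative assumptions. It fixes b, solves for a by bisection so that the weights (12) sum to one, and then checks that small moves along directions preserving both constraints (2) and (5) lower $H_\epsilon$.

```python
import math
import random

def entropy(p, eps):
    """H_eps(p) from (10); eps = 0 uses the limit (1/eps)(1+eps*x)ln(1+eps*x) -> x."""
    total = 0.0
    for x in p:
        first = x if eps == 0 else (1.0 + eps * x) * math.log(1.0 + eps * x) / eps
        total += first - x * math.log(x)
    return total

def weights(a, b, energies, eps):
    """Candidate maximiser p_i = 1 / (exp(a + b*E_i) - eps) from (12)."""
    return [1.0 / (math.exp(a + b * e) - eps) for e in energies]

def solve_a(b, energies, eps):
    """Bisection on a: the total weight decreases in a, so find where it equals 1."""
    low = -50.0
    if eps > 0:
        # every denominator exp(a + b*E_i) - eps must stay positive
        low = math.log(eps) - min(b * e for e in energies) + 1e-9
    high = 50.0
    for _ in range(200):
        mid = 0.5 * (low + high)
        if sum(weights(mid, b, energies, eps)) > 1.0:
            low = mid
        else:
            high = mid
    return 0.5 * (low + high)

def tangent_direction(energies, rng):
    """Random unit vector d with sum(d_i) = 0 and sum(d_i * E_i) = 0."""
    def dot(u, v):
        return sum(x * y for x, y in zip(u, v))

    def unit(u):
        norm = math.sqrt(dot(u, u))
        return [x / norm for x in u]

    u1 = unit([1.0] * len(energies))
    c = dot(energies, u1)
    u2 = unit([e - c * x for e, x in zip(energies, u1)])
    d = [rng.gauss(0.0, 1.0) for _ in energies]
    for u in (u1, u2):  # project out the two constraint normals
        c = dot(d, u)
        d = [x - c * y for x, y in zip(d, u)]
    return unit(d)

def is_constrained_maximum(eps, energies=(0.0, 1.0, 2.0, 3.0), b=0.7,
                           step=1e-3, trials=200):
    """True if every sampled feasible perturbation strictly lowers H_eps."""
    rng = random.Random(0)  # fixed seed, so the check is reproducible
    p = weights(solve_a(b, energies, eps), b, energies, eps)
    h_max = entropy(p, eps)
    for _ in range(trials):
        d = tangent_direction(energies, rng)
        q = [x + step * dx for x, dx in zip(p, d)]
        if entropy(q, eps) >= h_max:
            return False
    return True

for eps in (1.0, 0.0, -1.0):  # Bose-Einstein, Boltzmann-Gibbs, Fermi-Dirac
    print(eps, is_constrained_maximum(eps))
```

Sampling random directions only probes the maximum locally; the negative definite Hessian computed in the proof is what makes the critical point a maximum.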
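Proof. By the method of Lagrange multipliers, consider the function

$F_\epsilon = \sum_{i=1}^{n} \Big( \frac{1}{\epsilon} (1 + \epsilon p_i) \ln(1 + \epsilon p_i) - p_i \ln p_i \Big) + a \Big( 1 - \sum_{i=1}^{n} p_i \Big) + b \Big( E - \sum_{i=1}^{n} p_i E_i \Big).$

One easily gets:

(13)  $\frac{\partial F_\epsilon}{\partial p_i} = \ln \frac{1 + \epsilon p_i}{p_i} - a - b E_i, \quad i = 1, \dots, n,$

so the critical point where $\frac{\partial F_\epsilon}{\partial p_i} = 0$ for all $i = 1, \dots, n$ is given by

(14)  $p_i(\epsilon) = \frac{1}{e^{a + b E_i} - \epsilon}.$

The entries of the Hessian matrix of $F_\epsilon$ are given by:

(15)  $\frac{\partial^2 F_\epsilon}{\partial p_i \partial p_j} = -\frac{\delta_{ij}}{p_i (1 + \epsilon p_i)},$

so the Hessian is clearly negative definite.

2. Weight Functions as Inverse Functions of Natural Parameters of Exponential Families

In this section we understand the weight functions

(16)  $p_\epsilon(E) = \frac{1}{e^{a + b E} - \epsilon}$

as inverse functions of natural parameters of exponential families.

2.1. Exponential family. In statistics, an exponential family of probability densities is an $n$-dimensional model $S = \{p_\theta\}$ of the form

(17)  $p(x; \theta) = \exp\Big[ C(x) + \sum_{i=1}^{n} \theta_i T_i(x) - \psi(\theta) \Big].$

The parameters $\{\theta_i\}$ are called the natural parameters, and the function $\psi(\theta)$ is determined by the normalization condition

(18)  $\int p(x; \theta)\, dx = 1,$

and so it is given by:

(19)  $\psi(\theta) = \log \int \exp\Big[ C(x) + \sum_{i=1}^{n} \theta_i T_i(x) \Big] dx.$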
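Formulas (17)-(19) can be made concrete on a finite sample space. The sketch below is illustrative only: it assumes the counting measure in place of $dx$, a single statistic $T(x) = x$, $C(x) = 0$, and sample values and a parameter chosen arbitrarily; the function names are hypothetical. It computes $\psi(\theta)$ as a log-sum-exp, checks the normalization (18), and compares a finite-difference derivative of $\psi$ with the mean of $T$, which is a standard property of exponential families.

```python
import math

def log_partition(theta, statistics, base, sample_space):
    """psi(theta) = log sum_x exp(C(x) + sum_i theta_i T_i(x)), as a stable log-sum-exp."""
    exponents = [
        base(x) + sum(t * T(x) for t, T in zip(theta, statistics))
        for x in sample_space
    ]
    top = max(exponents)
    return top + math.log(sum(math.exp(e - top) for e in exponents))

def density(x, theta, statistics, base, sample_space):
    """p(x; theta) = exp(C(x) + sum_i theta_i T_i(x) - psi(theta)), as in (17)."""
    psi = log_partition(theta, statistics, base, sample_space)
    return math.exp(base(x) + sum(t * T(x) for t, T in zip(theta, statistics)) - psi)

# Assumed toy data: four levels, one statistic T(x) = x, C(x) = 0.
levels = [0.0, 1.0, 2.0, 3.0]
statistics = [lambda x: x]
base = lambda x: 0.0
theta = [-0.7]

probabilities = [density(x, theta, statistics, base, levels) for x in levels]
total = sum(probabilities)                      # normalization (18)
mean_T = sum(p * x for p, x in zip(probabilities, levels))

# Central finite difference of psi with respect to theta.
h = 1e-6
slope = (log_partition([theta[0] + h], statistics, base, levels)
         - log_partition([theta[0] - h], statistics, base, levels)) / (2 * h)

print(total, mean_T, slope)
```

With $\theta = -\beta$ this toy model is a distribution of the Gibbs form $\exp(G - \beta E)$ on four energy levels, with $G = -\psi$.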
Recall that when Gibbs [2] introduced the canonical ensemble in 1901 he postulated a distribution of the form

(20)  $p(E) = \exp(G - \beta E),$

where $G$ is a normalization constant and where the control parameter $\beta = \frac{1}{kT}$ is the inverse temperature. This is an example of an exponential family.

2.2. Boltzmann-Gibbs weight functions as inverse natural parameters for the categorical distribution. The categorical distribution is an example of an exponential family. First one can rewrite it in the following form:

(21)  $p = p_1^{[E = E_1]} \cdots p_n^{[E = E_n]},$

where $[E = E_i]$ is the indicator function that equals one when the energy level is $E_i$, and zero otherwise. Note

(22)  $\log p = \sum_{i=1}^{n} [E = E_i] \ln p_i = -\sum_{i=1}^{n} [E = E_i] \ln \frac{1}{p_i}.$

So by comparing with (17), one can take $T_i = -[E = E_i]$, and the natural parameters can be taken to be:

(23)  $\eta_i = \ln \frac{1}{p_i},$

and so

(24)  $p_i = e^{-\eta_i}.$

This gives us the Boltzmann-Gibbs weight function when we take $\eta_i = (a + b E_i)$.

2.3. Fermi-Dirac weight function as inverse natural parameter for the Bernoulli distribution. The Fermi-Dirac weight function can be interpreted as the inverse function of the natural parameter of the Bernoulli distribution. By Pauli's Exclusion Principle, the outcome of observing a free fermion at a fixed state is like the toss of a coin: there can only be 0 or 1 particle at this state. Suppose the probability is given by:

(25)  $P(X = 1) = p, \quad P(X = 0) = 1 - p.$

The distribution can be written as

(26)  $P(X = x) = p^x (1 - p)^{1 - x}.$
This is called the Bernoulli distribution in statistics. This is also an example of exponential families:

(27)  $\log P = -x \ln \frac{1 - p}{p} + \ln(1 - p).$

The natural parameter is given by:

(28)  $\eta = \ln \frac{1 - p}{p},$

and so the inverse function is given by:

(29)  $p = \frac{1}{e^{\eta} + 1}.$

This is the Fermi-Dirac weight function when we take $\eta = (a + bE)$.

2.4. The Acharya-Swamy weight function for $\epsilon < 0$ as inverse natural parameter for the Bernoulli distribution. In the case of $\epsilon < 0$, consider the probability distribution given by:

(30)  $P(X = x) = \Big( \frac{p}{1 + (1 + \epsilon) p} \Big)^{x} \Big( \frac{1 + \epsilon p}{1 + (1 + \epsilon) p} \Big)^{1 - x},$

supported on the set $\{0, 1\}$. This is a curved Bernoulli distribution. This is also an example of exponential families:

(31)  $\log P = -x \ln \frac{1 + \epsilon p}{p} + \ln \frac{1 + \epsilon p}{1 + (1 + \epsilon) p}.$

The natural parameter is given by:

(32)  $\eta = \ln \frac{1 + \epsilon p}{p},$

and so the inverse function is given by:

(33)  $p = \frac{1}{e^{\eta} - \epsilon}.$

This gives the Acharya-Swamy weight function for $\epsilon < 0$ when we take $\eta = (a + bE)$.

2.5. The Bose-Einstein weight function as inverse natural parameter of the geometric distribution. Similarly, the Bose-Einstein weight function can be interpreted as the inverse function of natural parameter of the geometric distribution. The number of free bosons at a fixed state can be any nonnegative integer $n$. Suppose the probability is given by:

(34)  $P(X = n) = \frac{p^n}{(1 + p)^{n + 1}}, \quad n = 0, 1, 2, \dots$
This is called the geometric distribution in statistics. Since

(35)  $\ln P(X = x) = \ln \frac{p^x}{(1 + p)^{x + 1}} = -x \ln \frac{1 + p}{p} + \ln \frac{1}{1 + p},$

one sees that it is an exponential family with natural parameter:

(36)  $\eta = \ln \frac{1 + p}{p},$

with inverse function:

(37)  $p = \frac{1}{e^{\eta} - 1}.$

This is the Bose-Einstein weight function when we take $\eta = (a + bE)$.

2.6. The Acharya-Swamy weight function for $\epsilon > 0$ as inverse natural parameter for the geometric distribution. Consider the probability distribution given by:

(38)  $P(X = n) = \frac{\big( p / (1 + (\epsilon - 1) p) \big)^{n}}{\big( (1 + \epsilon p) / (1 + (\epsilon - 1) p) \big)^{n + 1}}, \quad n = 0, 1, 2, \dots$

This is a curved geometric distribution. Since

(39)  $\ln P(X = x) = -x \ln \frac{1 + \epsilon p}{p} + \ln \frac{1 + (\epsilon - 1) p}{1 + \epsilon p},$

one sees that it is an exponential family with natural parameter:

(40)  $\eta = \ln \frac{1 + \epsilon p}{p},$

with inverse function:

(41)  $p = \frac{1}{e^{\eta} - \epsilon}.$

This is the Acharya-Swamy weight function for $\epsilon > 0$ when we take $\eta = (a + bE)$.

3. Bose-Einstein Statistics and Fermi-Dirac Statistics as Generalized Statistical Physics

The discussion of exponential families in the last section serves as a psychological vehicle that takes us to the notion of generalized exponential families developed by Naudts [6], which generalizes the q-exponential families of Tsallis [7]. We first recall the φ-logarithm function, the φ-exponential function and the φ-entropy function, then we use suitable modifications of them to study $H_\epsilon$ and $p_\epsilon$.
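Before turning to the deformed functions, the identifications of Section 2 can be checked numerically. The sketch below is an illustration under assumed values of p and ε, with hypothetical function names. It encodes the natural parameter $\eta = \ln((1 + \epsilon p)/p)$ and its inverse $1/(e^{\eta} - \epsilon)$, then verifies for the curved Bernoulli distribution (30) and the curved geometric distribution (38) that the probabilities sum to one and that the log-ratio of consecutive probabilities equals $-\eta$.

```python
import math

def natural_parameter(p, eps):
    """eta = ln((1 + eps*p) / p), covering (23), (28), (32), (36) and (40)."""
    return math.log((1.0 + eps * p) / p)

def weight(eta, eps):
    """Inverse map p = 1 / (exp(eta) - eps), covering (24), (29), (33), (37), (41)."""
    return 1.0 / (math.exp(eta) - eps)

def curved_bernoulli(x, p, eps):
    """Distribution (30) on x in {0, 1}; eps = -1 is the Bernoulli distribution."""
    d = 1.0 + (1.0 + eps) * p
    return (p / d) ** x * ((1.0 + eps * p) / d) ** (1 - x)

def curved_geometric(n, p, eps):
    """Distribution (38) on n = 0, 1, 2, ...; eps = 1 is the geometric distribution."""
    d = 1.0 + (eps - 1.0) * p
    return (p / d) ** n / ((1.0 + eps * p) / d) ** (n + 1)

def check(p, eps, terms=400):
    """Return (total probability, log-ratio + eta, round-trip error) for one (p, eps)."""
    eta = natural_parameter(p, eps)
    if eps < 0:
        dist, support = curved_bernoulli, range(2)
    else:
        dist, support = curved_geometric, range(terms)  # truncated series
    total = sum(dist(x, p, eps) for x in support)
    log_ratio = math.log(dist(1, p, eps) / dist(0, p, eps))
    return total, log_ratio + eta, weight(eta, eps) - p

for eps in (-1.0, -0.5, 0.5, 1.0):
    print(eps, check(0.3, eps))
```

Each printed triple should be close to (1, 0, 0); the geometric sums are truncated, which is harmless here because the ratio $p/(1 + \epsilon p)$ is well below one for the assumed values.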
3.1. The φ-logarithm. Fix a strictly positive non-decreasing function $\phi(u)$, defined on the positive numbers $(0, +\infty)$. It can be used to define a deformed logarithm by

(42)  $\ln_\phi(u) = \int_1^u \frac{dv}{\phi(v)}, \quad u > 0.$

It satisfies $\ln_\phi(1) = 0$ and

(43)  $\frac{d}{du} \ln_\phi(u) = \frac{1}{\phi(u)}.$

The natural logarithm is obtained with $\phi(u) = u$. The Tsallis q-logarithm is obtained with $\phi(u) = u^q$ for $q > 0$.

3.2. The φ-exponential function. The inverse of the function $\ln_\phi(x)$ is called the φ-exponential and is denoted $\exp_\phi(x)$. It can be written in terms of a function $\psi$ on $\mathbb{R}$ defined by:

(44)  $\psi(u) = \begin{cases} \phi(\exp_\phi(u)), & \text{if } u \text{ is in the range of } \ln_\phi, \\ 0, & \text{if } u \text{ is too small}, \\ +\infty, & \text{if } u \text{ is too large}. \end{cases}$

Clearly $\phi(u) = \psi(\ln_\phi(u))$ for all $u > 0$. Then $\exp_\phi$ is defined by:

(45)  $\exp_\phi(u) = 1 + \int_0^u dv\, \psi(v).$

It is clear that $\exp_\phi(0) = 1$ and

(46)  $\frac{d}{du} \exp_\phi(u) = \psi(u).$

3.3. Deduced Logarithms. The deduced logarithm is defined by

(47)  $\omega_\phi(u) = u \int_0^{1/u} dv\, \frac{v}{\phi(v)} - \int_0^{1} dv\, \frac{v}{\phi(v)} - \ln_\phi \frac{1}{u}.$

It satisfies $\omega_\phi(1) = 0$ and

(48)  $\frac{d}{du} \omega_\phi(u) = \int_0^{1/u} dv\, \frac{v}{\phi(v)}.$

Introduce a function:

(49)  $\chi(u) = \Big[ \int_0^{1/u} dv\, \frac{v}{\phi(v)} \Big]^{-1},$

so one can see that the deduced logarithmic function is the χ-logarithm function:

(50)  $\omega_\phi(u) = \ln_\chi(u).$
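The definitions (42) and (47)-(50) can be explored numerically. The following sketch is illustrative only: it uses a hand-written composite Simpson rule, a tiny lower cutoff in place of 0 to avoid evaluating $v/\phi(v)$ at $v = 0$, and hypothetical function names. It checks that $\phi(u) = u$ reproduces the natural logarithm, that $\phi(u) = u^q$ gives the elementary integral $(u^{1-q} - 1)/(1 - q)$, and that the deduced logarithm (47) agrees with the χ-logarithm (50) for the choice $\phi(u) = u(1 + u)$.

```python
import math

def simpson(f, a, b, n=200):
    """Composite Simpson rule with n (even) subintervals; also valid when b < a."""
    h = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3.0

def ln_phi(u, phi):
    """Deformed logarithm (42): integral of 1/phi from 1 to u."""
    return simpson(lambda v: 1.0 / phi(v), 1.0, u)

def moment(t, phi, cutoff=1e-12):
    """Integral of v/phi(v) from 0 to t; the cutoff avoids 0/0 at v = 0."""
    return simpson(lambda v: v / phi(v), cutoff, t)

def deduced_log(u, phi):
    """Deduced logarithm, formula (47)."""
    return u * moment(1.0 / u, phi) - moment(1.0, phi) - ln_phi(1.0 / u, phi)

def ln_chi(u, phi):
    """chi-logarithm (50), with 1/chi(w) given by the integral in (49)."""
    return simpson(lambda w: moment(1.0 / w, phi), 1.0, u)

natural = ln_phi(2.0, lambda v: v)                 # compare with ln 2
q = 0.5
tsallis = ln_phi(2.0, lambda v: v ** q)            # compare with (2**(1-q) - 1)/(1-q)

phi_one = lambda v: v * (1.0 + v)                  # assumed example: phi(u) = u(1 + u)
lhs = deduced_log(2.0, phi_one)
rhs = ln_chi(2.0, phi_one)

print(natural, math.log(2.0))
print(tsallis, (2.0 ** (1.0 - q) - 1.0) / (1.0 - q))
print(lhs, rhs)
```

The nested quadrature in `ln_chi` costs about 40,000 evaluations of the integrand, which is negligible at this size; a library integrator could be swapped in without changing the structure.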
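3.4. The φ-entropy. The φ-entropy is defined by [6]:

(51)  $H_\phi(p) = \sum_{i=1}^{n} p_i \ln_\chi(1 / p_i).$

After a short calculation:

(52)  $H_\phi(p) = \sum_{i=1}^{n} p_i \int_{p_i}^{1} \frac{1}{v^2} \Big[ \int_0^v \frac{u\, du}{\phi(u)} \Big] dv.$

For our purpose, we will define

(53)  $\tilde{H}_\phi(p) = -\sum_{i=1}^{n} p_i \int^{p_i} \frac{1}{v^2} \Big[ \int_0^v \frac{u\, du}{\phi(u)} \Big] dv$

in order to remove some irrelevant constants. Here $\int^{x}$ denotes an antiderivative evaluated at $x$, with the constant of integration dropped. We will call this the modified φ-entropy.

3.5. The entropy functions $H_{BE}$ and $H_{FD}$ as modified φ-entropy. Define the following family of functions parameterized by $\epsilon$:

(54)  $\phi_\epsilon(p) = p (1 + \epsilon p).$

We have

$\tilde{H}_{\phi_\epsilon}(p) = -\sum_{i=1}^{n} p_i \int^{p_i} \frac{1}{v^2} \Big[ \int_0^v \frac{u\, du}{u (1 + \epsilon u)} \Big] dv$
$\quad = -\sum_{i=1}^{n} p_i \int^{p_i} \frac{1}{\epsilon v^2} \ln(1 + \epsilon v)\, dv$
$\quad = \sum_{i=1}^{n} \Big( \frac{1 + \epsilon p_i}{\epsilon} \ln(1 + \epsilon p_i) - p_i \ln p_i \Big)$
$\quad = H_\epsilon.$

In particular, the entropy functions $H_{BE}$, $H_{BGS} + 1$ and $H_{FD}$ are the modified $\phi_\epsilon$-entropy functions for $\epsilon = +1$, $0$ and $-1$ respectively.

3.6. Bose-Einstein weight function and Fermi-Dirac weight function as modified φ-exponential function. Similarly, we define the modified φ-logarithm function by:

(55)  $\widetilde{\ln}_\phi(u) = -\int^{u} \frac{dv}{\phi(v)}, \quad u > 0,$

and define the modified φ-exponential function $\widetilde{\exp}_\phi$ as its inverse function.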
For the function $\phi_\epsilon(p) = p (1 + \epsilon p)$, we have

(56)  $\widetilde{\ln}_{\phi_\epsilon}(u) = -\int^{u} \frac{dv}{v (1 + \epsilon v)} = \ln(1 + \epsilon u) - \ln(u),$

where $\int^{u}$ denotes an antiderivative evaluated at $u$. It follows that

(57)  $u = \widetilde{\exp}_{\phi_\epsilon}(\eta) = \frac{1}{e^{\eta} - \epsilon},$

and so we have

(58)  $p_\epsilon(E) = \widetilde{\exp}_{\phi_\epsilon}(a + bE).$

4. Fractional Exclusion Statistics

In this section, we treat the case of fractional exclusion statistics of Haldane [3]. We refer to [5, Chapter 5] for background. Since the ideas are similar, we will be very brief.

Wu [8] has derived the following formula for the weight function:

(59)  $p_g = \frac{1}{\omega(\eta) + g},$

where the function $\omega(\eta)$ satisfies the functional equation:

(60)  $\omega(\eta)^{g} (1 + \omega(\eta))^{1 - g} = e^{\eta}.$

For the special cases of $g = 0$ and $g = 1$ we have $\omega(\eta) = e^{\eta} - 1$ and $\omega(\eta) = e^{\eta}$, and so we recover the Bose-Einstein and the Fermi-Dirac statistics respectively for $\eta = (a + bE)$. This weight function can be derived by maximizing the following family of entropy functions parameterized by $g$:

(61)  $H_g(p) = (1 + (1 - g) p) \ln(1 + (1 - g) p) - (1 - g p) \ln(1 - g p) - p \ln p,$

under the constraints (2) and (5). This is because

(62)  $\frac{\partial H_g}{\partial p} = \ln \frac{(1 + (1 - g) p)^{1 - g} (1 - g p)^{g}}{p},$

and so by the method of Lagrange multipliers one can get:

(63)  $\frac{(1 + (1 - g) p)^{1 - g} (1 - g p)^{g}}{p} = e^{(a + bE)}.$

One can readily check that

(64)  $H_g = \tilde{H}_{\phi_g}(p),$

(65)  $p_g(E) = \widetilde{\exp}_{\phi_g}(a + bE),$

for the following function:

(66)  $\phi_g(p) = p (1 - g p)(1 + (1 - g) p).$
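The relation between Wu's formula (59)-(60) and the stationarity condition (63) can be checked numerically. The sketch below is illustrative, with hypothetical function names and an arbitrary value of η. It solves the functional equation (60) by bisection on $\ln \omega$, forms $p = 1/(\omega + g)$, and evaluates the left side of (63), which should reproduce $e^{\eta}$.

```python
import math

def solve_omega(eta, g):
    """Solve omega**g * (1 + omega)**(1 - g) = exp(eta) for omega > 0, 0 <= g <= 1.

    Bisection runs on t = ln(omega), where the left side is increasing.
    For g = 0 a positive root exists only when eta > 0.
    """
    def residual(t):
        return g * t + (1.0 - g) * math.log1p(math.exp(t)) - eta

    low, high = -700.0, max(eta, 0.0) + 1.0
    for _ in range(200):
        mid = 0.5 * (low + high)
        if residual(mid) < 0.0:
            low = mid
        else:
            high = mid
    return math.exp(0.5 * (low + high))

def weight(eta, g):
    """Weight function (59): p = 1 / (omega(eta) + g)."""
    return 1.0 / (solve_omega(eta, g) + g)

def stationarity_lhs(p, g):
    """Left side of (63): (1 + (1-g)p)^(1-g) * (1 - g*p)^g / p."""
    return (1.0 + (1.0 - g) * p) ** (1.0 - g) * (1.0 - g * p) ** g / p

eta = 1.3  # assumed value of a + b*E
for g in (0.0, 0.5, 1.0):
    p = weight(eta, g)
    print(g, p, stationarity_lhs(p, g), math.exp(eta))
```

For $g = 0$ and $g = 1$ the computed weights can be compared with $1/(e^{\eta} - 1)$ and $1/(e^{\eta} + 1)$; for $g = 1/2$ the quadratic $\omega (1 + \omega) = e^{2\eta}$ gives the closed form $p = 1/\sqrt{1/4 + e^{2\eta}}$.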
5. Conclusions and Prospects

In this paper we have generalized Jaynes' derivation of Boltzmann-Gibbs statistics by the Principle of Maximum Entropy. A family $H_\epsilon$ of entropy functions has been introduced to give a unified derivation of Bose-Einstein, Boltzmann-Gibbs and Fermi-Dirac statistics together with the interpolating Acharya-Swamy statistics. The family $H_\epsilon$ turns out to be a special case of Naudts' φ-entropy and the probabilities are φ-exponentials, with suitable modifications, for φ given by $\phi_\epsilon(p) = p + \epsilon p^2$. A different interpolation of Bose-Einstein and Fermi-Dirac statistics is given by the $\phi_g$-exponential function and the corresponding entropy function is given by the $\phi_g$-logarithm function, for $\phi_g(p) = p (1 - g p)(1 + (1 - g) p)$.

The two series of functions $\phi_\epsilon$ and $\phi_g$ suggest that we study more general deformations of $\phi = p$ given by $\phi_T(p) = p - \sum_{n \ge 2} T_n p^n$. In a subsequent work we will verify that other statistics interpolating Bose-Einstein statistics and Fermi-Dirac statistics are $\phi_T$-exponential functions and are critical points of $\phi_T$-entropy functions. Furthermore, any deformation of the Boltzmann-Gibbs-Shannon entropy in some suitable sense can be obtained as a $\phi_T$-entropy. We will use such considerations to establish a connection with string theory. More precisely, we will show that some computations in string theory can be used to generate interpolating statistics.

References

[1] R. Acharya, P. N. Swamy, Statistical mechanics of anyons, J. Phys. A: Math. Gen. 27 (1994), 7247-7263.
[2] J. W. Gibbs, Elementary principles in statistical mechanics developed with special reference to the rational foundation of thermodynamics. Dover, 1960.
[3] F. D. M. Haldane, "Fractional statistics" in arbitrary dimensions: A generalization of the Pauli principle, Phys. Rev. Lett. 67 (1991), 937-940.
[4] E. T. Jaynes, Information theory and statistical mechanics, Phys. Rev. (2) 106 (1957), no. 4, 620-630.
[5] A. Khare, Fractional Statistics and Quantum Theory.
World Scientific, 2005.
[6] J. Naudts, Generalised Thermostatistics. Springer Verlag, 2011.
[7] C. Tsallis, Introduction to nonextensive statistical mechanics. Springer Verlag, 2009.
[8] Y.-S. Wu, Statistical distribution for generalized ideal gas of fractional-statistics particles, Phys. Rev. Lett. 73 (1994), 922-925.
[9] C. N. Yang, C. P. Yang, Thermodynamics of a one-dimensional system of bosons with repulsive delta-function interaction, J. Math. Phys. 10 (1969), 1115-1122.

Department of Mathematical Sciences, Tsinghua University, Beijing, 100084, China
E-mail address: jzhou@math.tsinghua.edu.cn