Lagrange Principle and the Distribution of Wealth

Lagrange Principle and the Distribution of Wealth J. Mimkes, G. Willis*, Physics Department, University of Paderborn, 3396 Paderborn, Germany mimkes@zitmail.uni-paderborn.de, *South Elmsall, W. Yorkshire, U.K. gmw@intonet.co.uk Key words: Lagrange principle, Cobb Douglas production function, entropy, Boltzmann distribution, Pareto distribution, econophysics. Abstract The Lagrange principle L = f + λ g maximum! is used to maximize a function f(x) under a constraint g(x). Economists regard f(x) = U as a rational production function, which has to be maximized under the constraint of prices g(x). In physics f(x) = ln P is regarded as entropy of a stochastic system, which has to be maximized under constraint of energy g(x). In the discussion of wealth distribution it may be demonstrated that both aspects will apply. The stochastic aspect of physics leads to a Boltzmann distribution of wealth, which applies to the majority of the less affluent population. The rational approach of economics leads to a Pareto distribution, which applies to the minority of the super rich. The boundary corresponds to an economic phase transition similar to the liquid gas transition in physical systems. Introduction Since the work of Pareto (1) as long ago as 1897, it has been known that economic distributions strictly follow power law decays. These distributions have been observed across a wide variety of economic and financial data. More recently, Roegen (2), Foley (3), Weidlich, (4) Mimkes (5,6), Levy and Solomon (7,8), Solomon and Richmond (9), Mimkes and Willis (1), Yakovenko (11), Clementi and Gallegatti (12) and Nirei and Souma (13) have proposed statistical models for economic distributions. In this paper the Lagrange principle is applied to recent data of wealth in different countries.

2 Distribution of property in Germany 1993 (DIW) Property data for Germany (1993) have been published (14) by the German Institute of Economics (DIW). The data shows the number N(x) of households and the amount of capital K(x) in each property class (k), table 1. Property distribution in Germany 1993 (DIW estimated) Total property or capital (K ) 992 Bill. DM Number (N ) of households 35,6 Mill. Mean property (λ) per household 278 kdm / Hh Property class (k) in kdm Number N(x) of Hh Property K(x) of Hh in % in % k = 1: <1 46, 9,5 k = 2: 1-25 24,7 17,6 k = 3: 25-5 2,3 28,1 k = 4: 5-1 6,3 16,8 k = 5 >1 2,7 28, Table. 1 Distribution of property in different property classes (k) for households in Germany (1993), according to study by the DIW (14). The data are generally presented by a Lorenz distribution. Fig. 1 shows the distribution of capital K vs. the number N of households according to table 1. Distribution of wealth D 1993 (DIW) Capital K in % 1 8 6 4 2 D I W data (estimated) 2 4 6 8 1 Households N in % Fig. 1 Lorenz distribution sum of capital K vs. sum of households N in Germany 1993, data from DIW (14), solid line according to Eq.(2.9). The distribution of the N(k) households and the amount of capital K(k) = k N(k) in each property class (k) are now discussed by the Lagrange principle for two models, an economic and a physical approach.

3 1. Lagrange principle The Lagrange equation f λ g maximum! (1.) applies to all functions (f) that are to be maximized under constraints (g). The factor λ is called Lagrange parameter. 2. Calculation of the Boltzmann distribution In the view of most physicists economic interactions are stochastic and the probability P is to be maximized under constraints of capital according to the Lagrange principle ln P(x) λ Σ j w j x j maximum! (2.). ln P is the logarithm of probability P(x j ) or entropy that will be maximized under the constraints of the total capital in income Σ w j x j. The variable (x j) is the relative number of people in the income class (w j). The Lagrange factor λ = 1 / <w> is equivalent to the mean income <w> per person. Distributing N households to (w j ) property classes is a question of combinatorial statistics, P = N! / Π ( N j!) (2.1). Using Sterling s formula ( ln N! = N ln N - N) and x = N j / N we may change Eq.(2.) to Σ j x j ln x j λ Σ j w j x j maximum! (2.2). At equilibrium (maximum) the derivative of equation (2.2) with respect to x j will again be zero. Inserting Eq.(2.1) into equation (2.2) we obtain ln P / x j = (ln x j + 1) = λ w j (2.3). In this operation for x j all other variables are kept constant and we may solve Eq.(2.4) for x = x j. The number N(w) of people in income class (w) is given by N (w) = A exp ( - w / <w> ) (2.4). In the physical model the relative number of people (x) in the income class (w) follows a Boltzmann distribution, Eq.(2.4). The Lagrange parameter λ has been replaced by the mean income <w>. The constant A is determined by the total number of people N 1 with an income following a Boltzmann distribution and may be calculated by the integral from zero to infinity, N 1 = N (w) d w = A <w> (2.5). The amount of capital K(w) in the property class (w) is K (w) = A w exp ( - w / <w> ) (2.6). The total amount K 1 is given by the integral from zero to infinity, K 1 = A w x (w) d w = A <w> ² (2.7).

4 The ratio of total wealth (2.7) divided by the total number of households (2.6) leads to K 1 / N 1 = <w> (2.8). And is indeed the mean income <w> per household. The Lorenz distribution y = K(w) as a function of x = N(w) fig.1 may be calculated from the Boltzmann distribution. Eqs.(2.6) and (2.4) and leads to y = x + (1 - ) ln (1 x) (2.9). 2.1 German wealth data 1993 and the Boltzmann distribution The Boltzmann distribution in Eqs. (2.5) and (2.6) may be compared to the property data in table 1. Property in D 1993 (DIW and Boltzmann) households N(x) in % 5 4 3 2 1 DIW data Boltzmann 1 2 3 4 5 property tax class x Fig. 2 Distribution of households x(w) in property classes (w) in Germany 1993, data points according to DIW (14), curve according to Boltzmann, Eq.(2.4). Due to the data set of uneven classes of wealth in table 1 the function does not look like a Boltzmann distribution. Property in D 1993 (DIW and Boltzmann) 5 Capital K(x) in % 4 3 2 1 DIW data Boltzmann 1 2 3 4 5 property tax class x Fig. 3 Distribution of capital K(w) in property classes (w) in Germany 1993, data points according to DIW (14), curve according to Boltzmann, Eq.(2,6) Due to the data set of uneven classes of wealth in table 1 the function does not look like a Boltzmann distribution.

5 Calculations and data show a good agreement: 1. The data of the Lorenz distribution in fig 1 agree well with the calculation of Eq.(2.9). 2. The data for N(w) in fig. 2 agree rather well with Eq.(2.4) with small deviations. 3. The data for K(w) in fig. 3 also agree rather well with Eq.(2.6) again with small deviations. 4. The total number of persons is determined by the integral (from A = to B = ) N (w) d w = A <w> = N = 19 Mill Hh (2.5). The integral leads to the correct number of total households and is used to calculate A. 5. The total amount of property is determined by the integral (from A = to B = ) K (w) d w = A <w> 2 = K = 19 Mrd. DM (2.7). The integral leads to the correct amount of total capital 6 The Lagrange parameter λ = 1 / <w> is determined by the ratio of total property and total number households, <w> = K / N = 1. DM (2.8). This is the mean property of all households in Germany 1993. 2.2 US wages data 1995 and the Boltzmann distribution Wages like wealth may be expected to show a Boltzmann distribution according to eq.(2.4). However, jobs below a wage minimum w have the attractiveness a* =. Accordingly, the distribution of income will be given by N (w ) = a*(w, w ) exp ( - w / <w> ) (2.1). The number of people earning a wage (w) will depend on the job attractiveness a*(w, w ) and the Boltzmann function. Low wages will be very probable, high wages less probable. Figs. 4 and 5 show the wage distribution for service and manufacturing in the US in 1995 (12). US wage distribution (services) earnings per wage class 4,5 4 3,5 3 2,5 2 1,5 1,5 5 1 15 earnings / $ per week services Boltzmann Fig. 4 Number of people in wage classes in services in the US (15). The data have been fitted by the Boltzmann distribution,,eq.(2.1) with a*(w, w,) = (w w ).

6 US wage distribution (manufacturing) earnimgs per wage class 4,5 4 3,5 3 2,5 2 1,5 1,5 5 1 15 earnings / $ per week manufacturing Boltzmann Fig. 5 Number of people in wage classes in manufacturing in the US (15).The data have been fitted by the Boltzmann distribution, Eq.(2.1) with a*(w, w,) = (w w ). Again the Boltzmann distribution seems to be a good fit. However, some authors prefer to fit similar data by a log normal distribution e.g. F. Clementi and M. Gallegatti (15). Presently, both functions seem to apply equally well. 3. US wealth data 1993 to 21, Boltzmann and Pareto Distribution Yakovenko [12] and others have presented data that follow a Boltzmann distribution for the majority of normal wages and a Pareto distribution for the income of the minority of very rich people, fig. 6. Distribution of Wealth 1 percentage of people 1 1,1,1,1 1 1 1 income relative to average Fig. 6 US income (1): Percentage of people in wage classes relative to average. The wealth of the majority of people (97 %) follows a Boltzmann distribution, only the wealth of the minority of super rich follows a Pareto law. The Pareto tail has a slope between 2 and 3.

8 3.1 Pareto distribution of wealth Economic actions (f) are optimized under the constraints of capital, costs or prices (g). Economists are used to maximize the rational production function U(x j ) under constraints of total income Σ w j d x j. The variable (x j ) is relative number of people in each income class (w j ). The Lagrange principle (1.) is now given by U(x j) λ Σ j w j x j maximum! (3.). At equilibrium (maximum) the derivative of equation (3.) with respect to x j will be zero, U / x j = λ w j (3.1) In economic calculations a Cobb Douglas type Ansatz for the production function U is often applied, U(x j ) = A Π x j α j (3.2). A is a constant, the exponents α j are the elasticity constants. Inserting Eq.(3.2) into equation (3.1) we obtain U / x j = α j A x j αj - 1 = λ w j (3.3). In this operation for x j all other variables are kept constant and we may solve Eq.(3.3) for x = x j. The relative number x(w) of people in income class (w) is now given by N(w) = A (λ w / α A ) 1 / ( α -1) = C (w m / w ) 2 + δ (3.4). According to this economic model the number of people N(w) in the income class (w) follows a Pareto distribution! In Eq.(3.4) the Lagrange parameter λ has been replaced by the minimum wealth class of the super rich, w m = 1 / λ, the constants have been combined to C. The Pareto exponent stands for 2 + δ = 1 / (1- α). The relative number of people x (w) decreases with rising income (w). The total number of rich people N 2 with an income following a Pareto distribution is given by the integral from a minimum wealth w m to infinity, N 2 = N(w) d w = C w m / (1 + δ) (3.5). The minimum wealth w m is always larger than zero, w m >. The amount of capital K(w) in the property class (w) is K (w) = N(w) w = C w m (w m / w ) 1 + δ (3.6). The total amount of capital K 2 of very rich people with an income following a Pareto distribution is given by the integral from a minimum wealth w m to infinity, K 2 = N(w) w d w = C w m 2 / δ (3,7). For positive wealth the exponent δ needs to be positive, δ >. For a high capital of the super rich the exponent δ is expected to be: < δ < 1. The ratio of total wealth (3.7) divided by the total number of super rich households (3.6) leads to w m = (K 2 / N 2 ) δ / (1 + δ) (3.8).

9 3.2 Boltzmann and Pareto distribution in USA wealth data 1993 to 21 The calculations of the Boltzmann and Pareto law may both be applied to the USA distribution of wealth in fig. 6. A. Boltzmann distribution 1. The wealth of the majority of the population follows the Boltzmann distribution in Eq.(2.4). 2. The total number of normal rich people is N 1 = 97 % of the total population, N 1 = N(w) d w = A <w> =,97 (2.5) 3. The wealth of the normal rich population is given by K 1 = N(w) w d w = N 1 <w> =,97 <w> (2.7) B. Pareto distribution 1. The wealth of the super rich minority follows a Pareto law, Eq.(3.4) 2. The exponent of the Pareto tail in fig. 6 is between 2 and 3 or < δ < 1, as required by Eq.(3.7). 3. The minimum wealth of the super rich according to fig. 6 is about eight times the normal mean, w m = 8 <w>. 4. The total number of super rich minority is N 2 = 3 % of the total population, N 2 = N(w) d w = C w m =,3 (3.5). 5. The total capital of the super rich minority (for δ =,5) is K 2 = N 2 w m (1 + δ) / δ =,3 * 8 <w> * 3 =,72 <w> (3.8). 6. The mean wealth of the super rich minority is <w 2 > = (K 2 / N 2 ) = w m (1 + δ) / δ = 8 <w> * 3 = 25 <w> (3.8). 7. In fig. 6 the super rich minority (3 % of the population) owns 4 % (,72 <w>) of the national wealth and the normal rich majority (97 % of the population) owns 6 % (,97 <w>). 4. Boltzmann and Pareto phases of wealth distribution The normal rich majority and the super rich minority belong to two different states or phases. The majority is governed by the Boltzmann law, the minority by a Pareto law. This corresponds to two different phases, like liquid and gas in physical sciences, and may be calculated by the Lagrange principle, L = f λ g maximum! (1.). L 1 = x ln x <w> Σ j w j x j maximum! (2.). L 2 = A x α <w> Σ k w k x k maximum! (3.). y = b x m (4.). Eq.(1.) is the general Lagrange equation, Eq.(2.) the Lagrange principle in stochastic systems and Eq.(3.) the Lagrange principle in rational systems. All may be considered linear equations of <w>. The b - value in Eq.(4.) is given by the entropy or utility function, which is lower for the super rich population due to the small factor A, which is of the order of A =,1 in fig.6. The slope m is given by the total wealth Σ j w j x j, which is higher for the normal population (6 %).

1 Lagrange Principle: L= max! 1 Lagrange function 8 6 4 2 2 4 <w> c 6 8 1 mean wealth <w> normal super rich Fig. 7. For low mean income <w> the Lagrange function L 1 (solid line) is at maximum, the normal rich phase dominates. For very large mean income <w> the Lagrange function L 2 (broken line) is higher (at maximum), the super rich phase dominates. The borderline between the normal and the super rich population is given by the intersection of the two lines at <w> c in fig.7. Below <w> c the solid line is higher (at maximum) and the normal phase dominates. Above <w> c the broken line is higher (at maximum) and the super rich phase dominates. The transition point <w> c is given by L 1 = L 2. However, the data are not yet sufficient to tell whether the transition normal super rich really is of first order, as it is indicated by the sharp knee in fig. 6 and the intersection in fig. 7. Other authors (Yoshi) find a smooth second order transition from the Boltzmann region of normal people to the Pareto region of the super rich. The point of transition is important for the full understanding of the system, but even more important is the mechanism that keeps normal and super rich people separated and drifting more and more apart. This topic will be discussed in a separate paper on the mechanism of economic growth. Conclusion What is the advantage of introducing the Lagrange function to economics? The Lagrange function leads to a better understanding of economic interactions: 1. Most economic systems may be regarded to be stochastic with a probability that depends on costs and prices. Normal wealth and income are a matter of probability, as are selling, buying, finding a job or getting married. 2. The Lagrange principle includes stochastic and rational views and will also allow the inclusion of other probability theories (eg. game theory). 3. The Lagrange function links economic systems to social and natural systems such as physics, chemistry, meteorology, which are also governed by the Lagrange principle. This is one reason why many natural scientists are now involved in econo-physics and socio-economics. 4. As in other scientific disciplines, the Lagrange principle may be regarded as the basis of economics, and all economic properties of the system should be calculable from the Lagrange principle! However, the Lagrange principle does not include time. For time dependent properties other methods like. the Fokker Planck or Master equation (3) need to be applied.

11 References (1) V. Pareto, Cours d'economie politique. Reprinted as a volume of Oeuvres Completes (Droz. Geneva, 1896-1965) (2) N. Georgescu-Roegen, The entropy law and the economic process / Cambridge, Mass. Harvard Univ. Press, 1974. (3) D.K. Foley, A Statistical Equilibrium Theory of Markets, Journal of Economic Theory Vol. 62, No. 2, April 1994. (4) W. Weidlich, The use of statistical models in sociology, Collective Phenomena 1, (1972) 51 (5) J. Mimkes, Binary Alloys as a Model for Multicultural Society, J. Thermal Anal. 43 (1995) 521-537 (6) J. Mimkes, Society as a many-particle System, J. Thermal Analysis 6 (2) 155-169 (7) M. Levy & S. Solomon, Power Laws are Logarithmic Boltzmann Laws, International Journal of Modern Physics C, Vol. 7, No. 4 (1996) 595; J. (8) M. Levy, H. Levy & S. Solomon, Microscopic Simulation of Financial Markets, Academic Press, New York, 2. (9) P. Richmond & S. Solomon, Power Laws are Boltzmann Laws in Disguise, S. Solomon & P. Richmond, Stability of Pareto-Zif Law in Non-Stationary Economies, 2 (1) J. Mimkes and G Willis, Ecidence for Boltzmann Distribution in Waged Income, cond. mat./46694 (11) A.C.Silva and V.M. Yakovenko, Temporal evolution of the thermal and and superthermal income classes in the USA during 1983 21, Europhysics Letters 69, 34 31 (25) (12) F. Clementi and M. Gallegati, Power law tails in Italien Personal Income Distribution, Physica A submitted (13) M. Nirei and W. Souma, Two Factor Model of Income Distribution Dynamics www.santafe.edu/research/publications/wpabstract/24129 (14) Report Deutsches Institut für Wirtschaftsforschung, DIW, Berlin 1993 (15) Cleveland Federal Reserve Bank; US Department of Labor; Bureau of Statistics. 1996