applet-magic.com
Thayer Watkins
Silicon Valley &
Tornado Alley
USA

Maximum Entropy Distributions

General Nature of the Problem:
The Discrete Case

Consider a set of states labeled i=0,1,2,... and a set of entities which are distributed among those states. Let the number of entities in state i be denoted as ni and the total number of entities be n. The proportion in state i is pi = ni/n.

The entropy for a distribution is defined as:

S = Σpiln(1/pi) = -Σpiln(pi)

As to whether the above definition corresponds to or is consistent with the phenomenonological definition of entropy in thermodynamics is another question and that must be dealt with elsewhere.

Each state has an associated value wi. The total value of this state variable is given by the expression:

W = Σniwi

One important example of the subject matter is the case in which the entities are molecules in a gas and the state variable is energy. However the above material may also be applied to other cases such as when the entities are households and the state variable is wealth.

The constraints on the distribution are that the number of entities in the states is fixed and also the value of the state variable for the ensemble is fixed; i.e.,

Σ ni = n
Σ niwi = W

These constraints can be put into a more convenient form by dividing through by n. The results are:

Σ ni/n = Σ pi = 1
Σ (ni/n)wi = Σ piwi = W/n = w

The Determination of the Maximum Entropy Distribution

The problem is now to choose the pi's so as to

maximize S = -Σpiln(pi)
subject to the constraints
Σpi =1
and
Σpiwi = w

The Lagrangian multiplier method may be used to solve this constrained maximization problem. This means we should seek the unconstrained maximum of

S - λ(Σpi-1) - μ(Σpiwi - w)

and choose values for the multipliers λ and μ which will result in the satisfaction of the constraints.

The first order conditions are:

∂S/∂pi - λ - μwi = 0
for all i

Since

∂S/∂pi = -ln(pi) -1

the first order conditions reduce to

ln(pi) = 1-λ - μwi
or
pi = e1-λe-μwi
for all i.

For convenience let e1-λ=A. Then the first constraint is that

ΣAe-μwi
which means that
A = 1/(ΣAe-μwi)

The second constraint is then that

(Σwie-μwi)/(ΣAe-μwi) = w

An Important Special Case

The solution for the value of μ which satisfies the above condition in general would require numberical. This is however a special case in which an analytical solution is possible. That is the special case in which the state variable values are equally spaced and start at zero; i.e.,

wi = hi for some constant h.

Let H(μ) be defined as:

H(μ) = Σ e-μhi = Σ(e-μh)i

This is a geometric series which has the value

H(μ) = 1/(1-e-μh)

Furthermore

H'(μ) = Σ(-hi)e-μhi = -Σ(hi)e-μhi

But since

H(μ) = (1-e-μh)-1
it follows that
H'(μ) = -he-μh(1-e-μh)-2

The second constraint for the case that wi=hi is that

(Σ(hi)e-μhi)/(Σe-μhi) = w.

But

(Σ(hi)e-μhi)/(Σe-μhi) = -H'(μ)/H(μ)
= he-μh/(1-e-μh).

Therefore

he-μh/(1-e-μh) = w

Solving this equation for μ gives

μ = ln(1+h/w)/h.

The Limiting Continuous Case

The limit of the above solution for μ as h goes to zero is, by L'Hospital's Rule,

μ = 1/w.

The above result can also be obtained by considering the continuous case; i.e.,

Maximize S = -∫0p(x)ln(p(x))dx
with respect to p(x)
subject to the constraints
0p(x)dx = 1
and
0xp(x)dx = w

The Maximum Value of Entropy

Consider now the general case and what the value of the maximum entropy is. Let S* be the maximum entropy, the value achieved when pi = Ae-μwi.

Since ln(pi) = ln(A)-μwi
S* = Σpi(ln(A)-μwi)
= ln(A)Σpi - μΣwipi
= ln(A) - μw

This latter expression is the logarithm of the probability of a state whose value is w.

A Simple Extension

There is a simple extension of the above analysis that is worth carrying out. Suppose the minimum value of the state variable is not necessarily zero but instead is some level j. For the discrete case with evenly-spaced states this means that

pi = Ae-μhi for all i≥j
or
pi = Ae-hje-μh(i-j)

All of the previous results hold with w replaced by w-hj; i.e.,

μ = ln(1 + h/(w-hj))/h
and for the continuous case of h -> 0
μ = 1/(w-hj)

Conclusion

The maximum entropy distribution is the negative exponential distribution where the parameter of the negative exponential is in the limit of continuous values for the state variable the reciprocal of the average level of the state variable for the population. The maximum entropy for this distribution is equal to the logarithm of what the probability density would be at the average level of the state variable.

Although the concept of a maximum entropy distribution can be formally applied to distributions outside of thermal physics those applications are not necessarily of any empirical significance. In thermal physics any interaction with an extremely high probability increases the entropy so in practice the only significant distributions are the ones with maximum entropy. Outside of thermal physics the interactions of the entities may not operate in the same way. In thermal physical phenomena if there is a high concentration of thermal energy; i.e., high temperature, relative to the surroundings that energy gets dispersed. In economics it is common that entities with a relative low level of wealth transfer wealth to entities with a relative high level of wealth as at a rock concert or a sporting event. Nevertheless it is worth noting that the Italian sociologist-economist, Vilfredo Pareto, when he did a study of the distribution of income in three types of economic systems, capitalist, communist and fascist, found the the distributions for the higher levels of income were all of the same form, negative exponential.

HOME PAGE OF applet-magic.com