1The “first significant digit” of a number is its leftmost non-zero digit. For example, the first significant digit of the number 325 is 3 and the first significant digit of 0.8732 is 8. It might be expected that the first significant digits of any given series of numbers, or of a set of numbers measuring any given phenomenon, are randomly distributed. Nothing of the sort: in most series found in the real world, figure 1 appears more often than figure 2, which in turn appears more often than figure 3, and so on. The purpose of this note is to illustrate this rule, known as Benford’s law, using data for the populations of all world countries, and to show its underlying logic, which in this particular case, relies on the pattern of population growth.
I. Benford’s law
2In 1881, the mathematician and astronomer Simon Newcomb noticed that the first volumes of the tables of logarithms in the library of his institution were more worn than subsequent ones. This meant that there were more frequent consultations for numbers starting with 1 or 2 than for numbers starting with 8 or 9. In 1938, the engineer Benford made the same observation about the same tables of logarithms, independently of Newcomb’s work. In his article, Benford compiled numerous sets of data, from physical constants to baseball results and various number series he found in newspapers. The average number of appearances of the first significant digits followed a logarithmic law already proposed by Newcomb (Bedford, 1938):
3— where Fd is the frequency of appearance of the first significant digit d.
4Benford’s law predicts that first significant digits will be distributed as described in Table 1.
Predicted frequency of appearance of the first significant digit according to Benford’s law
Predicted frequency of appearance of the first significant digit according to Benford’s law
5The sum of probabilities can be shown to be equal to 1:
6In practice, it is difficult to apply this law universally; some numerical series do not conform to it. Hill (1995) demonstrated, however, that for these series, random samples taken from the complete series did follow Benford’s law. Next, we will see how the law works for the current populations of different countries worldwide.
II. The distribution of world populations
7INED regularly publishes a list of country-specific statistics worldwide. Here, we will use the 1997 list, which gives information on 198 countries or geopolitical entities. Figure 1 compares the observed distribution with the distribution predicted by Benford’s law. The difference between the distributions is small, confirming that the distribution of population among countries does in fact follow Benford’s law.
Observed frequency of the first significant digit of the 1997 population size for 198 countries and predicted frequency according to Benford’s law
Observed frequency of the first significant digit of the 1997 population size for 198 countries and predicted frequency according to Benford’s law
8If we perform the same type of calculations for surface areas and population densities, the results are similar (Figures 2 and 3).
Observed frequency of the first significant digit of the 1997 surface area for 198 countries and predicted frequency according to Benford’s law
Observed frequency of the first significant digit of the 1997 surface area for 198 countries and predicted frequency according to Benford’s law
Observed frequency of the first significant digit of the 1997 population density for 198 countries and predicted frequency according to Benford’s law
Observed frequency of the first significant digit of the 1997 population density for 198 countries and predicted frequency according to Benford’s law
III. An explanation for population sizes
9First, it must be made clear that there is no bias resulting from rounding the population size of small countries to 1 as a first significant digit, the smallest number in the table being 0.02 (million).
10The explanation lies elsewhere. To work it out, one needs to look at the distribution of population sizes for a population with constant growth rate, for example 2% annually. Table 2 shows that at a constant annual rate of growth, population sizes with first significant digit 1 are more common than others, and follow Benford’s law. If we extend these calculations over the long term, the observed frequency of first significant digits follows Benford’s law perfectly; this is also true with other constant growth rates.
Population size over 172 time periods with a 2% growth rate*
Population size over 172 time periods with a 2% growth rate*
11Demographers use the shortcut of dividing 70 by the growth rate (in%) of a population to find the population’s doubling time. This result can be derived as follows:
- the pace of the growth is given by ;
- where P(u) = population size at time u;
- r = growth rate.
- then ;
- and where r’ is the constant growth rate in %.
12To move from size 200 to size 300, it only takes 21 units of time:
13It takes 15 units of time to grow from 300 to 400:
14and only 5 units from 800 to 900:
15The number of time periods necessary for a population to grow from size (d)00 to (d+1)00 is given by the formula:
16— with
17— where M is a constant
18— and Fd the frequency of appearance of the first significant digit d.
19Thus, this formula is determined by the distribution of first significant digits, which is precisely the distribution specified by Benford’s law. Over a long period, the first significant digit of the population size of any given country is therefore more often 1 than 2, 2 than 3, and so on, up to 9.
20To look for a cross-sectional version of this longitudinal regularity for countries is equivalent to drawing a random sample among a set of 198 series that conform to Bedford’s law, assuming that the following two hypotheses hold, which can be reasonably assumed.
21Hypothesis 1: the population sizes of different countries are independent;
22Hypothesis 2: the timing of the onset of the demographic transition in any given country is independent of the first significant digit of its population size.
Conclusion
23The explanation given here for the conformity of population sizes to Bedford’s law does not negate the existence of other data sets, for example the areas of countries, that follow Benford’s law but do not fit this longitudinal argument. Certain advances in probability theory have led to a better understanding of the principal aspects of Benford’s law, but a completely satisfactory explanation still remains to be developed (Hill, 1999).
Acknowledgements
The author wishes to thank Éva Lelièvre and Etienne van de Walle, as well as the anonymous referees, for their valuable comments.Bibliography
REFERENCE
- Benford F., 1938, “The law of anomalous numbers”, Proceedings of the American Philosophical Society, 78(4), pp. 551-572.
- INED, 1997, “Tous les pays du monde”, Population et Sociétés, 326, July-Aug.
- Hill T., 1995, “A statistical derivation of the significant-digit law”, Statistical Science, 10(4), pp. 354-363.
- HillT., 1999, “Le premier chiffre significatif fait sa loi”, La Recherche, 316, January, pp. 72-75.
- Newcomb S., 1881, “Note on the frequency of use of the different digits in natural numbers”, Am. J. Mathematics, 4, pp. 39-40.