What Can We Estimate About Population in Prehistory and History?

Calculating the number of people who have ever lived is part science and part art. No demographic data exist for more than 99% of the span of human existence.

Any estimate of the total number of people who have ever lived depends essentially on three factors: the length of time that humans are thought to have been on Earth, the average size of the population at different periods, and the number of births per 1,000 population during each of those periods. The estimate, however, does not depend on the number of deaths during any period of time.
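The dependence on those three factors can be sketched in a few lines. This is only an illustration: the function name `births_in_period` and every number in `periods` are made-up placeholders, not figures from Table 1 or any other source.

```python
def births_in_period(avg_population, births_per_1000, years):
    """Births in one period: average population x annual birth rate x length in years."""
    return avg_population * (births_per_1000 / 1000.0) * years

# Hypothetical placeholder periods: (average population, births per 1,000, years).
# These values are invented purely to show the arithmetic.
periods = [
    (1_000_000, 80, 1000),
    (10_000_000, 60, 500),
]

# The running total of births is the estimate of "people ever born" so far.
total_births = sum(births_in_period(*period) for period in periods)
print(f"{total_births:,.0f} births across the placeholder periods")
```

Deaths never enter the calculation, which matches the point that the estimate does not depend on them.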

The oldest hominins are thought to have appeared as early as 7 million B.C.E. The earliest species of the Homo genus appeared around 2 million to 1.5 million B.C.E.

Modern Homo sapiens originated in Africa, though the exact location has long been debated. Diverse groups are thought to have lived in different locations across Africa for the first two-thirds of human history.

(Table 1 displays very rough figures representing averages of an estimate of ranges given by the United Nations and other sources.) Slow population growth over the 8,000-year period—from an estimated 5 million in 8000 B.C.E. to 300 million in 1 C.E.—results in a very low growth rate of only 0.05% per year.
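The growth rate quoted above can be checked with a short calculation. The sketch below assumes constant compound growth between the two benchmark populations; the function name `annual_growth_rate` is illustrative.

```python
def annual_growth_rate(p_start, p_end, years):
    """Average compound annual growth rate between two population sizes."""
    return (p_end / p_start) ** (1.0 / years) - 1.0

# 5 million in 8000 B.C.E. growing to 300 million in 1 C.E., over 8,000 years.
rate = annual_growth_rate(5e6, 300e6, 8000)
print(f"{rate * 100:.3f}% per year")  # roughly 0.05% per year
```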

In all likelihood, human populations in different regions grew or declined in response to food availability, the variability of animal herds, periods of peace or hostility, and changing weather and climatic conditions.

Calculating the probability

In probability theory, the birthday problem asks for the probability that, in a set of n randomly chosen people, at least two will share a birthday. The birthday paradox refers to the counterintuitive fact that only 23 people are needed for that probability to exceed 50%.

The birthday paradox is a veridical paradox: it seems wrong at first glance but is, in fact, true. While it may seem surprising that only 23 individuals are required to reach a 50% probability of a shared birthday, this result is made more intuitive by considering that the birthday comparisons will be made between every possible pair of individuals.
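The number of pairwise comparisons grows much faster than the group size, which is easy to verify. A minimal sketch, with the helper name `pair_count` chosen here for illustration:

```python
from math import comb

def pair_count(n):
    """Number of distinct pairs that can be formed from n people."""
    return comb(n, 2)

# 23 people already produce 253 pairs, each a chance for a shared birthday.
for n in (5, 10, 23):
    print(n, pair_count(n))
```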

Real-world applications for the birthday problem include a cryptographic attack called the birthday attack, which uses this probabilistic model to reduce the complexity of finding a collision for a hash function, as well as calculating the approximate risk of a hash collision existing within the hashes of a given size of population.

The problem is generally attributed to Harold Davenport in about 1927, though he did not publish it at the time. Davenport did not claim to be its discoverer “because he could not believe that it had not been stated earlier”.

From a permutations perspective, let A be the event that a group of 23 people contains no repeated birthdays, and let B be the event that at least two people in the group share the same birthday. The two events are complementary, so P(B) = 1 − P(A).

P(A) is the ratio of V_nr, the number of birthday assignments without repetition where order matters (e.g. for a group of 2 people in mm/dd birthday format, the outcomes include {01/02, 05/20}, {05/20, 01/02}, {10/02, 08/04}, ...), to V_t, the total number of birthday assignments with repetition where order matters, as it is the total space of outcomes from the experiment (e.g. for 2 people, the outcomes include {01/02, 01/02}, {10/02, 08/04}, ...). Therefore V_nr and V_t are permutations.

Another way the birthday problem can be solved is by asking for an approximate probability that in a group of n people at least two have the same birthday.

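The usual calculation assumes that all 365 birthdays are equally likely; any unevenness in the distribution of births increases the probability of a shared birthday. The problem of a non-uniform number of births occurring during each day of the year was first addressed by Murray Klamkin in 1967.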

The goal is to compute P(B), the probability that at least two people in the room have the same birthday. However, it is simpler to calculate P(A′), the probability that no two people in the room have the same birthday. Because B and A′ are the only two possibilities and are mutually exclusive, P(B) = 1 − P(A′).

Here is the calculation of P(B) for 23 people. Let the 23 people be numbered 1 to 23.

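The event that all 23 people have different birthdays is the same as the event that person 2 does not share a birthday with person 1, that person 3 does not share a birthday with person 1 or person 2, and so on up to person 23. Let these events be called Event 2, Event 3, and so on. Event 1 is the event of person 1 having a birthday, which occurs with probability 1. The probability of Event 2 is 364/365, as person 2 may have any birthday other than that of person 1.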

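Similarly, the probability of Event 3 given that Event 2 occurred is 363/365, as person 3 may have any of the birthdays not already taken by persons 1 and 2. This continues until finally the probability of Event 23 given that all preceding events occurred is 343/365. Multiplying these conditional probabilities gives:

$$P(A') = \frac{365}{365} \times \frac{364}{365} \times \frac{363}{365} \times \cdots \times \frac{343}{365} \qquad (1)$$

The terms of equation (1) can be collected to arrive at:

$$P(A') = \left(\frac{1}{365}\right)^{23} \times (365 \times 364 \times 363 \times \cdots \times 343) \qquad (2)$$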

Evaluating equation (2) gives P(A′) ≈ 0.492703. Therefore, P(B) ≈ 1 − 0.492703 = 0.507297 (50.7297%).
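The value quoted above can be reproduced by multiplying the 23 factors directly. The sketch below uses exact rational arithmetic to avoid rounding until the final conversion; the function name `prob_all_distinct_23` is illustrative.

```python
from fractions import Fraction

def prob_all_distinct_23():
    """Product 365/365 * 364/365 * ... * 343/365 as an exact fraction."""
    p = Fraction(1)
    for k in range(23):  # k = 0..22 gives numerators 365 down to 343
        p *= Fraction(365 - k, 365)
    return p

p_distinct = float(prob_all_distinct_23())
p_shared_23 = 1.0 - p_distinct
print(f"no shared birthday: {p_distinct:.6f}")
print(f"shared birthday:    {p_shared_23:.6f}")
```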

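This process can be generalized to a group of n people, where p(n) is the probability of at least two of the n people sharing a birthday. It is easier to first calculate the probability p̄(n) that all n birthdays are different. By the pigeonhole principle, p̄(n) is zero when n > 365.

When n ≤ 365:

$$\bar{p}(n) = 1 \times \left(1 - \frac{1}{365}\right) \times \left(1 - \frac{2}{365}\right) \times \cdots \times \left(1 - \frac{n-1}{365}\right) = \frac{365 \times 364 \times \cdots \times (365 - n + 1)}{365^n} = \frac{365!}{365^n (365 - n)!}$$

where ! is the factorial operator.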

The equation expresses the fact that the first person has no one to share a birthday, the second person cannot have the same birthday as the first (364/365), the third cannot have the same birthday as either of the first two (363/365), and in general the nth birthday cannot be the same as any of the n − 1 preceding birthdays.

The event of at least two of the n persons having the same birthday is complementary to all n birthdays being different. Therefore, its probability p(n) is

$$p(n) = 1 - \bar{p}(n)$$

where p̄(n) is the probability that all n birthdays are different.
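The generalization to n people translates directly into a short function. This is a sketch under the same uniform 365-day assumption; the name `p_shared` and its `days` parameter are illustrative.

```python
def p_shared(n, days=365):
    """Probability that at least two of n people share a birthday."""
    if n > days:
        return 1.0  # pigeonhole principle: a repeat is certain
    p_distinct = 1.0
    for k in range(n):
        p_distinct *= (days - k) / days
    return 1.0 - p_distinct

# Smallest group size for which a shared birthday is at least as likely as not.
smallest = next(n for n in range(1, 367) if p_shared(n) >= 0.5)
print(smallest, round(p_shared(smallest), 6))
```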

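The Taylor series expansion of the exponential function (the constant e ≈ 2.718281828)

$$e^x = 1 + x + \frac{x^2}{2!} + \cdots$$

provides a first-order approximation for e^x for |x| ≪ 1:

$$e^x \approx 1 + x$$

To apply this approximation to the formula for the probability that all n birthdays are different, set x = −a/365. Thus,

$$e^{-a/365} \approx 1 - \frac{a}{365}$$

Then, replace a with non-negative integers for each term in the formula until a = n − 1. For example, when a = 1,

$$e^{-1/365} \approx 1 - \frac{1}{365}$$

The probability that all n birthdays are different is then approximately

$$1 \times e^{-1/365} \times e^{-2/365} \cdots e^{-(n-1)/365} = e^{-(1 + 2 + \cdots + (n-1))/365} = e^{-n(n-1)/730}$$

Therefore,

$$p(n) \approx 1 - e^{-n(n-1)/730}$$

An even coarser approximation is given by

$$p(n) \approx 1 - e^{-n^2/730}$$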
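The two exponential approximations can be compared numerically for the group of 23. A minimal sketch, assuming 365 equally likely days; the names `p_taylor` and `p_coarse` are illustrative.

```python
import math

def p_taylor(n):
    """Approximation 1 - exp(-n(n-1)/730), based on e^x ≈ 1 + x."""
    return 1.0 - math.exp(-n * (n - 1) / 730)

def p_coarse(n):
    """Coarser approximation 1 - exp(-n^2/730)."""
    return 1.0 - math.exp(-n * n / 730)

print(f"{p_taylor(23):.6f}")  # close to the exact value of about 0.5073
print(f"{p_coarse(23):.6f}")  # slightly higher than the exact value
```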

According to the approximation, the same approach can be applied to any number of “people” and “days”. If there are d days rather than 365, n persons, and n ≪ d, then the same approach as above shows that p(n, d), the probability that at least two out of n people share the same birthday from a set of d available days, satisfies:

$$p(n, d) \approx 1 - e^{-n(n-1)/(2d)}$$
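To get a feel for how well the generalized approximation holds when n ≪ d, it can be compared against the exact product. This is a sketch with illustrative names (`p_exact`, `p_approx`); the values d = 10,000 and n = 100 are arbitrary choices for demonstration.

```python
import math

def p_exact(n, d):
    """Exact probability that at least two of n people share one of d days."""
    if n > d:
        return 1.0
    p_distinct = 1.0
    for k in range(n):
        p_distinct *= (d - k) / d
    return 1.0 - p_distinct

def p_approx(n, d):
    """Exponential approximation 1 - exp(-n(n-1)/(2d)), intended for n much smaller than d."""
    return 1.0 - math.exp(-n * (n - 1) / (2 * d))

for n, d in [(23, 365), (100, 10_000)]:
    print(n, d, round(p_exact(n, d), 4), round(p_approx(n, d), 4))
```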

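In a room containing n people, there are $\binom{n}{2} = n(n-1)/2$ pairs of people, i.e. $\binom{n}{2}$ events, each with probability 364/365 that the two people in the pair have different birthdays.

The probability of no two people sharing a birthday can be approximated by treating these events as independent and multiplying their probabilities together. Being independent would be equivalent to picking with replacement any pair of people in the world, not just in a room. In short, 364/365 can be multiplied by itself $\binom{n}{2}$ times, which gives the approximate probability that no two people share a birthday:

$$\left(\frac{364}{365}\right)^{\binom{n}{2}}$$

The probability that some pair shares a birthday is therefore

$$p(n) \approx 1 - \left(\frac{364}{365}\right)^{\binom{n}{2}}$$

And for the group of 23 people, the probability of sharing is

$$p(23) \approx 1 - \left(\frac{364}{365}\right)^{253} \approx 0.500477$$

Applying the Poisson approximation for the binomial on the group of 23 people,

$$\mathrm{Poi}\left(\frac{\binom{23}{2}}{365}\right) = \mathrm{Poi}\left(\frac{253}{365}\right) \approx \mathrm{Poi}(0.6932)$$

so

$$\Pr(X > 0) = 1 - \Pr(X = 0) \approx 1 - e^{-0.6932} \approx 1 - 0.499998 = 0.500002$$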

The result is over 50%, in agreement with the previous calculations. This approximation is the same as the one above based on the Taylor expansion that uses e^x ≈ 1 + x.
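Both pair-based estimates are straightforward to evaluate. The sketch below treats the pairs as independent, as the text does; the function names `p_pairs` and `p_poisson` are illustrative.

```python
import math

def p_pairs(n, d=365):
    """Approximation treating all C(n, 2) pairs as independent events."""
    return 1.0 - ((d - 1) / d) ** math.comb(n, 2)

def p_poisson(n, d=365):
    """Poisson approximation with mean C(n, 2) / d."""
    return 1.0 - math.exp(-math.comb(n, 2) / d)

print(f"{p_pairs(23):.6f}")
print(f"{p_poisson(23):.6f}")
```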

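A good rule of thumb which can be used for mental calculation is the relation

$$p(n) \approx \frac{n^2}{2m}$$

which can also be written as

$$n \approx \sqrt{2m \times p(n)}$$

In these equations, m is the number of days in a year.

For instance, to estimate the number of people required for a 1/2 chance of a shared birthday, we get

$$n \approx \sqrt{2 \times 365 \times \tfrac{1}{2}} = \sqrt{365} \approx 19$$

which is not too far from the correct answer of 23.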
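The rule of thumb amounts to a single square root. A minimal sketch, with the helper name `people_needed` chosen for illustration and m defaulting to 365 days:

```python
import math

def people_needed(p, m=365):
    """Rule-of-thumb group size n ≈ sqrt(2 * m * p) for target probability p."""
    return math.sqrt(2 * m * p)

estimate = people_needed(0.5)
print(f"{estimate:.1f}")  # about 19, versus the exact answer of 23
```

The estimate undershoots because the relation is only a rough approximation for probabilities as large as 1/2.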

This is a result of the good approximation that an event with 1/k probability will have a 1/2 chance of occurring at least once if it is repeated k ln 2 times.
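The k ln 2 approximation can be checked numerically. The sketch below allows a fractional number of repetitions purely for the sake of the check; the function name `prob_at_least_once` is illustrative.

```python
import math

def prob_at_least_once(k):
    """Chance that an event of probability 1/k occurs at least once in k*ln(2) trials."""
    trials = k * math.log(2)
    return 1.0 - (1.0 - 1.0 / k) ** trials

# The result approaches 1/2 as k grows.
for k in (10, 365, 1_000_000):
    print(k, round(prob_at_least_once(k), 6))
```

For k = 365 the number of repetitions is 365 ln 2 ≈ 253, which matches the number of pairs in a group of 23.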

Using the birthday analogy: the “hash space size” resembles the “available days”, the “probability of collision” resembles the “probability of shared birthday”, and the “required number of hashed elements” resembles the “required number of people in a group”.

For comparison, 10⁻¹⁸ to 10⁻¹⁵ is the uncorrectable bit error rate of a typical hard disk. In theory, 128-bit hash functions, such as MD5, should stay within that range until about 8.2×10¹¹ documents, even though their possible outputs are far more numerous.
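The document count quoted above follows from the same square-root relation used for birthdays, with the hash space taking the place of the days in a year. This is a sketch assuming an ideal hash with uniformly distributed outputs and a small collision probability; the function names are illustrative.

```python
import math

def collision_probability(n_items, bits):
    """Approximate collision probability n^2 / (2 * 2^bits), valid while the result is small."""
    return n_items ** 2 / (2 * 2 ** bits)

def items_for_probability(p, bits):
    """Approximate number of hashed items at which the collision probability reaches p."""
    return math.sqrt(2 * 2 ** bits * p)

n = items_for_probability(1e-15, 128)
print(f"{n:.2e} documents")  # on the order of 8.2e11
```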

The argument below is adapted from an argument of Paul Halmos.[nb 1] As stated above, the probability that no two birthdays coincide is

$$1 - p(n) = \prod_{k=1}^{n-1} \left(1 - \frac{k}{365}\right)$$

