Changing default parameters can prevent false peaks in population size estimations.
In the final chapter of my PhD thesis, I explored the impact of hybridization on the evolutionary history of the True Geese (eventually published in BMC Evolutionary Biology). One of the genomic analyses involved a pairwise sequentially Markovian coalescent (PSMC) approach which uses estimates of coalescent times to infer changes in effective population size (Ne) over time. This technique – developed by Heng Li and Richard Durbin – is nicely explained by David Reich in his book Who We Are and How We Got Here:
A 2011 paper by Heng Li and Richard Durbin showed that the idea that a single person’s genome contains information about a multitude of ancestors was not just a theoretical possibility, but a reality. To decipher the deep history of a population from a single person’s DNA, Li and Durbin leveraged the fact that any single person actually carries not one but two genomes: one from his or her father and one from his or her mother. Thus it is possible to count the number of mutations separating the genome a person receives from his or her mother and the genome the person receives from his or her father to determine when they shared a common ancestor at each location. By examining the range of dates when these ancestors lived—plotting the ages of one hundred thousand Adams and Eves—Li and Durbin established the size of the ancestral population at different times. In a small population, there is a substantial chance that two randomly chosen genome sequences derive from the same parent genome sequence, because the individuals who carry them share a parent. However, in a large population the chance is far lower. Thus, the times in the past when the population size was low can be identified based on the periods in the past when a disproportionate fraction of lineages have evidence of sharing common ancestors.
The PSMC results from my study revealed that some goose species showed a marked increase in effective population size during the Last Glacial Maximum (about 110,000 to 12,000 years ago). We interpreted this pattern as a consequence of “population subdivision and occasional gene flow, leading to higher levels of heterozygosity and consequently higher estimates of Ne.” However, a recent paper in the journal Current Biology suggests that these population peaks might represent artefacts due to issues with default parameters.

Sophisticated Simulations
Leon Hilgers and his colleagues were analyzing several turtle species when they “detected a curious pattern of dramatic peaks followed by even more extreme population collapses.” This pattern occurred across distantly related turtle species from around the globe. Moreover, a literature search revealed similar patterns in countless other taxa, such as horses (where it was linked to expanding and contracting grasslands) and the Cape Buffalo (where it was explained by colonization of new areas). What is going on here?
First, the researchers examined whether the extreme peaks might reflect genuine increases in effective population size. However, when they simulated genomes based on the expected population history, no such peaks were detected. Similarly, simulations incorporating population fragmentation failed to produce extreme peaks (except in cases with high levels of post-fragmentation gene flow). Taken together, these simulation results suggest that the observed peaks are unlikely to represent real biological phenomena.

Splitting Time Intervals
Perhaps the extreme peaks are due to a technical artifact related to PSMC parameter settings? Indeed, when the researchers adjusted some parameter settings, the peaks disappeared from the PSMC plots. To understand this finding, we need to dive into the details of the p-parameter which divides time into a series of intervals, after which PSMC estimates an effective population size for each interval.
The default setting for this parameter looks like this: “-p 4 + 25 x 2 + 4 + 6”. This series of numbers can be broken down into a set-up of time intervals. You start with four small intervals for recent times, followed by 25 groups of two intervals for the mid-range time. Next, there are four small intervals for somewhat ancient times. And the final six intervals are merged together in the very distant past.
The artefact could be avoided by splitting the first time window into two windows (so ‘‘-p 2 + 2 + 25 x 2 + 4 + 6’’ instead of the default ‘‘-p 4 + 25 x 2 + 4 + 6’’). This adjustment, nicely demonstrated in additional analyses of several primate species, prevents the default setting from fixing the first four intervals into a single large window. In the default model, PSMC infers one Ne for this broad window and cannot capture population changes within it. When population declines occur during this period, the model thus may overcompensate by inflating Ne estimates in the preceding window. Splitting the first window into two can circumvent this issue.

A Call for Caution
This study underscores the need for careful evaluation of complex analytical methods. New genomic tools are often rapidly applied to preferred study systems (such as the True Geese, in my case), but their outputs warrant cautious interpretation. Dramatic increases in effective population size might reflect real environmental or climatic events, or they can be artefacts of the method. Researchers should therefore invest time to thoroughly understand a method’s assumptions and limitations before applying it. Not an easy choice under the time pressure of today’s “publish-or-perish” environment.
References
Hilgers, L., Liu, S., Jensen, A., Brown, T., Cousins, T., Schweiger, R., Guschanski, K. & Hiller, M. (2025). Avoidable false PSMC population size peaks occur across numerous studies. Current Biology, 35(4), 927-930.
Ottenburghs, J., Megens, H. J., Kraus, R. H., Van Hooft, P., van Wieren, S. E., Crooijmans, R. P., Ydenberg, R. C., Groenen, M. A. M. & Prins, H. H. T. (2017). A history of hybrids? Genomic patterns of introgression in the True Geese. BMC Evolutionary Biology, 17(1), 201.
Featured image: PSMC plot from Ottenburghs et al. (2017) BMC Evolutionary Biology







