We found 2,207 cryptic species reports (CSR) under 771,931 studies that were suitable to detect cryptic species. Log10
number of CSR were correlated with the log10
of estimated number of described species in different metazoan taxonomic groups (Fig. , R2
= 0.53, F1,18
= 20.69, P
= 0.0002). Deviations from the regression line, termed CSR taxon variation, are presumably composed of differences in study intensity and taxonomic practice in the respective research community, true differences among taxonomic groups and random error. In order to assess the impact of differential research intensity, we regressed the log10
number of studies on the log10
number of described species in the respective taxonomic group; the residuals were used as a measure of taxonomic study bias (R2
= 0.64, F1,18
= 8.63, P
= 0.0088). This parameter explained part of the CSR taxon variation (Fig. , R2
= 0.47, F1,18
= 15.67, P
= 0.0009). Assuming that the numbers of CSR are proportional to the true number of cryptic species, their distribution is nearly homogeneous across the taxa analysed. Only six groups fell outside the 95% confidence intervals: Mammalia, Amphibia and combined smaller Arthropoda classes exhibited an excess of CSR, while Bivalvia, Arachnida and combined smaller Mollusca classes exhibited a deficit of CSR. At least for vertebrates, we suspect that these results might result from taxonomic inflation [2
]. Overall, differential taxonomic practice in the various research communities seemingly exerted no major impact on the results.
The log10 of cryptic species reports (CSR) as a function of the log10 number of described species in the respective taxon. Deviations from the regression line represent CSR taxon variation. Dashed lines represent 95% confidence intervals.
Regression of CSR taxon variation on taxon study bias for 19 metazoan taxa. Dashed lines represent 95% confidence intervals.
Applying the same procedure to the classic biogeographical regions revealed a marginally significant correlation between the number of CSR and the proportion of described species in the respective region (Fig. , R2
= 0.59, F1,5
= 7.34, P
= 0.0423). Study intensity of biogeographical regions was independent of estimated metazoan species richness (R2
= 0.41, F1,5
= 3.41, P
= 0.1242). However, regressing the residuals against each other, the same nearly neutral pattern emerged (Fig. , R2
= 0.89, F1,5
= 40.44, P
= 0.0014), which contradicts the view that tropical regions harbour relatively more cryptic species [1
The log10 of CSR as a function of the log10 number of described species in the respective region. Deviations from the regression line represent CSR region variation. Dashed lines represent 95% confidence intervals.
Regression of CSR taxon variation on biogeographical region study bias. Dashed lines represent 95% confidence intervals.