Zonation of stratigraphic successions is a key practice for identifying intervals characterised by stability or, conversely, by palaeoenvironmental changes. Stratigraphically constrained agglomerative algorithms have been commonly adopted to obtain zonation based on quantitative palaeontological data. Here we explore constrained divisive algorithms aiming at obtaining a zonation that meets the principle of maximizing coefficients commonly used to evaluate the effectiveness of clustering algorithms. In particular, a constrained version of Cavalli Sforza’s method was applied, together with an algorithm conceived to maximise, at each division, the average silhouette width of the observations. The results were compared, following the compositional data analysis properties, with those obtained with a commonly adopted agglomerative method. When evaluated on artificial data, the divisive algorithms show stability and a tendency to identify the boundary between intervals at the midpoint of transitions, consistently with common stratigraphic practice. Overall, the application to real micropalaeontological data, consisting of percentages of planktonic foraminifera, provide reasonable zonation patterns with all algorithms considered. For the main partition, the constrained version of Cavalli Sforza’s method provides highest values of Calinski–Harabasz and Hartigan indexes, while the average silhouette width method, as expected, performs better in the evaluation of average silhouette width index as well as of Goodman–Kruskal’s coefficient and cophenetic correlation. One potential issue to consider is the tendency to define single sample intervals as the number of partitions increases
This research was supported by the Ministerio de Ciencia e Innovación under the projects “CODA-GENERA” (Ref. PID2021-123833OB-I00) and “CONBACO” (Ref. PID2021-125380OB-I00); and by the Agència de Gestió d'Ajuts Universitaris i de Recerca of the Generalitat de Catalunya under the project "COSDA" (Ref. 2021SGR01197). Open Access funding provided thanks to the CRUE-CSIC agreement with Springer Nature
Article
Versió publicada
peer-reviewed
Anglès
Algorismes; Algorithms; Anàlisi de conglomerats; Cluster analysis; Paleontologia -- Models matemàtics; Paleontology -- Mathematical models
Springer
info:eu-repo/semantics/altIdentifier/doi/10.1007/s11004-025-10205-5
info:eu-repo/semantics/altIdentifier/issn/1874-8961
info:eu-repo/semantics/altIdentifier/eissn/1874-8953
Attribution 4.0 International
http://creativecommons.org/licenses/by/4.0/