Uncertainty-aware segmentation quality prediction via deep learning Bayesian modeling: comprehensive evaluation and interpretation on skin cancer and liver segmentation

dc.contributor.author
Okkath Krishnanunni, Sikha
dc.contributor.author
Riera-Marin, Meritxell
dc.contributor.author
Galdran, Adrian
dc.contributor.author
García López, Javier
dc.contributor.author
Rodriguez-Comas, Júlia
dc.contributor.author
Piella Fenoy, Gemma
dc.contributor.author
González Ballester, Miguel Ángel, 1973-
dc.date.accessioned
2026-03-18T00:29:36Z
dc.date.available
2026-03-18T00:29:36Z
dc.date.issued
2026-03-17T13:30:43Z
dc.date.issued
2026-03-17T13:30:43Z
dc.date.issued
2025
dc.date.issued
2026-03-17T13:30:43Z
dc.identifier
Okkath Krishnanunni S, Riera-Marin M, Galdran A, García López J, Rodriguez-Comas J, Piella Fenoy G, González Ballester MÁ. Uncertainty-aware segmentation quality prediction via deep learning Bayesian modeling: comprehensive evaluation and interpretation on skin cancer and liver segmentation. Comput Med Imaging Graph. 2025 Jul;123:102547. DOI: 10.1016/j.compmedimag.2025.102547
dc.identifier
0895-6111
dc.identifier
https://hdl.handle.net/10230/72828
dc.identifier
http://dx.doi.org/10.1016/j.compmedimag.2025.102547
dc.identifier.uri
https://hdl.handle.net/10230/72828
dc.description.abstract
Image segmentation is a critical step in computational biomedical image analysis, typically evaluated using metrics like the Dice coefficient during training and validation. However, in clinical settings without manual annotations, assessing segmentation quality becomes challenging, and models lacking reliability indicators face adoption barriers. To address this gap, we propose a novel framework for predicting segmentation quality without requiring ground truth annotations during test time. Our approach introduces two complementary frameworks: one leveraging predicted segmentation and uncertainty maps, and another integrating the original input image, uncertainty maps, and predicted segmentation maps. We present Bayesian adaptations of two benchmark segmentation models -SwinUNet and Feature Pyramid Network with ResNet50- using Monte Carlo Dropout, Ensemble, and Test Time Augmentation to quantify uncertainty. We evaluate four uncertainty estimates -confidence map, entropy, mutual information, and expected pairwise Kullback-Leibler divergence- on 2D skin lesion and 3D liver segmentation datasets, analyzing their correlation with segmentation quality metrics. Our framework achieves an R score of 93.25 and Pearson correlation of 96.58 on the HAM10000 dataset, outperforming previous segmentation quality assessment methods. For 3D liver segmentation, Test Time Augmentation with entropy achieves an R score of 85.03 and a Pearson correlation of 65.02, demonstrating cross-modality robustness. Additionally, we propose an aggregation strategy that combines multiple uncertainty estimates into a single score per image, offering a more robust and comprehensive assessment of segmentation quality compared to evaluating each measure independently. The proposed uncertainty-aware segmentation quality prediction network is interpreted using gradient-based methods such as Grad-CAM and feature embedding analysis through UMAP. These techniques provide insights into the model's behavior and reliability, helping to assess the impact of incorporating uncertainty into the segmentation quality prediction pipeline. The code is available at: https://github.com/sikha2552/Uncertainty-Aware-Segmentation-Quality-Prediction-Bayesian-Modeling-with-Comprehensive-Evaluation-.
dc.format
application/pdf
dc.format
application/pdf
dc.language
eng
dc.publisher
Elsevier
dc.relation
Computerized Medical Imaging and Graphics. 2025 Jul;123:102547
dc.rights
© 2025 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
dc.rights
http://creativecommons.org/licenses/by-nc-nd/4.0/
dc.rights
info:eu-repo/semantics/openAccess
dc.subject
Image segmentation
dc.subject
Ground-truth free performance evaluation
dc.subject
Uncertainty quantification
dc.subject
Uncertainty aggregate score
dc.subject
Explainable AI
dc.title
Uncertainty-aware segmentation quality prediction via deep learning Bayesian modeling: comprehensive evaluation and interpretation on skin cancer and liver segmentation
dc.type
info:eu-repo/semantics/article
dc.type
info:eu-repo/semantics/publishedVersion


Fitxers en aquest element

FitxersGrandàriaFormatVisualització

No hi ha fitxers associats a aquest element.

Aquest element apareix en la col·lecció o col·leccions següent(s)