Breast dense tissue segmentation with noisy labels: a hybrid threshold-based and mask-based approach

Larroza, Andrés; Pérez-Benito, Francisco Javier; Perez-Cortes, Juan-Carlos; Román, Marta; Pollán, Marina; Pérez-Gómez, Beatriz; Salas-Trejo, Dolores; Casals, María; Llobet, Rafael

dc.contributor.author Larroza, Andrés
dc.contributor.author Pérez-Benito, Francisco Javier
dc.contributor.author Perez-Cortes, Juan-Carlos
dc.contributor.author Román, Marta
dc.contributor.author Pollán, Marina
dc.contributor.author Pérez-Gómez, Beatriz
dc.contributor.author Salas-Trejo, Dolores
dc.contributor.author Casals, María
dc.contributor.author Llobet, Rafael
dc.date.accessioned 2023-02-01T07:37:36Z
dc.date.available 2023-02-01T07:37:36Z
dc.date.issued 2022
dc.description.abstract Breast density assessed from digital mammograms is a known biomarker related to a higher risk of developing breast cancer. Supervised learning algorithms have been implemented to estimate it. However, the performance of these algorithms depends on the quality of the ground-truth information, which expert readers usually provide. These expert labels are noisy approximations to the ground truth, as there is both intra- and inter-observer variability among them. Thus, it is crucial to provide a reliable method to measure breast density from mammograms. This paper presents a fully automated method based on deep learning to estimate breast density, including breast detection, pectoral muscle exclusion, and dense tissue segmentation. We propose a novel confusion matrix (CM)-YNet model for the segmentation step. This architecture includes networks to model each radiologist's noisy label and gives the estimated ground-truth segmentation as well as two parameters that allow interaction with a threshold-based labeling tool. A multi-center study involving 1785 women whose "for presentation" mammograms were obtained from 11 different medical facilities was performed. A total of 2496 mammograms were used as the training corpus, and 844 formed the testing corpus. Additionally, we included a totally independent dataset from a different center, composed of 381 women with one image per patient. Each mammogram was labeled independently by two expert radiologists using a threshold-based tool. The implemented CM-YNet model achieved the highest DICE score averaged over both test datasets (0.82±0.14) when compared to the closest dense-tissue segmentation assessment from both radiologists. The level of concordance between the two radiologists showed a DICE score of 0.76±0.17. An automatic breast density estimator based on deep learning exhibited higher performance when compared with two experienced radiologists. This suggests that modeling each radiologist's label allows for better estimation of the unknown ground-truth segmentation. The advantage of the proposed model is that it also provides the threshold parameters that enable user interaction with a threshold-based tool.
dc.format.mimetype application/pdf
dc.identifier.citation Larroza A, Pérez-Benito FJ, Perez-Cortes JC, Román M, Pollán M, Pérez-Gómez B, et al. Breast dense tissue segmentation with noisy labels: a hybrid threshold-based and mask-based approach. Diagnostics (Basel). 2022 Jul 28; 12(8): 1822. DOI: 10.3390/diagnostics12081822
dc.identifier.doi http://dx.doi.org/10.3390/diagnostics12081822
dc.identifier.issn 2075-4418
dc.identifier.uri http://hdl.handle.net/10230/55509
dc.language.iso eng
dc.publisher MDPI
dc.rights Copyright © 2022 by Larroza A, Pérez-Benito FJ, Perez-Cortes JC, Román M, Pollán M, Pérez-Gómez B, et al. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
dc.rights.accessRights info:eu-repo/semantics/openAccess
dc.rights.uri http://creativecommons.org/licenses/by/4.0/
dc.subject.keyword Breast density segmentation
dc.subject.keyword Deep learning
dc.subject.keyword Mammography
dc.subject.keyword Noisy labels
dc.title Breast dense tissue segmentation with noisy labels: a hybrid threshold-based and mask-based approach
dc.type info:eu-repo/semantics/article
dc.type.version info:eu-repo/semantics/publishedVersion
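
Note on the DICE scores reported in the abstract: the model is compared against the closest of the two radiologists' dense-tissue segmentations, and the radiologists are also compared with each other. The following is a minimal illustrative sketch of how such scores could be computed, not the authors' code. It assumes binary NumPy masks; the function names `dice_score` and `closest_reader_dice`, the empty-mask convention, and the toy masks are all hypothetical.

```python
import numpy as np


def dice_score(pred, ref):
    """Dice coefficient 2|A∩B| / (|A| + |B|) between two binary masks."""
    pred = np.asarray(pred, dtype=bool)
    ref = np.asarray(ref, dtype=bool)
    total = pred.sum() + ref.sum()
    if total == 0:
        # Assumption: two empty masks are treated as perfect agreement.
        return 1.0
    return float(2.0 * np.logical_and(pred, ref).sum() / total)


def closest_reader_dice(pred, reader_masks):
    """Best Dice of a prediction against any of the readers' masks
    (one reading of "closest assessment from both radiologists")."""
    return max(dice_score(pred, mask) for mask in reader_masks)


# Toy 2x3 masks standing in for dense-tissue segmentations (made-up data).
pred = np.array([[1, 1, 0], [0, 1, 0]])
reader_a = np.array([[1, 1, 0], [0, 0, 0]])
reader_b = np.array([[1, 1, 1], [0, 1, 1]])

model_vs_closest = closest_reader_dice(pred, [reader_a, reader_b])
inter_reader = dice_score(reader_a, reader_b)
print(model_vs_closest, inter_reader)
```

In a study setting, these per-image values would be averaged over the test corpus to obtain a mean and standard deviation; the abstract does not state further details of that aggregation.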

Collections

Articles (Hospital del Mar Research Institute)