TY - JOUR
T1 - An end-to-end weakly supervised learning framework for cancer subtype classification using histopathological slides
AU - Zhou, Hongren
AU - Chen, Hechang
AU - Yu, Bo
AU - Pang, Shuchao
AU - Cong, Xianling
AU - Cong, Lele
PY - 2024/3/1
Y1 - 2024/3/1
N2 - AI-powered analysis of histopathology data has become an invaluable assistant for pathologists due to its efficiency and accuracy. However, existing deep learning methods still face some challenges in specifying cancer subtypes. For example, the ultra-high resolution of histopathological slides generally contains numerous redundant features, which are not useful for cancer subtype classification and thus lead to considerable computational costs. Moreover, the lack of manual annotations of disease-specific regions (i.e., patch-level annotations) from experts makes it more difficult to learn such histological features with only slide-level labels. In this paper, we propose an end-to-end weakly supervised learning framework called EWSLF to address these issues. First, we employ a cluster-based sampling strategy to refine the histological features for further training, which can improve classification accuracy and reduce computational cost. Second, we employ a multi-branch attention mechanism to produce patch-level pseudo-labels and aggregate the patch features into slide-level features, which can complement the missing patch-level labels from experts. Experimental results on both public and in-house datasets demonstrate the superiority and credible results of our model compared with the state-of-the-art methods for cancer subtype classification. Code: https://github.com/hongren21/ewslf.
AB - AI-powered analysis of histopathology data has become an invaluable assistant for pathologists due to its efficiency and accuracy. However, existing deep learning methods still face some challenges in specifying cancer subtypes. For example, the ultra-high resolution of histopathological slides generally contains numerous redundant features, which are not useful for cancer subtype classification and thus lead to considerable computational costs. Moreover, the lack of manual annotations of disease-specific regions (i.e., patch-level annotations) from experts makes it more difficult to learn such histological features with only slide-level labels. In this paper, we propose an end-to-end weakly supervised learning framework called EWSLF to address these issues. First, we employ a cluster-based sampling strategy to refine the histological features for further training, which can improve classification accuracy and reduce computational cost. Second, we employ a multi-branch attention mechanism to produce patch-level pseudo-labels and aggregate the patch features into slide-level features, which can complement the missing patch-level labels from experts. Experimental results on both public and in-house datasets demonstrate the superiority and credible results of our model compared with the state-of-the-art methods for cancer subtype classification. Code: https://github.com/hongren21/ewslf.
KW - attention mechanism
KW - histopathological data
KW - interpretable diagnosis
KW - subtype classification
KW - weakly supervised
UR - http://www.scopus.com/inward/record.url?scp=85171803626&partnerID=8YFLogxK
U2 - 10.1016/j.eswa.2023.121379
DO - 10.1016/j.eswa.2023.121379
M3 - Article
AN - SCOPUS:85171803626
SN - 0957-4174
VL - 237
SP - 1
EP - 12
JO - Expert Systems with Applications
JF - Expert Systems with Applications
M1 - 121379
ER -