TY - GEN
T1 - Adaptive clustering for EGFR amplification prediction in glioblastoma
T2 - 23rd Conference on Artificial Intelligence in Medicine in Europe, AIME 2025
AU - Mehr, Homay Danaei
AU - Cong, Cong
AU - Noorani, Imran
AU - Ieva, Antonio Di
AU - Liu, Sidong
PY - 2025/6/23
Y1 - 2025/6/23
N2 - Glioblastoma (GBM) - an aggressive brain tumor- is notorious for its resistance to treatments due to its high heterogeneity and rapid growth. The epidermal growth factor receptor (EGFR) plays an important role in the diagnostic, prognostic, and therapeutic biomarkers of GBM. With advancements in digital pathology, deep learning models, especially Multiple Instance Learning (MIL)-based approaches, have achieved promising results in tumor classification. However, MIL models are often task-specific, constraining their generalizability. On the other hand, the morphological redundancy in tissue can be leveraged to provide task-agnostic slide representation in an unsupervised approach like the newly emerged morphological prototype-based PANTHER model. PANTHER could improve the classification performance; however, its K-Means clustering depends on a fixed and predefined number of prototypes, which may cause over or under-clustering, reducing the classification performance. To address this limitation, we proposed an adaptive Variational Autoencoder-Dirichlet Bayesian Gaussian Mixture Model (VAE-DBGMM) to learn optimal prototypes dynamically. Using the TCGA-GBM dataset with EGFR labeling, we evaluated our adaptive approach against the PANTHER model with predefined numbers of prototypes (8, 16, 18, 32) and three state-of-the-art MIL models (CLAM, TransMIL, and DTFD). The results demonstrate that the optimal prototypes derived from VAE-DBGMM significantly improved classification performance, achieving an AUC of 0.795 ± 0.0105, outperforming PANTHER and MIL baselines. Furthermore, testing on the external CPTAC-EGFR dataset demonstrates the robustness and generalizability of our approach. These findings emphasise the significance of adaptive clustering in improving EGFR biomarker classification in GBM.
AB - Glioblastoma (GBM) - an aggressive brain tumor- is notorious for its resistance to treatments due to its high heterogeneity and rapid growth. The epidermal growth factor receptor (EGFR) plays an important role in the diagnostic, prognostic, and therapeutic biomarkers of GBM. With advancements in digital pathology, deep learning models, especially Multiple Instance Learning (MIL)-based approaches, have achieved promising results in tumor classification. However, MIL models are often task-specific, constraining their generalizability. On the other hand, the morphological redundancy in tissue can be leveraged to provide task-agnostic slide representation in an unsupervised approach like the newly emerged morphological prototype-based PANTHER model. PANTHER could improve the classification performance; however, its K-Means clustering depends on a fixed and predefined number of prototypes, which may cause over or under-clustering, reducing the classification performance. To address this limitation, we proposed an adaptive Variational Autoencoder-Dirichlet Bayesian Gaussian Mixture Model (VAE-DBGMM) to learn optimal prototypes dynamically. Using the TCGA-GBM dataset with EGFR labeling, we evaluated our adaptive approach against the PANTHER model with predefined numbers of prototypes (8, 16, 18, 32) and three state-of-the-art MIL models (CLAM, TransMIL, and DTFD). The results demonstrate that the optimal prototypes derived from VAE-DBGMM significantly improved classification performance, achieving an AUC of 0.795 ± 0.0105, outperforming PANTHER and MIL baselines. Furthermore, testing on the external CPTAC-EGFR dataset demonstrates the robustness and generalizability of our approach. These findings emphasise the significance of adaptive clustering in improving EGFR biomarker classification in GBM.
KW - Adaptive Clustering
KW - Variational Encoder
KW - Bayesian Model
KW - Glioblastoma
KW - EGFR
KW - Multi Instance Learning
UR - http://www.scopus.com/inward/record.url?scp=105009789819&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-95838-0_9
DO - 10.1007/978-3-031-95838-0_9
M3 - Conference proceeding contribution
SN - 9783031958373
T3 - Lecture Notes in Artificial Intelligence
SP - 88
EP - 97
BT - Artificial Intelligence in Medicine
A2 - Bellazzi, Riccardo
A2 - Juarez Herrero, José Manuel
A2 - Sacchi, Lucia
A2 - Zupan, Blaž
PB - Springer, Springer Nature
CY - Switzerland
Y2 - 23 June 2025 through 26 June 2025
ER -