Adaptive clustering for EGFR amplification prediction in glioblastoma: a Variational Autoencoder-Dirichlet Bayesian Gaussian approach

Research output: Chapter in Book/Report/Conference proceeding › Conference proceeding contribution › peer-review

Abstract

Glioblastoma (GBM), an aggressive brain tumor, is notorious for its resistance to treatment due to its high heterogeneity and rapid growth. The epidermal growth factor receptor (EGFR) is an important diagnostic, prognostic, and therapeutic biomarker in GBM. With advances in digital pathology, deep learning models, especially Multiple Instance Learning (MIL)-based approaches, have achieved promising results in tumor classification. However, MIL models are often task-specific, which constrains their generalizability. In contrast, the morphological redundancy in tissue can be leveraged to build task-agnostic slide representations in an unsupervised manner, as in the recently introduced morphological prototype-based PANTHER model. PANTHER can improve classification performance, but its K-Means clustering depends on a fixed, predefined number of prototypes, which may cause over- or under-clustering and thereby reduce classification performance. To address this limitation, we propose an adaptive Variational Autoencoder-Dirichlet Bayesian Gaussian Mixture Model (VAE-DBGMM) that learns the optimal prototypes dynamically. Using the TCGA-GBM dataset with EGFR labeling, we evaluated our adaptive approach against the PANTHER model with predefined numbers of prototypes (8, 16, 18, 32) and three state-of-the-art MIL models (CLAM, TransMIL, and DTFD). The results demonstrate that the optimal prototypes derived from VAE-DBGMM significantly improved classification performance, achieving an AUC of 0.795 ± 0.0105 and outperforming the PANTHER and MIL baselines. Furthermore, testing on the external CPTAC-EGFR dataset demonstrates the robustness and generalizability of our approach. These findings emphasise the significance of adaptive clustering in improving EGFR biomarker classification in GBM.
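
The idea of letting the data decide how many prototypes to keep can be illustrated with a minimal sketch. This is not the authors' VAE-DBGMM implementation: it assumes scikit-learn's `BayesianGaussianMixture` with a Dirichlet-process prior as a stand-in for the Bayesian mixture, uses synthetic 2-D points in place of VAE latent embeddings of tissue patches, and the helper name `count_effective_prototypes`, the upper bound of 16 components, and the 0.01 weight threshold are all hypothetical choices made for illustration.

```python
import numpy as np
from sklearn.mixture import BayesianGaussianMixture


def count_effective_prototypes(embeddings, max_prototypes=16,
                               weight_threshold=0.01, seed=0):
    """Fit a Dirichlet-process GMM and keep components with non-negligible weight.

    `max_prototypes` is only an upper bound; the prior can shrink the
    weights of components the data does not support.
    """
    model = BayesianGaussianMixture(
        n_components=max_prototypes,
        weight_concentration_prior_type="dirichlet_process",
        covariance_type="diag",
        max_iter=500,
        random_state=seed,
    )
    model.fit(embeddings)
    # Components whose mixing weight exceeds the (assumed) threshold are
    # treated as the active prototypes.
    active = model.weights_ > weight_threshold
    return int(active.sum()), model.means_[active]


# Synthetic stand-in for latent patch embeddings: three separated blobs.
rng = np.random.default_rng(0)
centers = np.array([[0.0, 0.0], [10.0, 0.0], [0.0, 10.0]])
embeddings = np.vstack(
    [c + rng.normal(scale=0.5, size=(200, 2)) for c in centers]
)

n_active, prototypes = count_effective_prototypes(embeddings)
print(f"active prototypes: {n_active} of 16 allowed")
```

In contrast to K-Means, where the number of clusters is fixed in advance, the number of active components here is an outcome of the fit; how closely it matches the true structure depends on the prior, the initialisation, and the chosen threshold.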
Original language: English
Title of host publication: Artificial Intelligence in Medicine
Subtitle of host publication: 23rd International Conference, AIME 2025, Proceedings, Part I
Editors: Riccardo Bellazzi, José Manuel Juarez Herrero, Lucia Sacchi, Blaž Zupan
Place of Publication: Switzerland
Publisher: Springer, Springer Nature
Pages: 88-97
Number of pages: 10
ISBN (Electronic): 9783031958380
ISBN (Print): 9783031958373
DOIs
Publication status: Published - 23 Jun 2025
Event: 23rd Conference on Artificial Intelligence in Medicine in Europe, AIME 2025 - Pavia, Italy
Duration: 23 Jun 2025 → 26 Jun 2025

Publication series

Name: Lecture Notes in Artificial Intelligence
Volume: 15734
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 23rd Conference on Artificial Intelligence in Medicine in Europe, AIME 2025
Country/Territory: Italy
City: Pavia
Period: 23/06/25 → 26/06/25

Keywords

  • Adaptive Clustering
  • Variational Encoder
  • Bayesian Model
  • Glioblastoma
  • EGFR
  • Multi Instance Learning
