VR-DiagNet: medical volumetric and radiomic diagnosis networks with interpretable clinician-like optimizing visual inspection

Shouyu Chen, Liang Hu*, Tangwei Ye, Laizhong Yuan, Qi Zhang, Ke Liu, Usman Naseem, Ke Sun, Nengjun Zhu

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contributionpeer-review

Abstract

Interpretable and robust medical diagnoses are essential traits for practicing clinicians. Most computer-augmented diagnostic systems suffer from three major problems: non-interpretability, limited modality analysis, and narrow focus. Existing frameworks can either deal with multimodality to some extent but suffer from non-interpretability or partially interpretable but provide a limited modality and multifaceted capabilities. Our work aims to integrate all these aspects in one complete framework to fully utilize the full spectrum of information offered by multiple modalities and facets. We propose our solution via our novel architecture VR-DiagNet, consisting of a planner and a classifier, optimized iteratively and cohesively. VR-DiagNet simulates the perceptual process of clinicians via the use of volumetric imaging information integrated with radiomic features modality; at the same time, it recreates human thought processes via a customized Monte Carlo Tree Search (MCTS) which constructs a volume-tailored experience tree to identify slices of interest (SoIs) in our multi-slice perception space. We conducted extensive experiments across two diagnostic tasks comprising six public medical volumetric benchmark datasets. Our findings showcase superior performance, as evidenced by heightened accuracy and area under the curve (AUC) metrics, reduced computational overhead, and expedited convergence while conclusively illustrating the immense value of integrating volumetric and radiomic modalities for our current problem setup.
Original languageEnglish
Title of host publicationMM '24
Subtitle of host publicationproceedings of the 32nd ACM International Conference on Multimedia
Place of PublicationNew York
PublisherAssociation for Computing Machinery
Pages10459-10467
Number of pages9
ISBN (Electronic)9798400706868
DOIs
Publication statusPublished - 2024
EventACM International Conference on Multimedia (32nd : 2024) - Melbourne, Australia
Duration: 28 Oct 20241 Nov 2024
Conference number: 32nd

Conference

ConferenceACM International Conference on Multimedia (32nd : 2024)
Abbreviated titleMM '24
Country/TerritoryAustralia
CityMelbourne
Period28/10/241/11/24

Keywords

  • Medical Volume Diagnosis
  • Multimodality
  • Radiomics
  • Unsupervised Planning

Cite this