Vision-language models for biomedical applications

Surendrabikram Thapa, Usman Naseem, Luping Zhou, Jinman Kim

Research output: Chapter in Book/Report/Conference proceedingForeword/postscript/introductionpeer-review

13 Downloads (Pure)

Abstract

Vision-language models (VLMs) are transforming the landscape of biomedical research and healthcare by enabling the seamless integration and interpretation of complex multimodal data, including medical images and clinical texts. Recognizing the growing impact of these models, the first international workshop on Vision-Language Models for Biomedicine (VLM4Bio) was held in conjunction with ACM Multimedia 2024. The workshop aimed to address the critical need for advanced techniques that can leverage VLMs in applications such as medical imaging, diagnostics, and personalized treatment. As healthcare data increasingly involves both visual and textual information, VLM4Bio provided a platform for interdisciplinary collaboration between experts in natural language processing, computer vision, biomedical engineering, and AI ethics. This paper provides an overview of the inaugural edition of the VLM4Bio workshop, summarizing the key discussions, contributions, and future directions for expanding the workshop's scope and influence in subsequent editions.
Original languageEnglish
Title of host publicationVLM4Bio '24
Subtitle of host publicationproceedings of the First International Workshop on Vision-Language Models for Biomedical Applications
Place of PublicationNew York
PublisherAssociation for Computing Machinery
Pages1-2
Number of pages2
ISBN (Electronic)9798400712074
DOIs
Publication statusPublished - 2024
EventFirst International Workshop on Vision-Language Models for Biomedical Applications (1st : 2024) - Melbourne, Australia
Duration: 28 Oct 20241 Nov 2024
Conference number: 1st

Conference

ConferenceFirst International Workshop on Vision-Language Models for Biomedical Applications (1st : 2024)
Abbreviated titleVLM4Bio 2024
Country/TerritoryAustralia
CityMelbourne
Period28/10/241/11/24

Keywords

  • Vision-Language Models (VLMs)
  • Multimodal Biomedical AI
  • Visual Question Answering (VQA)
  • Clinical Decision Support Systems
  • Healthcare Applications

Cite this