FAM-Logo: forward compatible multimodal framework for Few-Shot Logo Incremental classification

Sujuan Hou, Jianxin Zhan, Hao Xiong*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contributionpeer-review

Abstract

Logo classification has gained increasing attention for its various applications, including copyright protection, product recommendation and contextual advertising. However, the rapid emergence of new companies and product logo categories presents challenges to existing logo classification models. Often, only a limited number of examples are available for these new categories. A robust classification model should incrementally learn and recognize new logo classes while maintaining its ability to discriminate between existing ones. To address this, we formulate the task of Logo Few-Shot Class-Incremental Learning (Logo-FSCIL). In this work, we propose a new paradigm for Logo-FSCIL. Specifically, we adopt a forward-compatible embedding space reservation strategy and develop a prompt-based multimodal alignment framework to mitigate catastrophic forgetting. Furthermore, taking into account the unique attributes of logos, we devise an enhancement alignment strategy guided by textual information. Experimental results on three logo datasets demonstrate that our method significantly outperforms state-of-the-art FSCIL benchmarks. Extensive experiments on three generic datasets further validate the generalization capability of our approach. Datasets and code can be found at https://github.com/housujuan123/Logo-FSCIL.

Original languageEnglish
Title of host publicationACMMM CL 2024
Subtitle of host publicationProceedings of the 1st Continual Learning meets Multimodal Foundation Models: Fundamentals and Advances
Place of PublicationNew York, NY
PublisherAssociation for Computing Machinery (ACM)
Pages31-38
Number of pages8
ISBN (Electronic)9798400711886
DOIs
Publication statusPublished - 28 Oct 2024
Event1st Continual Learning meets Multimodal Foundation Models: Fundamentals and Advances, ACMMM CL 2024 - Melbourne, Australia
Duration: 28 Oct 20241 Nov 2024

Conference

Conference1st Continual Learning meets Multimodal Foundation Models: Fundamentals and Advances, ACMMM CL 2024
Country/TerritoryAustralia
CityMelbourne
Period28/10/241/11/24

Keywords

  • Few-Shot Class Incremental Learning
  • Logo Classification

Cite this