Multi-label classification based on subcellular region-guided feature description for protein localisation

Priyanka Rana, Erik Meijering, Arcot Sowmya, Yang Song

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contributionpeer-review

4 Citations (Scopus)

Abstract

In this paper, we present a multi-label classification pipeline and a novel feature descriptor for the protein subcellular localisation. The challenge here is the development of a computational model that can classify multi-site proteins on a highly imbalanced dataset with a long-tail distribution and multi-label images. To address this challenge, we design a Location-Sorted Random Projections feature descriptor to represent image intensity and gradient of the protein of interest in reference to the correlated cellular region. Multilabel Synthetic Minority Over-sampling Technique is optimised to generate synthetic features with labels to handle class imbalance. Our method achieves the state-of-the-art performance on a large-scale public dataset and demonstrates excellent performance for the minority classes.
Original languageEnglish
Title of host publicationProceedings of the 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI)
Place of PublicationNice
PublisherIEEE:Institute of Electrical Electronics Engineers Inc
Pages1929-1933
Number of pages5
ISBN (Electronic)9781665412469
ISBN (Print)9781665429474
DOIs
Publication statusPublished - 16 Apr 2021
Externally publishedYes
Event18th IEEE International Symposium on Biomedical Imaging, ISBI 2021 - Nice, France
Duration: 13 Apr 202116 Apr 2021

Publication series

NameIEEE International Symposium on Biomedical Imaging
PublisherIEEE
ISSN (Print)1945-7928

Conference

Conference18th IEEE International Symposium on Biomedical Imaging, ISBI 2021
Country/TerritoryFrance
CityNice
Period13/04/2116/04/21

Keywords

  • Protein subcellular localisation
  • multilabel classification
  • sorted random projections

Fingerprint

Dive into the research topics of 'Multi-label classification based on subcellular region-guided feature description for protein localisation'. Together they form a unique fingerprint.

Cite this