LowResourceNLU at BLP-2023 task 1 & 2: enhancing sentiment classification and violence incitement detection in Bangla through aggregated language models

Hariram Veeramani, Surendrabikram Thapa, Usman Naseem

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contributionpeer-review

5 Citations (Scopus)

Abstract

Violence incitement detection and sentiment analysis hold significant importance in the field of natural language processing. However, in the case of the Bangla language, there are unique challenges due to its low-resource nature. In this paper, we address these challenges by presenting an innovative approach that leverages aggregated BERT models for two tasks at the BLP workshop in EMNLP 2023, specifically tailored for Bangla. Task 1 focuses on violence-inciting text detection, while task 2 centers on sentiment analysis. Our approach combines fine-tuning with textual entailment (utilizing BanglaBERT), Masked Language Model (MLM) training (making use of BanglaBERT), and the use of standalone Multilingual BERT. This comprehensive framework significantly enhances the accuracy of sentiment classification and violence incitement detection in Bangla text. Our method achieved the 11th rank in task 1 with an F1-score of 73.47 and the 4th rank in task 2 with an F1-score of 71.73. This paper provides a detailed system description along with an analysis of the impact of each component of our framework.

Original languageEnglish
Title of host publicationProceedings of the First Workshop on Bangla Language Processing (BLP-2023)
Place of PublicationStroudsburg
PublisherAssociation for Computational Linguistics (ACL)
Pages230–235
Number of pages6
ISBN (Electronic)9798891760585
DOIs
Publication statusPublished - 2023
Externally publishedYes
Event1st Workshop on Bangla Language Processing, BLP 2023 - Singapore, Singapore
Duration: 7 Dec 20237 Dec 2023

Conference

Conference1st Workshop on Bangla Language Processing, BLP 2023
Country/TerritorySingapore
CitySingapore
Period7/12/237/12/23

Fingerprint

Dive into the research topics of 'LowResourceNLU at BLP-2023 task 1 & 2: enhancing sentiment classification and violence incitement detection in Bangla through aggregated language models'. Together they form a unique fingerprint.

Cite this