Skip to main navigation Skip to search Skip to main content

Overlapping word removal is all you need: revisiting data imbalance in hope speech detection

  • Hariharan RamakrishnaIyer LekshmiAmmal
  • , Manikandan Ravikiran
  • , Gayathri Nisha
  • , Navyasree Balamuralidhar
  • , Adithya Madhusoodanan
  • , Anand Kumar Madasamy
  • , Bharathi Raja Chakravarthi
  • National Institute of Technology Karnataka
  • Georgia Institute of Technology
  • University of Galway

Research output: Contribution to a Journal (Peer & Non Peer)Articlepeer-review

3 Citations (Scopus)

Abstract

Hope speech detection is a new task for finding and highlighting positive comments or supporting content from user-generated social media comments. For this task, we have used a Shared Task multilingual dataset on Hope Speech Detection for Equality, Diversity, and Inclusion (HopeEDI) for three languages English, code-switched Tamil and Malayalam. In this paper, we present deep learning techniques using context-aware string embeddings for word representations and Recurrent Neural Network (RNN) and pooled document embeddings for text representation. We have evaluated and compared the three models for each language with different approaches. Our proposed methodology works fine and achieved higher performance than baselines. The highest weighted average F-scores of 0.93, 0.58, and 0.84 are obtained on the task organisers{'} final evaluation test set. The proposed models are outperforming the baselines by 3{\%}, 2{\%} and 11{\%} in absolute terms for English, Tamil and Malayalam respectively.

Original languageEnglish
Pages (from-to)1837-1859
Number of pages23
JournalJournal of Experimental and Theoretical Artificial Intelligence
Volume36
Issue number8
DOIs
Publication statusPublished - 2024
Externally publishedYes

Keywords

  • Hope Speech Detection
  • Language modelling
  • data imbalance
  • focal loss
  • text classification

Fingerprint

Dive into the research topics of 'Overlapping word removal is all you need: revisiting data imbalance in hope speech detection'. Together they form a unique fingerprint.

Cite this