
Findings of the Shared Task on Offensive Span Identification from Code-Mixed Tamil-English Comments

  • Manikandan Ravikiran
  • Bharathi Raja Chakravarthi
  • Anand Kumar Madasamy
  • Sangeetha Sivanesan
  • Ratnavel Rajalakshmi
  • Sajeetha Thavareesan
  • Rahul Ponnusamy
  • Shankar Mahadevan
  • Georgia Institute of Technology
  • University of Galway
  • National Institute of Technology Karnataka
  • National Institute of Technology Tiruchirappalli
  • Vellore Institute of Technology, Chennai
  • Eastern University, Sri Lanka
  • Indian Institute of Information Technology and Management
  • Thiagarajar College of Engineering

Research output: Chapter in Book or Conference Publication/Proceeding, Conference Publication, peer-review

49 Citations (Scopus)

Abstract

Offensive content moderation is vital on social media platforms to support healthy online discussions. However, existing work on code-mixed Dravidian languages is limited to classifying whole comments, without identifying the parts of a comment that contribute to its offensiveness. This limitation is primarily due to the lack of data annotated with offensive spans. Accordingly, in this shared task, we provide Tamil-English code-mixed social comments annotated with offensive spans. This paper outlines the released dataset, the methods, and the results of the submitted systems.

Original language: English
Title of host publication: DravidianLangTech 2022 - 2nd Workshop on Speech and Language Technologies for Dravidian Languages, Proceedings of the Workshop
Editors: Bharathi Raja Chakravarthi, Ruba Priyadharshini, Anand Kumar Madasamy, Parameswari Krishnamurthy, Elizabeth Sherly, Sinnathamby Mahesan
Publisher: Association for Computational Linguistics (ACL)
Pages: 261-270
Number of pages: 10
ISBN (Electronic): 9781955917346
Publication status: Published - 2022
Externally published: Yes
Event: 2nd Workshop on Speech and Language Technologies for Dravidian Languages, Proceedings of the Workshop, DravidianLangTech 2022 - Dublin, Ireland
Duration: 26 May 2022 → …

Publication series

Name: DravidianLangTech 2022 - 2nd Workshop on Speech and Language Technologies for Dravidian Languages, Proceedings of the Workshop

Conference

Conference: 2nd Workshop on Speech and Language Technologies for Dravidian Languages, Proceedings of the Workshop, DravidianLangTech 2022
Country/Territory: Ireland
City: Dublin
Period: 26/05/22 → …
