Studying Socially Unacceptable Discourse Classification (SUD) through different eyes: "Are we on the same page ? " - HAL Access content directly
Conference papers Year : 2023

Studying Socially Unacceptable Discourse Classification (SUD) through different eyes: "Are we on the same page ? "

Abstract

We study Socially Unacceptable Discourse (SUD) characterization and detection in online text. We first build and present a novel corpus that contains a large variety of manually annotated texts from different online sources used so far in state-of-the-art Machine learning (ML) SUD detection solutions. This global context allows us to test the generalization ability of SUD classifiers that acquire knowledge around the same SUD categories, but from different contexts. From this perspective, we can analyze how (possibly) different annotation modalities influence SUD learning by discussing open challenges and open research directions. We also provide several data insights which can support domain experts in the annotation task.
Main file
Thumbnail
CMC-Carneiro_Linardi_Longhi.pdf ( 809.8 Ko ) Download
Origin : Files produced by the author(s)
Loading...

Dates and versions

hal-04316521, version 1 (04-12-2023)

Licence

Attribution - CC BY 4.0

Identifiers

  • HAL Id : hal-04316521 , version 1

Cite

Bruno Machado Carneiro, Michele Linardi, Julien Longhi. Studying Socially Unacceptable Discourse Classification (SUD) through different eyes: "Are we on the same page ? ". International Conference on CMC and Social Media Corpora for the Humanities, Sep 2023, Mannheim, Germany, Germany. ⟨hal-04316521⟩
31 View
36 Download
Last update date on 5/12/24
How are these indicators produced

Share

Gmail Facebook Twitter LinkedIn More