Hate Speech Detection Is Not As Easy As You May Think: A Closer Look At Model Validation

Aym� Arango; Jorge P�rez; Barbara Poblete

Venue: ACM SIGIR Conference (SIGIR) 2019
Recognition: Most Influential SIGIR 2019 Paper (Rank No. 13)
Edition: 2026-03
Impact factor: 4
Certificate ID: 065ec9ae8cbb2805

Abstract

Hate speech is an important problem that is seriously affecting the dynamics and usefulness of online social communities. Large scale social platforms are currently investing important resources into automatically detecting and classifying hateful content, without much success. On the other hand, the results reported by state-of-the-art systems indicate that supervised approaches achieve almost perfect performance but only within specific datasets. In this work, we analyze this apparent contradiction between existing literature and actual applications. We study closely the experimental methodology used in prior work and their generalizability to other datasets. Our findings evidence methodological issues, as well as an important dataset bias. As a consequence, performance claims of the current state-of-the-art have become significantly overestimated. The problems that we have found are mostly related to data overfitting and sampling issues. We discuss the implications for current research and re-conduct experiments to give a more accurate picture of the current state-of-the art methods. expand

Download PDF certificate