Hate speech recognizers (HSRs) can be the panacea for containing hate in social media or can result in the biggest form of prejudice-based censorship hindering people to express their true selves. In this paper, we hypothesized how massive use of syntax can reduce the prejudice effect in HSRs. To explore this hypothesis, we propose Unintended-bias Visualizer based on Kermit modeling (KERM-HATE): a syntax-based HSR, which is endowed with syntax heat parse trees used as a post-hoc explanation of classifications. KERM-HATE significantly outperforms BERT-based, RoBERTa-based and XLNet-based HSR on standard datasets. Surprisingly this result is not sufficient. In fact, the post-hoc analysis on novel datasets on recent divisive topics shows that even KERM-HATE carries the prejudice distilled from the initial corpus. Therefore, although tests on standard datasets may show higher performance, syntax alone cannot drive the ‘‘attention“ of HSRs to ethically-unbiased features.

Mastromattei, M., Ranaldi, L., Fallucchi, F., Zanzotto, F.m. (2022). Syntax and prejudice: ethically-charged biases of a syntax-based hate speech recognizer unveiled. PEERJ. COMPUTER SCIENCE., 8, e859 [10.7717/peerj-cs.859].

Syntax and prejudice: ethically-charged biases of a syntax-based hate speech recognizer unveiled

Zanzotto F. M.
Conceptualization
2022-01-01

Abstract

Hate speech recognizers (HSRs) can be the panacea for containing hate in social media or can result in the biggest form of prejudice-based censorship hindering people to express their true selves. In this paper, we hypothesized how massive use of syntax can reduce the prejudice effect in HSRs. To explore this hypothesis, we propose Unintended-bias Visualizer based on Kermit modeling (KERM-HATE): a syntax-based HSR, which is endowed with syntax heat parse trees used as a post-hoc explanation of classifications. KERM-HATE significantly outperforms BERT-based, RoBERTa-based and XLNet-based HSR on standard datasets. Surprisingly this result is not sufficient. In fact, the post-hoc analysis on novel datasets on recent divisive topics shows that even KERM-HATE carries the prejudice distilled from the initial corpus. Therefore, although tests on standard datasets may show higher performance, syntax alone cannot drive the ‘‘attention“ of HSRs to ethically-unbiased features.
2022
Pubblicato
Rilevanza internazionale
Articolo
Esperti anonimi
Settore INF/01 - INFORMATICA
Settore ING-INF/05 - SISTEMI DI ELABORAZIONE DELLE INFORMAZIONI
English
Con Impact Factor ISI
Artificial Intelligence
Bias
Data Mining and Machine Learning
Explainability
Hate speech
Natural Language and Speech
Neural networks
Syntax
Mastromattei, M., Ranaldi, L., Fallucchi, F., Zanzotto, F.m. (2022). Syntax and prejudice: ethically-charged biases of a syntax-based hate speech recognizer unveiled. PEERJ. COMPUTER SCIENCE., 8, e859 [10.7717/peerj-cs.859].
Mastromattei, M; Ranaldi, L; Fallucchi, F; Zanzotto, Fm
Articolo su rivista
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2108/294947
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 8
  • ???jsp.display-item.citation.isi??? 5
social impact