Syntax and prejudice: ethically-charged biases of a syntax-based hate speech recognizer unveiled

IRIS

Hate speech recognizers (HSRs) can be the panacea for containing hate in social media or can result in the biggest form of prejudice-based censorship hindering people to express their true selves. In this paper, we hypothesized how massive use of syntax can reduce the prejudice effect in HSRs. To explore this hypothesis, we propose Unintended-bias Visualizer based on Kermit modeling (KERM-HATE): a syntax-based HSR, which is endowed with syntax heat parse trees used as a post-hoc explanation of classifications. KERM-HATE significantly outperforms BERT-based, RoBERTa-based and XLNet-based HSR on standard datasets. Surprisingly this result is not sufficient. In fact, the post-hoc analysis on novel datasets on recent divisive topics shows that even KERM-HATE carries the prejudice distilled from the initial corpus. Therefore, although tests on standard datasets may show higher performance, syntax alone cannot drive the ‘‘attention“ of HSRs to ethically-unbiased features.

Mastromattei, M., Ranaldi, L., Fallucchi, F., Zanzotto, F.m. (2022). Syntax and prejudice: ethically-charged biases of a syntax-based hate speech recognizer unveiled. PEERJ. COMPUTER SCIENCE., 8, e859 [10.7717/peerj-cs.859].

Syntax and prejudice: ethically-charged biases of a syntax-based hate speech recognizer unveiled

Mastromattei M.;Ranaldi L.;Fallucchi F.;Zanzotto F. M.^{Conceptualization}

2022-01-01

Abstract

Hate speech recognizers (HSRs) can be the panacea for containing hate in social media or can result in the biggest form of prejudice-based censorship hindering people to express their true selves. In this paper, we hypothesized how massive use of syntax can reduce the prejudice effect in HSRs. To explore this hypothesis, we propose Unintended-bias Visualizer based on Kermit modeling (KERM-HATE): a syntax-based HSR, which is endowed with syntax heat parse trees used as a post-hoc explanation of classifications. KERM-HATE significantly outperforms BERT-based, RoBERTa-based and XLNet-based HSR on standard datasets. Surprisingly this result is not sufficient. In fact, the post-hoc analysis on novel datasets on recent divisive topics shows that even KERM-HATE carries the prejudice distilled from the initial corpus. Therefore, although tests on standard datasets may show higher performance, syntax alone cannot drive the ‘‘attention“ of HSRs to ethically-unbiased features.

Scheda breve

Scheda completa

Scheda completa (DC)

	Data di pubblicazione
	
				2022
			
	Status di pubblicazione
	
				Pubblicato
			
	DOI dell'articolo
	
				https://dx.doi.org/10.7717/peerj-cs.859
			
	Rilevanza
	
				Rilevanza internazionale
			
	Tipo
	
				Articolo
			
	Referee
	
				Esperti anonimi
			
	Settore disciplinare dell'articolo (valido fino a 24/06/2024)
	
				Settore INF/01 - INFORMATICA
Settore ING-INF/05 - SISTEMI DI ELABORAZIONE DELLE INFORMAZIONI
			
	Lingua del contenuto
	
				English
			
	Impact Factor ISI
	
				Con Impact Factor ISI
			
	Parole chiave
	
				Artificial Intelligence
Bias
Data Mining and Machine Learning
Explainability
Hate speech
Natural Language and Speech
Neural networks
Syntax
			
	Citazione
	
				Mastromattei, M., Ranaldi, L., Fallucchi, F., Zanzotto, F.m. (2022). Syntax and prejudice: ethically-charged biases of a syntax-based hate speech recognizer unveiled. PEERJ. COMPUTER SCIENCE., 8, e859 [10.7717/peerj-cs.859].
			
	Tutti gli autori
	
						Mastromattei, M; Ranaldi, L; Fallucchi, F; Zanzotto, Fm
					
	Tipologia
	
				Articolo su rivista
			
	Appare nelle tipologie:
	
				01 - Articolo su rivista

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2108/294947

Citazioni

ND

11

7

social impact