Learning to Generate Examples for Semantic Processing Tasks

IRIS

Even if recent Transformer-based architectures, such as BERT, achieved impressive results in semantic processing tasks, their fine-tuning stage still requires large scale training resources. Usually, Data Augmentation (DA) techniques can help to deal with low resource settings. In Text Classification tasks, the objective of DA is the generation of well-formed sentences that (i) represent the desired task category and (ii) are novel with respect to existing sentences. In this paper, we propose a neural approach to automatically learn to generate new examples using a pre-trained sequence-to-sequence model. We first learn a task-oriented similarity function that we use to pair similar examples. Then, we use these example pairs to train a model to generate examples. Experiments in low resource settings show that augmenting the training material with the proposed strategy systematically improves the results on text classification and natural language inference tasks by up to 10% accuracy, outperforming existing DA approaches.

Croce, D., Filice, S., Castellucci, G., Basili, R. (2022). Learning to Generate Examples for Semantic Processing Tasks. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (pp.4587-4601). Association for Computational Linguistics (ACL) [10.18653/v1/2022.naacl-main.340].

Learning to Generate Examples for Semantic Processing Tasks

Croce D.;Filice S.;Castellucci G.;Basili R.

2022-01-01

Abstract

Even if recent Transformer-based architectures, such as BERT, achieved impressive results in semantic processing tasks, their fine-tuning stage still requires large scale training resources. Usually, Data Augmentation (DA) techniques can help to deal with low resource settings. In Text Classification tasks, the objective of DA is the generation of well-formed sentences that (i) represent the desired task category and (ii) are novel with respect to existing sentences. In this paper, we propose a neural approach to automatically learn to generate new examples using a pre-trained sequence-to-sequence model. We first learn a task-oriented similarity function that we use to pair similar examples. Then, we use these example pairs to train a model to generate examples. Experiments in low resource settings show that augmenting the training material with the proposed strategy systematically improves the results on text classification and natural language inference tasks by up to 10% accuracy, outperforming existing DA approaches.

Scheda breve

Scheda completa

Scheda completa (DC)

	Nome del convegno
	
				2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2022
			
	Luogo del convegno
	
				Seattle, United States
			
	Anno del convegno
	
				2022
			
	Organizzatore/i del convegno
	
				Amazon
			
	Rilevanza del convegno
	
				Rilevanza internazionale
			
	Data di pubblicazione
	
				2022
			
	DOI dell'intervento
	
				https://dx.doi.org/10.18653/v1/2022.naacl-main.340
			
	Settore disciplinare dell'intervento (valido fino a 24/06/2024)
	
				Settore INF/01
Settore ING-INF/05
			
	Lingua del contenuto
	
				English
			
	Tipologia
	
				Intervento a convegno
			
	Citazione
	
				Croce, D., Filice, S., Castellucci, G., Basili, R. (2022). Learning to Generate Examples for Semantic Processing Tasks. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (pp.4587-4601). Association for Computational Linguistics (ACL) [10.18653/v1/2022.naacl-main.340].
			
	Tutti gli autori
	
						Croce, D; Filice, S; Castellucci, G; Basili, R
					
	Appare nelle tipologie:
	
				02 - Intervento a convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2108/359272

Citazioni

ND

1

ND

social impact