TaxoSBERT: Unsupervised Taxonomy Expansion Through Expressive Semantic Similarity

IRIS

Knowledge graphs are crucial resources for a large set of document management tasks, such as text retrieval and classification as well as natural language inference. Standard examples are large-scale lexical semantic graphs, such as WordNet, useful for text tagging or sentence disambiguation purposes. The dynamics of lexical taxonomies is a critical problem as they need to be maintained to follow the language evolution across time. Taxonomy expansion, in this sense, becomes a critical semantic task, as it allows for an extension of existing resources with new properties but also to create new entries, i.e. taxonomy concepts, when necessary. Previous work on this topic suggests the use of neural learning methods able to make use of the underlying taxonomy graph as a source of training evidence. This can be done by graph-based learning, where nets are trained to encode the underlying knowledge graph and to predict appropriate inferences. This paper presents TaxoSBERT as a simple and effective way to model the taxonomy expansion problem as a retrieval task. It combines a robust semantic similarity measure and taxonomy-driven re-rank strategies. This method is unsupervised, the adopted similarity measures are trained on (large-scale) resources out of a target taxonomy and are extremely efficient. The experimental evaluation with respect to two taxonomies shows surprising results, improving far more complex state-of-the-art methods.

Margiotta, D., Croce, D., Basili, R. (2023). TaxoSBERT: Unsupervised Taxonomy Expansion Through Expressive Semantic Similarity. In Deep Learning Theory and Applications (pp.295-307). Springer Science and Business Media Deutschland GmbH [10.1007/978-3-031-39059-3_20].

TaxoSBERT: Unsupervised Taxonomy Expansion Through Expressive Semantic Similarity

Margiotta D.;Croce D.;Basili R.

2023-01-01

Abstract

Knowledge graphs are crucial resources for a large set of document management tasks, such as text retrieval and classification as well as natural language inference. Standard examples are large-scale lexical semantic graphs, such as WordNet, useful for text tagging or sentence disambiguation purposes. The dynamics of lexical taxonomies is a critical problem as they need to be maintained to follow the language evolution across time. Taxonomy expansion, in this sense, becomes a critical semantic task, as it allows for an extension of existing resources with new properties but also to create new entries, i.e. taxonomy concepts, when necessary. Previous work on this topic suggests the use of neural learning methods able to make use of the underlying taxonomy graph as a source of training evidence. This can be done by graph-based learning, where nets are trained to encode the underlying knowledge graph and to predict appropriate inferences. This paper presents TaxoSBERT as a simple and effective way to model the taxonomy expansion problem as a retrieval task. It combines a robust semantic similarity measure and taxonomy-driven re-rank strategies. This method is unsupervised, the adopted similarity measures are trained on (large-scale) resources out of a target taxonomy and are extremely efficient. The experimental evaluation with respect to two taxonomies shows surprising results, improving far more complex state-of-the-art methods.

Scheda breve

Scheda completa

Scheda completa (DC)

	Nome del convegno
	
				Proceedings of the 4th International Conference on Deep Learning Theory and Applications, DeLTA 2023
			
	Luogo del convegno
	
				Roma, Italia
			
	Anno del convegno
	
				2023
			
	Numero del convegno
	
				4
			
	Rilevanza del convegno
	
				Rilevanza internazionale
			
	Data di pubblicazione
	
				2023
			
	DOI dell'intervento
	
				https://dx.doi.org/10.1007/978-3-031-39059-3_20
			
	Settore disciplinare dell'intervento (valido fino a 24/06/2024)
	
				Settore INF/01
Settore ING-INF/05
			
	Lingua del contenuto
	
				English
			
	Parole chiave
	
				Knowledge injection in neural learning
Taxonomy expansion
Unsupervised sentence embeddings
			
	Tipologia
	
				Intervento a convegno
			
	Citazione
	
				Margiotta, D., Croce, D., Basili, R. (2023). TaxoSBERT: Unsupervised Taxonomy Expansion Through Expressive Semantic Similarity. In Deep Learning Theory and Applications (pp.295-307). Springer Science and Business Media Deutschland GmbH [10.1007/978-3-031-39059-3_20].
			
	Tutti gli autori
	
						Margiotta, D; Croce, D; Basili, R
					
	Appare nelle tipologie:
	
				02 - Intervento a convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2108/359278

Citazioni

ND

0

ND

social impact