Integrating ontological and linguistic knowledge for Conceptual Information Extraction

IRIS

Text understanding makes strong assumptions about the conceptualisation of the underlying knowledge domain. This conceptualisation mediates between the accomplishment of the specific task on the one hand and the knowledge expressed in the target text fragments on the other. However, building domain conceptualisations from scratch is a very complex and time-consuming task. Traditionally, the reuse of available domain resources, although not always the best option, has been applied as an accurate and cost-effective solution. Here, we investigate the possibility of exploiting sources of domain knowledge (e.g. a subject reference system) to build a linguistically motivated domain concept hierarchy. The limitations connected with the use of domain taxonomies as ontological resources are first discussed in the specific light of Information Extraction (IE), i.e. for supporting linguistic inference. We then define a method for integrating the taxonomical domain knowledge with a general-purpose lexical knowledge base such as WordNet. A case study, the integration of MeSH (Medical Subject Headings) and WordNet, is then presented as proof of the effectiveness and accuracy of the overall approach.

Basili, R., Vindigni, M., Zanzotto, F.M. (2003). Integrating ontological and linguistic knowledge for Conceptual Information Extraction. In Proceedings of IEEE/WIC Web Intelligence (WI 2003) (GSS Conference Rating CORE:B, LiveSHINE:A, MA:B) [10.1109/WI.2003.1241190].

BASILI, ROBERTO;VINDIGNI, MICHELE;ZANZOTTO, FABIO MASSIMO

2003-01-01

	Conference name
	
				IEEE/WIC International Conference on Web Intelligence, WI 2003
			
	Conference relevance
	
				International relevance
			
	Publication date
	
				2003
			
	DOI of the contribution
	
				https://dx.doi.org/10.1109/WI.2003.1241190
			
	Disciplinary sector of the contribution (valid until 24/06/2024)
	
				Sector INF/01 - Computer Science
				Sector ING-INF/05 - Information Processing Systems
			
	Content language
	
				English
			
	Keywords
	
				Computer science; Costs; Data mining; Databases; Intelligent structures; Investments; Ontologies; Taxonomy; Text categorization
			
	Alternative URL
	
				http://dx.medra.org/10.1109/WI.2003.1241190
			
	Type
	
				Conference contribution
			
	All authors
	
				Basili, R; Vindigni, M; Zanzotto, FM
			
	Appears in types:
	
				02 - Conference contribution

Files in this item:

File	Size	Format
2003_WI_BasiliVindigniZanzotto.pdf (authorized users only; license: publisher's copyright)	105.07 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2108/165443
