Linking entries in protein interaction database to structured text: The FEBS Letters experiment

IRIS

The corpus of the scientific literature has reached such size that a lot of useful data, dispersed throughout millions different articles, are now hard to recover. For instance, many articles in the biological domain describe relationships between entities (gene, proteins, small molecules, etc.) yet this crucial information cannot be efficiently used because of the difficulties in retrieving it automatically from unstructured text. Databases are striving to capture this valuable information and to organize it in a structured format ready for automatic analysis. However, the current database model, based on manual curation, is not sustainable because the limited support is not compatible with complete and accurate coverage of published information. Several proposals have been put forward to increase the efficiency and accuracy of the curation process. Here we present an experiment, designed by the editorial board of FEBS Letters, aimed at integrating each manuscript with a structured summary precisely reporting, with database identifiers and predefined controlled vocabularies, the protein interactions reported in the manuscript. The authors play an important role in this process as they are requested to provide structured information to be appended, in the form of human-readable paragraphs, at the end of traditional summaries. It is envisaged that the structured text will become an integral part of Medline abstracts. In 6 months time the experience gained with this experiment will form the basis for a community discussion to propose a widely accepted strategy for information storage and retrieval. Â© 2008 Federation of European Biochemical Societies.

Ceol, A., Chatr_aryamontri, A., Licata, L., Cesareni, G. (2008). Linking entries in protein interaction database to structured text: The FEBS Letters experiment. FEBS LETTERS, 582(8), 1171-1177 [10.1016/j.febslet.2008.02.071].

Linking entries in protein interaction database to structured text: The FEBS Letters experiment

Ceol, A;Chatr_Aryamontri, A;Licata, L;CESARENI, GIOVANNI

2008-01-01

Abstract

The corpus of the scientific literature has reached such size that a lot of useful data, dispersed throughout millions different articles, are now hard to recover. For instance, many articles in the biological domain describe relationships between entities (gene, proteins, small molecules, etc.) yet this crucial information cannot be efficiently used because of the difficulties in retrieving it automatically from unstructured text. Databases are striving to capture this valuable information and to organize it in a structured format ready for automatic analysis. However, the current database model, based on manual curation, is not sustainable because the limited support is not compatible with complete and accurate coverage of published information. Several proposals have been put forward to increase the efficiency and accuracy of the curation process. Here we present an experiment, designed by the editorial board of FEBS Letters, aimed at integrating each manuscript with a structured summary precisely reporting, with database identifiers and predefined controlled vocabularies, the protein interactions reported in the manuscript. The authors play an important role in this process as they are requested to provide structured information to be appended, in the form of human-readable paragraphs, at the end of traditional summaries. It is envisaged that the structured text will become an integral part of Medline abstracts. In 6 months time the experience gained with this experiment will form the basis for a community discussion to propose a widely accepted strategy for information storage and retrieval. Â© 2008 Federation of European Biochemical Societies.

Scheda breve

Scheda completa

Scheda completa (DC)

	Data di pubblicazione
	
				2008
			
	Status di pubblicazione
	
				Pubblicato
			
	DOI dell'articolo
	
				https://dx.doi.org/10.1016/j.febslet.2008.02.071
			
	Rilevanza
	
				Rilevanza internazionale
			
	Tipo
	
				Articolo
			
	Referee
	
				Sì, ma tipo non specificato
			
	Settore disciplinare dell'articolo (valido fino a 24/06/2024)
	
				Settore BIO/18 - GENETICA
			
	Lingua del contenuto
	
				English
			
	Impact Factor ISI
	
				Con Impact Factor ISI
			
	Parole chiave
	
				Database; Information extraction; Network; Protein interaction
			
	Citazione
	
				Ceol, A., Chatr_aryamontri, A., Licata, L., Cesareni, G. (2008). Linking entries in protein interaction database to structured text: The FEBS Letters experiment. FEBS LETTERS, 582(8), 1171-1177 [10.1016/j.febslet.2008.02.071].
			
	Tutti gli autori
	
						Ceol, A; Chatr_aryamontri, A; Licata, L; Cesareni, G
					
	Tipologia
	
				Articolo su rivista
			
	Appare nelle tipologie:
	
				01 - Articolo su rivista

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2108/31211

Citazioni

30

64

54

social impact