In this paper we briefly describe the ART Lab infrastructure for semantic Big Bata processing. Our most relevant contribution is the definition of an architecture supporting ontology development driven by knowledge acquired from heterogeneous resources, such as documents and web pages. The overall perspective is to propose a gluing architecture driving and supporting the entire flow of information, from data acquisition from external heterogeneous resources to their exploitation for RDF triplification. In such an architecture, the unstructured content analysis capabilities of frameworks such as UIMA are integrated in a coordinated environment supporting the processing, transformation and projection of produced metadata into RDF semantic repositories, which are managed by Semantic Turkey, our platform for Knowledge Acquisition and Management. Further contributions relate to the possibility of easily managing high dimension repositories (e.g., thesauri, vocabularies, etc.), and supporting end users for sharing the 'logics' under the reasoning processes
Fiorelli, M., Pazienza, M.t., Stellato, A., Turbati, A. (2014). ART Lab infrastructure for semantic Big Data processing. In INTERNATIONAL WORKSHOP ON BIG DATA PRINCIPLES, ARCHITECTURES \& APPLICATIONS (BDAA 2014), colocated with International Conference on High Performance Computing \& Simulation (HPCS 2014) (pp. 327-334). IEEE [10.1109/HPCSim.2014.6903704].
ART Lab infrastructure for semantic Big Data processing
Fiorelli, M;PAZIENZA, MARIA TERESA;STELLATO, ARMANDO;
2014-09-22
Abstract
In this paper we briefly describe the ART Lab infrastructure for semantic Big Bata processing. Our most relevant contribution is the definition of an architecture supporting ontology development driven by knowledge acquired from heterogeneous resources, such as documents and web pages. The overall perspective is to propose a gluing architecture driving and supporting the entire flow of information, from data acquisition from external heterogeneous resources to their exploitation for RDF triplification. In such an architecture, the unstructured content analysis capabilities of frameworks such as UIMA are integrated in a coordinated environment supporting the processing, transformation and projection of produced metadata into RDF semantic repositories, which are managed by Semantic Turkey, our platform for Knowledge Acquisition and Management. Further contributions relate to the possibility of easily managing high dimension repositories (e.g., thesauri, vocabularies, etc.), and supporting end users for sharing the 'logics' under the reasoning processesFile | Dimensione | Formato | |
---|---|---|---|
ARTLabframework.pdf
solo utenti autorizzati
Descrizione: articolo pubblicato
Licenza:
Copyright dell'editore
Dimensione
843.06 kB
Formato
Adobe PDF
|
843.06 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.