Semi-Automatic Knowledge Extraction to Enrich Open Linked Data

Elena Baralis (Politecnico di Torino, Italy), Giulia Bruno (Politecnico di Torino, Italy), Tania Cerquitelli (Politecnico di Torino, Italy), Silvia Chiusano (Politecnico di Torino, Italy), Alessandro Fiori (Politecnico di Torino, Italy), and Alberto Grand (Politecnico di Torino, Italy)
Copyright: © 2013 | Pages: 25
EISBN13: 9781466646018 | DOI: 10.4018/978-1-4666-2827-4.ch008


This chapter presents an analysis of the Wikipedia collection using the ELiDa framework, with the aim of enriching linked data. ELiDa is based on association rule mining, an exploratory technique for discovering relevant correlations hidden in the analyzed data. To store the large volume of extracted knowledge compactly and retrieve it efficiently for further analysis, a persistent structure is used. The domain expert is in charge of selecting the relevant knowledge by setting filtering parameters, assessing the quality of the extracted knowledge, and enriching it with semantic expressiveness that cannot be automatically inferred. As representative document collections, we consider seven datasets extracted from the Wikipedia collection. Each dataset is analyzed from two points of view (i.e., transactions by documents and transactions by sentences) to highlight relevant knowledge at different levels of abstraction.
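
As an illustration of the two transaction views mentioned above, the following minimal Python sketch builds transactions either by document or by sentence and mines simple one-to-one association rules filtered by support and confidence. It is not the ELiDa implementation: the function names (`to_transactions`, `mine_pair_rules`), the naive tokenization and sentence splitting, and the restriction to term pairs are all assumptions made for brevity.

```python
import re
from collections import Counter
from itertools import combinations


def to_transactions(documents, by="document"):
    """Turn raw texts into transactions (sets of lowercase terms).

    Hypothetical helper: 'document' yields one transaction per text,
    'sentence' yields one transaction per sentence (naive split on . ! ?).
    """
    if by == "document":
        units = documents
    elif by == "sentence":
        units = [s for d in documents
                 for s in re.split(r"[.!?]+", d) if s.strip()]
    else:
        raise ValueError("by must be 'document' or 'sentence'")
    return [set(re.findall(r"[a-z]+", u.lower())) for u in units]


def mine_pair_rules(transactions, min_support, min_confidence):
    """Return rules (antecedent, consequent, support, confidence).

    Only single-term antecedents and consequents are considered, which
    is a simplification of general association rule mining.
    """
    n = len(transactions)
    item_counts = Counter(i for t in transactions for i in t)
    pair_counts = Counter(
        pair for t in transactions for pair in combinations(sorted(t), 2)
    )
    rules = []
    for (a, b), count in pair_counts.items():
        support = count / n  # fraction of transactions containing both terms
        if support < min_support:
            continue
        for x, y in ((a, b), (b, a)):
            confidence = count / item_counts[x]  # estimate of P(y | x)
            if confidence >= min_confidence:
                rules.append((x, y, support, confidence))
    return rules
```

The same collection can yield different rules at the two granularities. A rule such as `turin -> italy` may hold with high confidence when each document is a transaction, yet with lower confidence at the sentence level, because the two terms must then co-occur within a single sentence. The thresholds `min_support` and `min_confidence` stand in for the filtering parameters that the domain expert sets.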