Para acceder a los documentos con el texto completo, por favor, siga el siguiente enlace: http://hdl.handle.net/2117/100716

Dimensional enrichment of statistical linked open data
Varga, Jovan; Vaisman, Alejandro; Romero Moral, Óscar; Etcheverry, Lorena; Bach Pedersen, Torben; Thomsen, Christian
Universitat Politècnica de Catalunya. Departament d'Enginyeria de Serveis i Sistemes d'Informació; Universitat Politècnica de Catalunya. MPI - Modelització i Processament de la Informació
On-Line Analytical Processing (OLAP) is a data analysis technique typically used for local and well-prepared data. However, initiatives like Open Data and Open Government bring new and publicly available data on the web that are to be analyzed in the same way. The use of semantic web technologies for this context is especially encouraged by the Linked Data initiative. There is already a considerable amount of statistical linked open data sets published using the RDF Data Cube Vocabulary (QB) which is designed for these purposes. However, QB lacks some essential schema constructs (e.g., dimension levels) to support OLAP. Thus, the QB4OLAP vocabulary has been proposed to extend QB with the necessary constructs and be fully compliant with OLAP. In this paper, we focus on the enrichment of an existing QB data set with QB4OLAP semantics. We first thoroughly compare the two vocabularies and outline the benefits of QB4OLAP. Then, we propose a series of steps to automate the enrichment of QB data sets with specific QB4OLAP semantics; being the most important, the definition of aggregate functions and the detection of new concepts in the dimension hierarchy construction. The proposed steps are defined to form a semi-automatic enrichment method, which is implemented in a tool that enables the enrichment in an interactive and iterative fashion. The user can enrich the QB data set with QB4OLAP concepts (e.g., full-fledged dimension hierarchies) by choosing among the candidate concepts automatically discovered with the steps proposed. Finally, we conduct experiments with 25 users and use three real-world QB data sets to evaluate our approach. The evaluation demonstrates the feasibility of our approach and shows that, in practice, our tool facilitates, speeds up, and guarantees the correct results of the enrichment process.
Peer Reviewed
-Àrees temàtiques de la UPC::Informàtica::Enginyeria del software
-Semantic web
-Linked data
-OLAP technology
-Linked open data
-Multidimensional data modeling
-OLAP
-Semantic web
-Web semàntica
-Tecnologia OLAP
Artículo - Versión presentada
Artículo
         

Mostrar el registro completo del ítem

Documentos relacionados

Otros documentos del mismo autor/a

Varga, Jovan; Etcheverry, Lorena; Vaisman, Alejandro; Romero Moral, Óscar; Bach Pedersen, Torben; Thomsen, Christian
Varga, Jovan; Romero Moral, Óscar; Bach Pedersen, Torben; Thomsen, Christian
Varga, Jovan; Dobrokhotova, Ekaterina; Romero Moral, Óscar; Bach Pedersen, Torben; Thomsen, Christian
Varga, Jovan; Romero Moral, Óscar; Pedersen, Torben; Thomsen, Christian
Varga, Jovan; Romero Moral, Óscar; Pedersen, Torben; Thomsen, Christian