You are viewing a javascript disabled version of the site. Please enable Javascript for this site to function properly.
Go to headerGo to navigationGo to searchGo to contentsGo to footer
In content section. Select this link to jump to navigation

A note on intelligent exploration of semantic data

Abstract

Welcome to this special issue of the Semantic Web (SWJ) journal. The special issue compiles three technical contributions that significantly advance the state-of-the-art in exploration of semantic data using semantic web techniques and technologies.

1.Introduction

Semantic data, including structured and unstructured data linked with knowledge models representing meaning, is becoming widely available in a diversity of contexts and usages (Zaveri, Rula et al. [11]). This can make significant contribution to the implementation and the broader utilisation of semantic web in human practice. Users with various backgrounds and expectations are becoming exposed to semantic data, and need to explore such data for different purposes (searching, browsing, summarising, making sense, taking decisions, or learning) (Shneiderman [8]) (Brunetti, García [2]) (Thakker et al. [9]). Semantic data exploration is becoming a key activity in a range of application domains, such as government and public organisations (Hendler, Holm et al. [4], Höffner, Martin et al. [5]), education (Dietze, Yu et al. [3]), life science (William, Harland et al. [10]), cultural heritage (Hyvönen [6]), media and entertainment (Kobilarov, Scott et al. [7]) to name a few. Several novel interfaces and interaction means for exploration of semantic data are being proposed, for example semantic data browsers, ontology/content visualisation and exploration environments, semantic wikis, intelligent agents. However, the efforts have not been consolidated to capitalise on experience in tackling key issues in exploration of semantic data, and to bring together the main themes of: (a) Human Factors: the utility and the added value of exploration approaches to humans; (b) Technology: new technological strands and how these strands impact the algorithms for interaction and exploration; (c) Application Domains: domains that have seen a change because of bringing in semantic data exploration approaches, and the uncharted application domains community has yet to address; and (d) Evaluation: sharing practices on evaluating exploration approaches including gold standards, benchmarks and methodologies.

The main goal of this Special Issue is to explore new directions and approaches in the aforementioned themes to leverage innovative research aimed at tackling exploration challenges while using semantic data. We encouraged the submission of work with important theoretical and practical results, as well as case studies on existing use of technologies and techniques for semantic data exploration.

2.Summary of contributions

In this section, we present the summary of the papers that were accepted for publication in this special issue.

It is common practice to query different linked datasets in order to get as much information as possible on distinct topics. These datasets can have a large number of complex vocabularies and data structures making it difficult to query them for any meaningful analysis. For this reason, there is a need for systems that can produce on-demand rewritings of queries. One of the challenges in query rewriting is the preservation of semantics due to heterogeneity of the vocabularies. Hence, the quality estimation of the produced rewriting becomes crucial. In the paper titled “Estimating query rewriting quality over LOD”, Torre et al. present a framework that, given a query written in the vocabulary the user is more familiar with, the system rewrites the query in terms of the vocabulary of a target dataset. Moreover, their system also informs about the quality of the rewritten query with two scores: firstly, a similarity factor which is based on the rewriting process itself, and secondly, a quality score offered by a predictive model. This model is constructed by a machine learning algorithm that learns from a set of queries and their intended (gold standard) rewritings. They have validated the feasibility of their framework in a real scenario.

In the paper titled “Quality Metrics for RDF Graph Summarization”, Zneika et al. tackle the issue of graph summarization from the perspective of semantic data exploration. RDF Graph Summarization pertains to the process of extracting concise but meaningful summaries from RDF Knowledge Bases (KBs) representing as close as possible the actual contents of the KB both in terms of structure and data. The authors present a quality framework that works at two levels: the level of the ideal summary of the KB that could be provided by an expert user and the level of the instances contained by the KB. For the first level, they are computing how close the proposed summary is to the ideal solution (when this is available) by defining and computing its precision, recall and F-measure against the ideal solution. For the second level, they are computing if the existing instances are covered (i.e. can be retrieved) and in what degree by the proposed summary. Authors have tested their quality framework against three well-known RDF Graph Summarization algorithms.

The third paper of the issue, titled “Using Knowledge Anchors to Facilitate User Exploration of Data Graphs” by Al-Tawil et al. focuses on the novel concept of knowledge expansion during data exploration. Authors introduce an exploration support mechanism underpinned by the subsumption theory of meaningful learning (Ausubel [1]), which postulates that new knowledge is grasped by starting from familiar concepts in the graph which serve as knowledge anchors from where links to new knowledge are made. A core component of their framework based on subsumption theory of meaningful learning is to the automatic identification of such knowledge anchors in a data graph (KADG). Authors present several metrics for identifying KADG which are evaluated against familiar concepts in human cognitive structures. These metrics and framework is evaluated in a task-driven experimental user study compared to free data graph exploration.

We hope that the readers will find the articles of this special issue to be informative and useful.

References

[1] 

D.P. Ausubel, A subsumption theory of meaningful verbal learning and retention, The Journal of General Psychology 66: (2) ((1962) ), 213–224. doi:10.1080/00221309.1962.9711837.

[2] 

J.M. Brunetti and R. García, User-centered design and evaluation of overview components for semantic data exploration, Aslib Journal of Information Management 66: (5) ((2014) ), 519–536. doi:10.1108/AJIM-12-2013-0153.

[3] 

S. Dietze et al., Linked education: Interlinking educational resources and the web of data, in: The 27th ACM Symposium on Applied Computing (SAC-2012), Special Track on Semantic Web and Applications, Trento, Italy, (2012) .

[4] 

J. Hendler, J. Holm, C. Musialek and G. Thomas, US government linked open data: Semantic.data.gov, The IEEE Intelligent Systems 27: (3) ((2012) ), 25–31. doi:10.1109/MIS.2012.27.

[5] 

K. Höffner, M. Martin and J. Lehmann, LinkedSpending: Openspending becomes linked open data, Semantic Web 7: (1) ((2016) ), 95–104. doi:10.3233/SW-150172.

[6] 

E. Hyvönen, Publishing and using cultural heritage linked data on the semantic web, Synthesis Lectures on the Semantic Web: Theory and Technology 2: (1) ((2012) ), 1–159.

[7] 

G. Kobilarov et al., Media meets semantic web – how the BBC uses dbpedia and linked data to make connections, in: European Semantic Web Conference, Springer, Heraklion, Greece, (2009) , pp. 723–737.

[8] 

B. Shneiderman, The eyes have it: A task by data type taxonomy for information visualizations, in: Proceedings of the IEEE Symposium on Visual Languages, Boulder, CO, USA, (1996) , pp. 336–343.

[9] 

D. Thakker, F. Yang-Turner and D. Despotakis, User interaction with linked data: An exploratory search approach, International Journal of Distributed Systems and Technologies (IJDST) 7: (1) ((2016) ), 79–91. doi:10.4018/IJDST.2016010105.

[10] 

A.J. William, L. Harland, P.T. Groth, S. Pettifier, C. Christine, E.L. Willighagen and B. Mons, Open PHACTS: Semantic interoperability for drug discovery, Drug Discovery Today 17: (21–22) ((2012) ), 1188–1198. doi:10.1016/j.drudis.2012.05.016.

[11] 

A. Zaveri, A. Rula, A. Maurino, R. Pietrobon, J. Lehmann and S. Auer, Quality assessment for linked data: A survey, Semantic Web 7: (1) ((2016) ), 63–93. doi:10.3233/SW-150175.