Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

Hybrid Query Execution on Linked Data With Complete Results

Samita Bai, Shakeel A. Khoja

Source Title: International Journal on Semantic Web and Information Systems (IJSWIS) 17(1)

DOI: 10.4018/IJSWIS.2021010102

OnDemand:

(Individual Articles)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

The link traversal strategies to query Linked Data over WWW can retrieve up-to-date results using a recursive URI lookup process in real-time. The downside of this approach comes with the query patterns having subject unbound (i.e. ?S rdf:type:Class). Such queries fail to start up the traversal process as the RDF pages are subject-centric in nature. Thus, zero-knowledge link traversal leads to the empty query results for these queries. In this paper, the authors analyze a large corpus of real-world SPARQL query logs and identify the Most Frequent Predicates (MFPs) occurring in these queries. The knowledge of these MFPs helps in finding and indexing a limited number of triples from the original data set. Additionally, the authors propose a Hybrid Query Execution (HQE) approach to execute the queries over this index for initial data source selection followed by link traversal process to fetch complete results. The evaluation of HQE on the latest real data benchmarks reveals that it retrieves at least five times more results than the existing approaches.

Article Preview

Top

1. Introduction

The traditional Web provides an enormous amount of information which is generally unstructured. This motivates the Semantic Web community to introduce more structured and meaningful data on the Web, also the tools and techniques for publishing and retrieving this data (Garzon, 2020). These efforts result in the concept of Linked Data. The Linked Data is different from the typical unstructured Web of documents hence it is also known as ‘Web of Data’ (WoD). The WoD is a novel collection of structured data which is distributed across the Web, defined using standard Semantic Web technologies i.e. Resource Description Framework ¹ (RDF) and SPARQL Protocol and RDF Query Language² (SPARQL) and published under Linked Data principles³ (Umbrich, 2015).

To utilize the full potential of WoD, an initiative is taken to provide an unrestricted access to this data known as “Linked Open Data” (LOD) cloud⁴. The link traversal strategy allows users to query LOD cloud live. Unlike traditional Web, the use of centralized approaches for searching the contents based on optimized indices is not a viable solution for data over LOD cloud. As this data is dynamic, so the copied data dumps can become out-of-date and stale query results can be obtained. Hence more emphasis is given to query this data live to cater its dynamicity. The link traversal approach relies on Linked Data principles and can fetch fresh results on-the-fly for the SPARQL queries often with slower response times. It employs a recursive URI lookup mechanism to traverse Linked Data sources using the follow-your-nose approach (i.e., dereferencing HTTP links) (Hartig, 2011; Hartig et al., 2009). Nevertheless, this technique has a shortcoming when it comes to answering certain query patterns where subject is unbound (e.g. ?S rdf:type:Class) (Scheglmann & Scherp, 2014) and where there is a foreign URI and/or literal at the object position. In the case of above-mentioned query forms, the link traversal for SPARQL queries returns empty results if no prior information of data sources is available also known as ‘zero-knowledge link traversal’. The Linked Data sources are primarily subject-centric and if object is a foreign URI then the original data sources containing query results are not reachable. Furthermore, in case of a literal at the object position will also not allow the URI lookup process to be initiated as literals are non-dereferenceable. So, there is a need to obtain some preliminary information about the data sources for such queries to be answered as zero-knowledge link traversal is not significant.

Mostly, the link traversal approach produces empty results for the SPARQL queries where the data sources mentioned in the object URI do not contain the knowledge of the incoming properties given the subject is unbound. These incoming properties are also known as ‘in-links’ or ‘backlinks’. They can be identified in an RDF data source with triples containing foreign URIs. The process of finding and maintaining of such in-links/backlinks is called ‘backlinking’ (Stefanidakis & Papadakis, 2011). The knowledge of backlinks is generally not provided in the Linked Data sources. The backlinks could drastically improve the performance of the Linked Data crawlers and query engines. However, the identification and maintenance of the in-links or backlinks is quite time consuming and cumbersome task as proved with the help of experimentation (Bai et al., 2018).

Complete Article List

Search this Journal:

Reset

Volume 20: 1 Issue (2024)

Volume 19: 1 Issue (2023)

Volume 18: 4 Issues (2022): 2 Released, 2 Forthcoming

Volume 17: 4 Issues (2021)

Volume 16: 4 Issues (2020)

Volume 15: 4 Issues (2019)

Volume 14: 4 Issues (2018)

Volume 13: 4 Issues (2017)

Volume 12: 4 Issues (2016)

Volume 11: 4 Issues (2015)

Volume 10: 4 Issues (2014)

Volume 9: 4 Issues (2013)

Volume 8: 4 Issues (2012)

Volume 7: 4 Issues (2011)

Volume 6: 4 Issues (2010)

Volume 5: 4 Issues (2009)

Volume 4: 4 Issues (2008)

Volume 3: 4 Issues (2007)

Volume 2: 4 Issues (2006)

Volume 1: 4 Issues (2005)

View Complete Journal Contents Listing

MLA

APA

Chicago

Export Reference

Hybrid Query Execution on Linked Data With Complete Results

Abstract

1. Introduction

Complete Article List