Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us
Newsroom

Extracting Ontology Properties from the Web-Tables

Song-il Cha, Z. M. Ma

Source Title: International Journal of Systems and Service-Oriented Engineering (IJSSOE) 3(3)

DOI: 10.4018/jssoe.2012070104

OnDemand:

(Individual Articles)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

Web-tables are ubiquitous in Web pages. Since tables themselves are organized structurally and semantically, they are good resources from which we can easily extract ontology. But, most Web-tables are designed for intuitive perception of humans, thus, it has a certain limit to interpret table content using only structural information of the table. So this paper focuses on the method for interpretation of table content based on semantic characteristics of the table. In order to obtain many property elements used for ontology inference, in this paper, the authors discuss how to extract ontology properties from Web-tables. The extracted properties include the following elements: Is-a relationship, class-instance relationship, triple, property domain, property range, symmetric property, transitive property, functional property, and inverse functional property, property for defining super-sub relationship. Through experiment, the authors show that their method can effectively extract property elements from Web-tables.

Article Preview

Top

1. Introduction

Recently, the Semantic Web comprises techniques that promise to dramatically improve the current WWW (World Wide Web) and its use. With the emergence of the Semantic Web and the growing number of heterogeneous data sources, the benefits of ontologies are becoming widely accepted. Web ontologies define terms used as data (metadata) for explaining things of a special domain. Nowadays, researchers are paying attention to automatic transformation of Internet resources in the areas into ontologies (Antoniou & van Harmelen, 2004). The Web is an enormous source of information contained in billions of individual pages. Most information resource on the Web is presented in the form of semi-structured or unstructured documents, encoded as a mixture of loosely structured natural language text and template units.

Web-tables are used mainly for structuring information, and they are the important means of presenting structured information. Table structures represent relations between data in the table. Therefore, we can easily extract ontology from a table based on the features of table structures without the use of syntax analysis (Tanaka & Ishida, 2006). However, understanding of table contents requires table structure comprehension and semantic interpretation, which exceed the complexity of corresponding linguistic tasks. Previous studies for extracting information from Web-Tables are centralized to interpret table structure: Pivk et al., (2004) focused on understanding table-like structures only due to their structural dimension, and the table model consists of Physical, Structural, Functional and Semantic components. The authors transformed the most relevant table types into F-logic frames, and also demonstrated and evaluated the successful generation of frames from HTML (HyperText Mark-up Language) tables. Jung et al., (2006) suggested a method for extracting table-schemata based on table structure and heuristics. Using this method, a table is converted into a table-schema and a triple. Moreover, since it is important to determine whether or not a table has meaningfulness that is related to the structural information provided at the level of the table head, the authors further investigated the types of tables, established the features that distinguished meaningful tables from others, built a training data set using these features and constructed a classification model using a decision tree. However, the work in Jung et al. (2006) cannot handle a table without head and also cannot extract an appropriate head using background color and font. Further, Chavan & Shirgave (2011) introduced a method for determining the meaningfulness of a table and extracting the head contents from meaningful table. The authors applied table mining to general HTML documents, separated meaningful tables from decorative tables, and extracted the information using head.

As specified above, it can be found that the work mentioned above mainly focused on how to extract information from tables, instead of investigating how to extract ontologies from Web-tables, and the detailed ontology extraction rules are not provided. In addition, most of research endeavors to interpret the table by using structural characteristics of the table. But, many Web-tables are designed by humans, thus, it has a certain limit to automatically interpret tables using only structural information of the table. Jung et al., (2007) detected that, generally, a table provides a semantic core element in a head, and proposed a method for obtaining class-instance relationship and triple using heuristics for extracting table schemata based on semantic core element. Although the authors in Jung et al. (2007) proposed heuristics for detecting semantic characteristics based on the location of table cells, they did not mention which becomes semantic core element.

Complete Article List

Search this Journal:

Reset

Volume 13: 1 Issue (2024): Forthcoming, Available for Pre-Order

Volume 12: 2 Issues (2022): 1 Released, 1 Forthcoming

Volume 11: 2 Issues (2021)

Volume 10: 2 Issues (2020)

Volume 9: 2 Issues (2019)

Volume 8: 4 Issues (2018)

Volume 7: 4 Issues (2017)

Volume 6: 4 Issues (2016)

Volume 5: 4 Issues (2015)

Volume 4: 4 Issues (2014)

Volume 3: 4 Issues (2012)

Volume 2: 4 Issues (2011)

Volume 1: 4 Issues (2010)

View Complete Journal Contents Listing

MLA

APA

Chicago

Export Reference

Extracting Ontology Properties from the Web-Tables

Abstract

1. Introduction

Complete Article List