Web Information Extraction via Web Views

Wee Keong Ng; Zehua Liu; Zhao Li; Ee Peng Lim

doi:10.4018/978-1-59904-945-8.ch019

Access Full-Text Recommend to Your Library

Buy Instant Access to This Chapter

Instant access upon order completion

Add to Cart

Share

Recommend to Librarian Recommend to Colleague Fair Use Policy

Free Content

Sample PDF

More Information

Rights & Permissions

Favorite Cite Chapter

MLA

Ng, Wee Keong, et al. "Web Information Extraction via Web Views." End-User Computing: Concepts, Methodologies, Tools, and Applications, edited by Steve Clarke, IGI Global Scientific Publishing, 2008, pp. 211-238. https://doi.org/10.4018/978-1-59904-945-8.ch019

APA

Ng, W. K., Liu, Z., Li, Z., & Lim, E. P. (2008). Web Information Extraction via Web Views. In S. Clarke (Ed.), End-User Computing: Concepts, Methodologies, Tools, and Applications (pp. 211-238). IGI Global Scientific Publishing. https://doi.org/10.4018/978-1-59904-945-8.ch019

Chicago

Ng, Wee Keong, et al. "Web Information Extraction via Web Views." In End-User Computing: Concepts, Methodologies, Tools, and Applications, edited by Steve Clarke, 211-238. Hershey, PA: IGI Global Scientific Publishing, 2008. https://doi.org/10.4018/978-1-59904-945-8.ch019

Export Reference

For Librarians

Web Information Extraction via Web Views

Wee Keong Ng (Nanyang Technological University, Singapore), Zehua Liu (Nanyang Technological University, Singapore), Zhao Li (Nanyang Technological University, Singapore), and Ee Peng Lim (Nanyang Technological University, Singapore)

Source Title: End-User Computing: Concepts, Methodologies, Tools, and Applications

DOI: 10.4018/978-1-59904-945-8.ch019

Abstract

With the explosion of information on the Web, traditional ways of browsing and keyword searching of information over web pages no longer satisfy the demanding needs of web surfers. Web information extraction has emerged as an important research area that aims to automatically extract information from target web pages and convert them into a structured format for further processing. The main issues involved in the extraction process include: (1) the definition of a suitable extraction language; (2) the definition of a data model representing the web information source; (3) the generation of the data model, given a target source; and (4) the extraction and presentation of information according to a given data model. In this chapter, we discuss the challenges of these issues and the approaches that current research activities have taken to revolve these issues. We propose several classification schemes to classify existing approaches of information extraction from different perspectives. Among the existing works, we focus on the Wiccap system — a software system that enables ordinary end-users to obtain information of interest in a simple and efficient manner by constructing personalized web views of information sources.

Complete Chapter List

Search this Book:

Reset