Content-Determined Web Page Segmentation and Navigation for Mobile Web Searching

Content-Determined Web Page Segmentation and Navigation for Mobile Web Searching

DOI: 10.4018/978-1-7998-0961-6.ch007
OnDemand:
(Individual Chapters)
Available
$37.50
No Current Special Offers
TOTAL SAVINGS: $37.50

Abstract

Nowadays the usage of mobile phones is widely spread in our lifestyle; we use cell phones as a camera, a radio, a music player, and even as a web browser. Since most web pages are created for desktop computers, navigating through web pages is highly fatigued. Hence, there is a great interest in computer science to adopt such pages with rich content into small screens of our mobile devices. On the other hand, every web page has got many different parts that do not have the equal importance to the end user. Consequently, the authors propose a mechanism to identify the most useful part of a web page to a user regarding his or her search query while the information loss is avoided. The challenge here comes from the fact that long web contents cannot be easily displayed in both vertical and horizontal ways.
Chapter Preview
Top

Introduction

Web searching on a mobile device, especially based on a cell phone is very common and convenient. Since most Web pages are with rich content and mainly are specifically developed for desktop computers, it is hard to navigate them with (relatively) small screen of handled devices. Hence, there is a great interest in computer science to adopt such pages with rich content into small screens of mobile devices. However, a Web page has different parts that do not have equal importance to the end user. This fact made us to consider a novel approach to identify the most useful portion of a Web page to the end user regarding its search query. Typically, mobile phones are common in the following characteristics:

  • Small Screen: limited screen area compared to a conventional PC.

  • Limited Keyboard: Limited area for interaction with the device. Therefore it is necessary to put the relevant part at first.

  • Distracted User: A desktop user is mainly in an (relatively) isolated environment with focused attention to its working. Whereas, a smartphone user may be in an open area with distractive surrounding which makes him or her to devote only a portion of his or her attention to the device. Consequently, less scrolling is desired.

Top

Content-Distance Model

Authors in (Sugaya, 2007) have proposed the architecture of a page segmentation based on content-distance. The approach is basically based on checking a page score and the distance of the content according to the distance in the source code of the corresponding Web page. Issues with this approach can be categorized as followings:

  • The distance between elements in the code and between corresponding elements in the actual page might be different, i.e., the distance between 2 cells in the source code vs. the distance of corresponding cells in the actual table in the actual Web page.

  • Most Web pages contain numerous parts which makes the introduced method slow and time-intensive.

  • The low accuracy of the proposed method (55-70% based on the authors report) makes it not a desirable technique.

The proposed solution (a hybrid version of the framework) still can be time-intensive for the fact it needs to check a considerable sets of conditions.

Efforts on displaying web pages on small screens fall into following categories:

Complete Chapter List

Search this Book:
Reset