A Free Service of IGI Global Publishing House

What is Content Similarity?

Handbook of Research on Text and Web Mining Technologies
The degree of similarity between two Web sites (or Web pages), based on their textual content, that is, the terms appearing in them.
Published in Chapter:
Web Mining to Identify People of Similar Background
Quanzhi Li (Avaya, Inc., USA) and Yi-fang Brook Wu (New Jersey Institute of Technology, USA)
Copyright: © 2009 | Pages: 17
DOI: 10.4018/978-1-59904-990-8.ch023
Abstract
This chapter presents a new approach to mining the Web to identify people of similar background. Finding people on the Web who are similar to a given person raises two major research issues: how to represent a person and how to match persons. The chapter proposes a person representation method that uses a person’s personal Web site to represent that person’s background. Building on this representation, the main proposed algorithm integrates the textual content and hyperlink information of all the Web pages belonging to a personal Web site to represent a person and to match persons. Other algorithms are also explored and compared with the main proposed algorithm, and the evaluation methods and experimental results are presented.
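The definition above does not name a specific measure. As an illustration only, one common way to score term-based similarity between two pages is the cosine similarity of their term-count vectors. The sketch below assumes that measure and a simple lowercase alphanumeric tokenizer; the function names term_counts and content_similarity are hypothetical, and nothing here should be read as the method used in the chapter.

```python
import math
import re
from collections import Counter


def term_counts(text: str) -> Counter:
    """Lowercase the text and count its alphanumeric tokens.

    Assumption: a simple regex tokenizer; no stemming or stop-word removal.
    """
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def content_similarity(text_a: str, text_b: str) -> float:
    """Cosine similarity between the term-count vectors of two texts.

    Assumption: cosine similarity stands in for the unspecified measure.
    Returns a value in [0, 1]; 0.0 if either text has no terms.
    """
    a, b = term_counts(text_a), term_counts(text_b)
    # Only terms present in both texts contribute to the dot product.
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    page_a = "Research on text mining and Web mining"
    page_b = "Web mining research at a university lab"
    print(content_similarity(page_a, page_b))
```

Pages that share many terms score closer to 1, and pages with no terms in common score 0. To compare whole Web sites rather than single pages, the text of all pages in each site could be concatenated before calling content_similarity, which is again an assumption rather than something the definition prescribes.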