Search the World's Largest Database of Information Science & Technology Terms & Definitions
InfInfoScipedia LogoScipedia
A Free Service of IGI Global Publishing House
Below please find a list of definitions for the term that
you selected from multiple scholarly research resources.

What is Content Similarity Measure

Handbook of Research on Text and Web Mining Technologies
At the core of text summarization evaluation is the measuring of content similarity between two summaries. The content similarity measure can be lexical (based on word or sentence units, e.g. cosine similarity measures) or semantic (based on semantic content units, e.g. Summarization Content Units in the pyramid method).
Published in Chapter:
Performance Evaluation Measures for Text Mining
Hanna Suominen (Turku Centre for Computer Science (TUCS), Finland & University of Turku, Finland)
Copyright: © 2009 |Pages: 24
DOI: 10.4018/978-1-59904-990-8.ch041
Abstract
The purpose of this chapter is to provide an overview of prevalent measures for evaluating the quality of system output in seven key text mining task domains. For each task domain, a selection of widely used, well applicable measures is presented, and their strengths and weaknesses are discussed. Performance evaluation is essential for text mining system development and comparison, but the selection of a suitable performance evaluation measure is not a straightforward task. Therefore this chapter also attempts to give guidelines for measure selection. As measures are under constant development in many task domains and it is important to take the task domain characteristics and conventions into account, references to relevant performance evaluation events and literature are provided.
Full Text Chapter Download: US $37.50 Add to Cart
eContent Pro Discount Banner
InfoSci OnDemandECP Editorial ServicesAGOSR