Reference Hub4
Text-to-Text Similarity of Sentences

Text-to-Text Similarity of Sentences

Vasile Rus, Mihai Lintean, Arthur C. Graesser, Danielle S. McNamara
ISBN13: 9781609607418|ISBN10: 1609607414|EISBN13: 9781609607425
DOI: 10.4018/978-1-60960-741-8.ch007
Cite Chapter Cite Chapter

MLA

Rus, Vasile, et al. "Text-to-Text Similarity of Sentences." Applied Natural Language Processing: Identification, Investigation and Resolution, edited by Philip M. McCarthy and Chutima Boonthum-Denecke, IGI Global, 2012, pp. 110-121. https://doi.org/10.4018/978-1-60960-741-8.ch007

APA

Rus, V., Lintean, M., Graesser, A. C., & McNamara, D. S. (2012). Text-to-Text Similarity of Sentences. In P. McCarthy & C. Boonthum-Denecke (Eds.), Applied Natural Language Processing: Identification, Investigation and Resolution (pp. 110-121). IGI Global. https://doi.org/10.4018/978-1-60960-741-8.ch007

Chicago

Rus, Vasile, et al. "Text-to-Text Similarity of Sentences." In Applied Natural Language Processing: Identification, Investigation and Resolution, edited by Philip M. McCarthy and Chutima Boonthum-Denecke, 110-121. Hershey, PA: IGI Global, 2012. https://doi.org/10.4018/978-1-60960-741-8.ch007

Export Reference

Mendeley
Favorite

Abstract

Assessing the semantic similarity between two texts is a central task in many applications, including summarization, intelligent tutoring systems, and software testing. Similarity of texts is typically explored at the level of word, sentence, paragraph, and document. The similarity can be defined quantitatively (e.g. in the form of a normalized value between 0 and 1) and qualitatively in the form of semantic relations such as elaboration, entailment, or paraphrase. In this chapter, we focus first on measuring quantitatively and then on detecting qualitatively sentence-level text-to-text semantic relations. A generic approach that relies on word-to-word similarity measures is presented as well as experiments and results obtained with various instantiations of the approach. In addition, we provide results of a study on the role of weighting in Latent Semantic Analysis, a statistical technique to assess similarity of texts. The results were obtained on two data sets: a standard data set on sentence-level paraphrase detection and a data set from an intelligent tutoring system.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.