New Formats and Interfaces for Multi-Document News Summarization and its Evaluation

New Formats and Interfaces for Multi-Document News Summarization and its Evaluation

Bettina Berendt (KU Leuven, Belgium), Mark Last (Ben-Gurion University of the Negev, Israel), Ilija Subašić (KU Leuven, Belgium) and Mathias Verbeke (KU Leuven, Belgium)
DOI: 10.4018/978-1-4666-5019-0.ch010


News production, delivery, and consumption are increasing in ubiquity and speed, spreading over more software and hardware platforms, in particular mobile devices. This has led to an increasing interest in automated methods for multi-document summarization. The authors start this chapter with discussing several new alternatives for automated news summarization, with a particular focus on temporal text mining, graph-based methods, and graphical interfaces. Then they present automated and user-centric frameworks for cross-evaluating summarization methods that output different summary formats and describe the challenges associated with each evaluation framework. Based on the results of the user studies, the authors argue that it is crucial for effective summarization to integrate the user into sense-making through usable, entertaining, and ultimately useful interactive summarization-plus-document-search interfaces. In particular, graph-based methods and interfaces may be a better preparation for people to concentrate on what is essential in a collection of texts, and thus may be a key to enhancing the summary evaluation process by replacing the “one gold standard fits all” approach with carefully designed user studies built upon a variety of summary representation formats.
Chapter Preview

2. Key Concepts And Formats In Text Summarization

Extractive summarization aims at the selection of a subset of the most relevant fragments from a source text into the summary. The fragments can be paragraphs (Salton, Singhal, Mitra, & Buckley, 1997), sentences (Luhn, 1958), keyphrases (Turney, 2000; Litvak, Aizenman, Gobits, Last, & Kandel, 2011) or keywords (Litvak & Last, 2008). Extractive summarization usually consists of ranking, where each fragment of a summarized text gets a relevance score, and extraction, where the top-ranked fragments are gathered into a summary, according to their appearance in the original text.

Complete Chapter List

Search this Book: