Reference Hub20
Logical Structure Recovery in Scholarly Articles with Rich Document Features

Logical Structure Recovery in Scholarly Articles with Rich Document Features

Minh-Thang Luong, Thuy Dung Nguyen, Min-Yen Kan
ISBN13: 9781466609006|ISBN10: 1466609001|EISBN13: 9781466609013
DOI: 10.4018/978-1-4666-0900-6.ch014
Cite Chapter Cite Chapter

MLA

Luong, Minh-Thang, et al. "Logical Structure Recovery in Scholarly Articles with Rich Document Features." Multimedia Storage and Retrieval Innovations for Digital Library Systems, edited by Chia-Hung Wei, et al., IGI Global, 2012, pp. 270-292. https://doi.org/10.4018/978-1-4666-0900-6.ch014

APA

Luong, M., Nguyen, T. D., & Kan, M. (2012). Logical Structure Recovery in Scholarly Articles with Rich Document Features. In C. Wei, Y. Li, & C. Gwo (Eds.), Multimedia Storage and Retrieval Innovations for Digital Library Systems (pp. 270-292). IGI Global. https://doi.org/10.4018/978-1-4666-0900-6.ch014

Chicago

Luong, Minh-Thang, Thuy Dung Nguyen, and Min-Yen Kan. "Logical Structure Recovery in Scholarly Articles with Rich Document Features." In Multimedia Storage and Retrieval Innovations for Digital Library Systems, edited by Chia-Hung Wei, Yue Li, and Chih-Ying Gwo, 270-292. Hershey, PA: IGI Global, 2012. https://doi.org/10.4018/978-1-4666-0900-6.ch014

Export Reference

Mendeley
Favorite

Abstract

Scholarly digital libraries increasingly provide analytics to information within documents themselves. This includes information about the logical document structure of use to downstream components, such as search, navigation, and summarization. In this paper, the authors describe SectLabel, a module that further develops existing software to detect the logical structure of a document from existing PDF files, using the formalism of conditional random fields. While previous work has assumed access only to the raw text representation of the document, a key aspect of this work is to integrate the use of a richer representation of the document that includes features from optical character recognition (OCR), such as font size and text position. Experiments reveal that using such rich features improves logical structure detection by a significant 9 F1 points, over a suitable baseline, motivating the use of richer document representations in other digital library applications.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.