Mining Free Text for Structure

Vladimir A. Kulyukin; Robin Burke

doi:10.4018/978-1-59140-051-6.ch012

Access Full-Text Recommend to Your Library

Buy Instant Access to This Chapter

Instant access upon order completion

Add to Cart

Share

Recommend to Librarian Recommend to Colleague Fair Use Policy

Free Content

Sample PDF

More Information

Rights & Permissions

Favorite Cite Chapter

MLA

Kulyukin, Vladimir A., and Robin Burke. "Mining Free Text for Structure." Data Mining: Opportunities and Challenges, edited by John Wang, IGI Global Scientific Publishing, 2003, pp. 278-300. https://doi.org/10.4018/978-1-59140-051-6.ch012

APA

Kulyukin, V. A. & Burke, R. (2003). Mining Free Text for Structure. In J. Wang (Ed.), Data Mining: Opportunities and Challenges (pp. 278-300). IGI Global Scientific Publishing. https://doi.org/10.4018/978-1-59140-051-6.ch012

Chicago

Kulyukin, Vladimir A., and Robin Burke. "Mining Free Text for Structure." In Data Mining: Opportunities and Challenges, edited by John Wang, 278-300. Hershey, PA: IGI Global Scientific Publishing, 2003. https://doi.org/10.4018/978-1-59140-051-6.ch012

Export Reference

For Librarians

Mining Free Text for Structure

Vladimir A. Kulyukin (Utah State University, USA) and Robin Burke (DePaul University, USA)

Source Title: Data Mining: Opportunities and Challenges

DOI: 10.4018/978-1-59140-051-6.ch012

Abstract

Knowledge of the structural organization of information in documents can be of significant assistance to information systems that use documents as their knowledge bases. In particular, such knowledge is of use to information retrieval systems that retrieve documents in response to user queries. This chapter presents an approach to mining free-text documents for structure that is qualitative in nature. It complements the statistical and machine-learning approaches, insomuch as the structural organization of information in documents is discovered through mining free text for content markers left behind by document writers. The ultimate objective is to find scalable data mining (DM) solutions for free-text documents in exchange for modest knowledge-engineering requirements. The problem of mining free text for structure is addressed in the context of finding structural components of files of frequently asked questions (FAQs) associated with many USENET newsgroups. The chapter describes a system that mines FAQs for structural components. The chapter concludes with an outline of possible future trends in the structural mining of free text.

Complete Chapter List

Search this Book:

Reset