Mining Text Documents for Thematic Hierarchies Using Self-Organizing Maps

Mining Text Documents for Thematic Hierarchies Using Self-Organizing Maps

Hsin-Chang Yang, Chung-Hong Lee
Copyright: © 2003 |Pages: 21
DOI: 10.4018/978-1-59140-051-6.ch008
(Individual Chapters)
No Current Special Offers


Recently, many approaches have been devised for mining various kinds of knowledge from texts. One important application of text mining is to identify themes and the semantic relations among these themes for text categorization. Traditionally, these themes were arranged in a hierarchical manner to achieve effective searching and indexing as well as easy comprehension for human beings. The determination of category themes and their hierarchical structures was mostly done by human experts. In this work, we developed an approach to automatically generate category themes and reveal the hierarchical structure among them. We also used the generated structure to categorize text documents. The document collection was trained by a self-organizing map to form two feature maps. We then analyzed these maps and obtained the category themes and their structure. Although the test corpus contains documents written in Chinese, the proposed approach can be applied to documents written in any language, and such documents can be transformed into a list of separated terms.

Complete Chapter List

Search this Book: