A Graph-Based Biomedical Literature Clustering Approach Utilizing Term's Global and Local Importance Information

A Graph-Based Biomedical Literature Clustering Approach Utilizing Term's Global and Local Importance Information

Zhang Xiaodan (Drexel University, USA), Hu Xiaohua (Drexel University, USA & Jiangxi University of Finance and Economics, China), Xia Jiali (Jiangxi University of Finance and Economics, China), Zhou Xiaohua (Drexel University, USA) and Achananuparp Palakorn (Drexel University, USA)
DOI: 10.4018/978-1-60566-717-1.ch008
OnDemand PDF Download:
$30.00
List Price: $37.50

Abstract

In this article, we present a graph-based knowledge representation for biomedical digital library literature clustering. An efficient clustering method is developed to identify the ontology-enriched k-highest density term subgraphs that capture the core semantic relationship information about each document cluster. The distance between each document and the k term graph clusters is calculated. A document is then assigned to the closest term cluster. The extensive experimental results on two PubMed document sets (Disease10 and OHSUMED23) show that our approach is comparable to spherical k-means. The contributions of our approach are the following: (1) we provide two corpus-level graph representations to improve document clustering, a term co-occurrence graph and an abstract-title graph; (2) we develop an efficient and effective document clustering algorithm by identifying k distinguishable class-specific core term subgraphs using terms’ global and local importance information; and (3) the identified term clusters give a meaningful explanation for the document clustering results.

Complete Chapter List

Search this Book:
Reset