A Framework for Efficient Association Rule Mining in XML Data

Ji Zhang, Han Liu, Tok Wang Ling, Robert M. Bruckner, A Min Tjoa
ISBN13: 9781605660585 | ISBN10: 1605660582 | EISBN13: 9781605660592
DOI: 10.4018/978-1-60566-058-5.ch032
MLA

Zhang, Ji, et al. "A Framework for Efficient Association Rule Mining in XML Data." Database Technologies: Concepts, Methodologies, Tools, and Applications, edited by John Erickson, IGI Global, 2009, pp. 505-526. https://doi.org/10.4018/978-1-60566-058-5.ch032

APA

Zhang, J., Liu, H., Ling, T. W., Bruckner, R. M., & Tjoa, A. M. (2009). A Framework for Efficient Association Rule Mining in XML Data. In J. Erickson (Ed.), Database Technologies: Concepts, Methodologies, Tools, and Applications (pp. 505-526). IGI Global. https://doi.org/10.4018/978-1-60566-058-5.ch032

Chicago

Zhang, Ji, et al. "A Framework for Efficient Association Rule Mining in XML Data." In Database Technologies: Concepts, Methodologies, Tools, and Applications, edited by John Erickson, 505-526. Hershey, PA: IGI Global, 2009. https://doi.org/10.4018/978-1-60566-058-5.ch032

Abstract

In this article, we propose a framework, called XAR-Miner, for efficiently mining association rules (ARs) from XML documents. In XAR-Miner, the raw data in the XML document are first preprocessed and transformed into either an Indexed XML Tree (IX-tree) or Multirelational Databases (Multi-DB), depending on the size of the XML document and the memory constraints of the system, to support efficient data selection and AR mining. Concepts that are relevant to the AR mining task are then generalized to produce generalized metapatterns. A metric is devised for measuring the degree of concept generalization in order to prevent undergeneralization or overgeneralization. The resulting generalized metapatterns are used to generate large ARs that meet the support and confidence levels. A greedy algorithm is also presented that integrates data selection and large itemset generation to enhance the efficiency of the AR mining process. The experiments conducted show that XAR-Miner performs a large number of AR mining tasks on XML documents more efficiently than the state-of-the-art method, which repeatedly scans the XML documents to perform each mining task.
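
To make the support and confidence thresholds mentioned in the abstract concrete, the following is a minimal sketch of the naive baseline: scan an XML document directly, treat each record as a transaction, and keep the pairwise rules that meet both thresholds. It is not XAR-Miner itself and does not build an IX-tree, a Multi-DB, or generalized metapatterns. The sample document, its element names (orders, order, item), and the function names are all hypothetical and chosen only for illustration.

```python
import xml.etree.ElementTree as ET
from itertools import combinations

# Hypothetical sample data; element names are illustrative, not from the chapter.
XML_DOC = """
<orders>
  <order><item>bread</item><item>milk</item></order>
  <order><item>bread</item><item>butter</item><item>milk</item></order>
  <order><item>butter</item><item>milk</item></order>
  <order><item>bread</item><item>butter</item></order>
</orders>
"""


def load_transactions(xml_text):
    """Scan the XML once and return one set of items per <order> element."""
    root = ET.fromstring(xml_text)
    return [
        frozenset(item.text for item in order.findall("item"))
        for order in root.findall("order")
    ]


def support(itemset, transactions):
    """Fraction of transactions that contain every item in itemset."""
    hits = sum(1 for t in transactions if itemset <= t)
    return hits / len(transactions)


def mine_rules(transactions, min_support, min_confidence):
    """Return (lhs, rhs, support, confidence) for single-item rules lhs -> rhs."""
    items = sorted(set().union(*transactions))
    rules = []
    for a, b in combinations(items, 2):
        pair_support = support(frozenset((a, b)), transactions)
        if pair_support < min_support:
            continue  # the pair is not a large itemset
        for lhs, rhs in ((a, b), (b, a)):
            # confidence(lhs -> rhs) = support(lhs and rhs) / support(lhs)
            confidence = pair_support / support(frozenset((lhs,)), transactions)
            if confidence >= min_confidence:
                rules.append((lhs, rhs, pair_support, confidence))
    return rules


if __name__ == "__main__":
    transactions = load_transactions(XML_DOC)
    for lhs, rhs, s, c in mine_rules(transactions, 0.5, 0.6):
        print(f"{lhs} -> {rhs}  support={s:.2f}  confidence={c:.2f}")
```

Each mining task in this sketch re-reads and re-counts the whole document, which is the repeated-scan cost the abstract says XAR-Miner is designed to avoid by preprocessing the data once.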
