Interval Set Representations of Clusters
Pawan Lingras (Saint Mary’s University, Canada), Rui Yan (Saint Mary’s University, Canada), Mofreh Hogo (Czech Technical University, Czech Republic) and Chad West (IBM Canada Limited, Canada)
Copyright: © 2005
The amount of information that is available in the new information age has made it necessary to consider various summarization techniques. Classification, clustering, and association are three important data-mining features. Association is concerned with finding the likelihood of co-occurrence of two different concepts. For example, the likelihood of a banana purchase given that a shopper has bought a cake. Classification and clustering both involve categorization of objects. Classification processes a previously known categorization of objects from a training sample so that it can be applied to other objects whose categorization is unknown. This process is called supervised learning. Clustering groups objects with similar characteristics. As opposed to classification, the grouping process in clustering is unsupervised. The actual categorization of objects, even for a sample, is unknown. Clustering is an important step in establishing object profiles.