Abstract
The clustering techniques suffer from cluster centers initialization and local optima problems. In this chapter, the new metaheuristic algorithm, Sine Cosine Algorithm (SCA), is used as a search method to solve these problems. The SCA explores the search space of given dataset to find out the near-optimal cluster centers. The center based encoding scheme is used to evolve the cluster centers. The proposed SCA-based clustering technique is evaluated on four real-life datasets. The performance of SCA-based clustering is compared with recently developed clustering techniques. The experimental results reveal that SCA-based clustering gives better values in terms of cluster quality measures.
TopBackground
This section describes the related concepts of cluster analysis and related works on metaheuristics-based data clustering techniques.
Key Terms in this Chapter
Clustering: An unsupervised technique for grouping the dataset into classes of similar data.
Exploitation: The ability of finding the optimal solution around a good solution.
Validity Index: Used to measure the goodness of a clustering results comparing to other ones which are created by other clustering algorithms.
Optimization: An act or process of finding an alternative with the most cost effective or highest performance under the given constraints.
Exploration: An act of searching for the purpose of discover unknown information.
Cluster: A collection of data points that are similar to one another within the same cluster and are dissimilar to data points in other clusters.
Metaheuristic: A general algorithmic framework which can be applied to different optimization problems with relatively few modifications to make them adapted to a specific problem.