Pattern Retrieval through Classification from Pattern Warehouse: Issues and Challenges

Pattern Retrieval through Classification from Pattern Warehouse: Issues and Challenges

Ramjeevan Singh Thakur (Department of Computer Application, Maulana Azad National Institute of Technology, Bhopal, India) and Vivek Tiwari (Department of Computer Science and Application, Maulana Azad National Institute of Technology, Bhopal, India)
Copyright: © 2014 |Pages: 10
DOI: 10.4018/ijbir.2014070101
OnDemand PDF Download:
$30.00
List Price: $37.50

Abstract

The pattern is special kinds of data which are created through various data mining techniques and stored in the pattern warehouse through a specialized pattern management system (PMS). Pattern warehouse makes the pattern non-volatile or persists. Now a day's persistent pattern retrieval is a very new and important issue. This paper focuses on problems and challenges with pattern retrieval. One can see the applicability of classification in pattern retrieval as an opportunity and trying to bring attention to probable issues and challenges behind the physical implementation of this concept. This paper concluded that the applicability of classification in pattern retrieval is well feasible. It has also discussed that how of pattern classification is different with data's classification. Classification method should be defined in such a way that it can handle pattern efficiently. So far, little emphasis has been posed on developing an overall classification system for pattern retrieval. This paper concerns only association kinds of patterns. It has presented some issues regarding (i) Decision boundary of pattern classes. (ii) Problem of calculating a reliable estimate of pattern classes. (iii) How to define class boundary (iv) How to handle overlapping of pattern classes (v) Parameter selection for pattern classes estimation (v) Preprocessing of patterns (vi) How to handle classification on demand. (vii) Updating of pattern classes (vii) Finding optimal test conditions.
Article Preview

1. Introduction

Now a day’s many applications generating huge volumes of data (Vazirgiannis et al., 2003). Many techniques have been developed to extract knowledge, especially in the context of data mining. The results of such operations are abstract and compact representations of the original data, which called patterns (Catania, 2004). The pattern gives the semantic representation of raw data. There is required raw data for pattern and vice-versa mapping (Zdenka, 2012). The volume of extracting patterns from various knowledge discovery applications is increasing rapidly, so there is need for effective and efficient pattern management system (Fernández et al.,2011; Mohammad et al., 2009; Vazirgiannis et al., 2003). Patterns can come from clustering, association rules, classification, probabilistic rules and many other ways. Moreover, patterns can be found in images, signals, text and music and of course in the World Wide Web (Barbara and Anna, 2005). Theses all kinds of patterns are stored in pattern warehouse through PBMS (Pattern Base Management system) (Catania, 2004; Jaesoon et al., 2002; Manolis et al., 2007). There are no tools or systems that are designed to deal exclusively with pattern retrieval. All current technologies just try to extend the existing query system to get patterns, but this is not effective because of a special characteristic of patterns and its diversity (Evangelos et al., 2005; Rizzi, et al., 2003; Rizzi, 2004). Pattern retrieval can be improved through applying data mining techniques itself. Problem of diversity or complex structure can be solved by grouping relevant patterns. It is called classification of patterns. This task is not easy because patterns are semantically rich and have different and specialized structure. The main issue of this paper is revealing the difficulties with classification of association type patterns in the context of pattern retrieval. Association rule mining is a popular method for discovering interesting relations between variables in large databases (Thakur et al., 2006; Tiwari and vipin, 2010). Patterns can be a set of association rules discovered from databases using different measures of interestingness (Riccardo et al., 2011).

The objectives of this paper are to discuss the need of pattern management, benefit of pattern classification in context of retrieval and discuss the problems and challenges with pattern classification. The aim only point out the problems which may occur in applying data mining techniques for pattern retrieval on pattern warehouse rather than generating patterns itself. Classification is one of the powerful data mining techniques. For simplicity, research work revolves around the challenges with the classification of patterns.

Complete Article List

Search this Journal:
Reset
Open Access Articles: Forthcoming
Volume 8: 2 Issues (2017): 1 Released, 1 Forthcoming
Volume 7: 2 Issues (2016)
Volume 6: 2 Issues (2015)
Volume 5: 4 Issues (2014)
Volume 4: 4 Issues (2013)
Volume 3: 4 Issues (2012)
Volume 2: 4 Issues (2011)
Volume 1: 4 Issues (2010)
View Complete Journal Contents Listing