Application of Sequential Pattern Mining Algorithm in Commodity Management

Application of Sequential Pattern Mining Algorithm in Commodity Management

Xiaoli Wang (Mudanjiang Medical University, Mudanjiang, China), Fang Wang (Mudanjiang Medical University, Mudanjiang, China), Shi Yan (Mudanjiang Medical University, Mudanjiang, China) and ZhanBo Liu (Mudanjiang Medical University, Mudanjiang, China)
Copyright: © 2018 |Pages: 13
DOI: 10.4018/JECO.2018070108

Abstract

This article describes how by analyzing historical sales data of supermarkets, once data is cleaned, sampled and put through a series of operations, it can be transformed into a sequence database. Finally, the data is used in the SPM of Map-Reduce algorithm to data mining. This experiment has two stages. In the first stage, the sequences of product categories are mined to place the product categories. In the second stage, the products are mined for each category, add profit targets for calculating sequential pattern values. This is so that it can reorder the results to find the most profit products. Thus, readers can adjust products placement and improve the profits of the supermarket.
Article Preview

1. Introduction

In recent years, due to the rapid development of the Internet, many industries have accumulated a huge amount of data, and are showing explosive growth. At present, the speed of data analysis and utilization of database technology is far behind the speed of production and storage. In addition, the results after the database analysis cannot be directly used, still need manual analysis. In this context, data mining technology emerges as the times require. Data mining technology can not only effectively deal with a large number of historical and current data, and to dig out the valuable information from the massive historical data, so as to the actual production and operation, to provide guidance for the development (Wang & Ding, 2009; Agrawal & Srikant, 1995).

Data mining technology can not only process tremendous historical and existing data, and also discover valuable information from such huge historical data, providing guidance to the practical production, operation and development (Gong, Liu & Jia, 2011; Dong & Wen, 2009). Sequence pattern mining, as a significant research topic in the data mining field, is a knowledge discovery process in which frequent sub-sequence is found from sequential database to use as pattern (Hu, 2009; Wang & Fan, 2009; Li, Wang & Chen, 2013) Sequence pattern mining is of practicability and easy comprehension so that it gains wide concern and deep investigation. Through researches in recent years, some typical sequence pattern mining algorithms have been generated. They improved to a certain degree the efficiency of data exploration, however with the age of big data approaching, the scale of data set has been becoming bigger and bigger in more and more complicated structure (Lv & Zhang, 2006; Lin, 2013; Zhang, Hu & Chen, 2007). When traditional sequence pattern mining algorithm is doing data mining, the existence of enormous irrelevant and redundant data increases space-time costs and sometimes causes memory overflow, weakening greatly the performance of traditional sequence pattern mining technique. Besides, sequence patterns excavated by the traditional algorithm are of low quality, unable to meet customer’s actual requirements, no useful information mined. When it enters into the era of big data, the commonest issue facing lots of enterprises is “vast information but poor information”. How to dig out the most valuable information from massive big data collection has become a research focus at present and an issue to be solved urgently (Jiang, Wang & Huang, 2015; Fu &Yang, 2015). The paper proposed SPM of Map-Reduce algorithm based on Map-Reduce and used it in real big dataset. Valuable information is discovered from big dataset and used to guide practical business activities.

Complete Article List

Search this Journal:
Reset
Open Access Articles
Volume 17: 4 Issues (2019): 2 Released, 2 Forthcoming
Volume 16: 4 Issues (2018)
Volume 15: 4 Issues (2017)
Volume 14: 4 Issues (2016)
Volume 13: 4 Issues (2015)
Volume 12: 4 Issues (2014)
Volume 11: 4 Issues (2013)
Volume 10: 4 Issues (2012)
Volume 9: 4 Issues (2011)
Volume 8: 4 Issues (2010)
Volume 7: 4 Issues (2009)
Volume 6: 4 Issues (2008)
Volume 5: 4 Issues (2007)
Volume 4: 4 Issues (2006)
Volume 3: 4 Issues (2005)
Volume 2: 4 Issues (2004)
Volume 1: 4 Issues (2003)
View Complete Journal Contents Listing