Utility and Significance of Vague Set Theory and Advanced Optimization Mechanisms for Uncertainty Management

Utility and Significance of Vague Set Theory and Advanced Optimization Mechanisms for Uncertainty Management

Sowkarthika B (Madhav Institute of Technology and Science, India), Akhilesh Tiwari (Madhav Institute of Technology and Science, India), R. K. Gupta (Madhav Institute of Technology and Science, India) and Uday Pratap Singh (Madhav Institute of Technology and Science, India)
Copyright: © 2018 |Pages: 29
DOI: 10.4018/978-1-5225-3531-7.ch010

Abstract

In this digital world, tremendous data are generated in every field. Useful information is inferred out of this data, which is valuable for effective decision making. Data mining extracts the interesting information from huge volumes of data. Association rule (AR) mining is one of the core areas of data mining where interesting information is extracted in the form of rules. Traditional AR mining is incapable of handling uncertain situations. In order to handle uncertainty, mathematical tools like vague theory can be utilized with AR mining methodologies for the development of novel vague theory based algorithms, which will be more suitable in effectively handling vague situations that helps framing effective selling strategy. Since an organization can't analyze the huge rule set obtained from these algorithms, every resultant rule should have a certain ratio of factors customized to the interest of the organization that can be achieved through optimization algorithms. This chapter explores the significance of vague theory and optimization means for effective uncertainty management.
Chapter Preview
Top

Introduction

Data is being generated and gathered in every field which results in an increase in the size of the data. Any organization is not in need of data. It needs information that is inferred out of data. That is, the data is useless until and unless it is converted to useful information. Since manual examination is entirely incomprehensible with huge volume of data, automated tools came into the picture. Data Mining is an automated tool which helps in extracting useful or interesting patterns or information from huge volumes of data like database or other data repositories. There are various tasks pertaining to data mining. Some of the essential tasks are Association Rule mining (AR mining), Classification and Clustering. Since classification and clustering need AR mining to be performed, and AR mining has direct relevance with real-life scenarios, the authors focus more on AR mining.

AR mining is useful in examining and analyzing the customer’s behavior. Association in retail store refers to the patterns that contain items that are frequently purchased together. It gives the frequently occurring patterns in the transaction data. This basically gives the relationship between the products that is useful in strategic business decisions and product marketing. The problem of AR mining is defined by Agrawal (Han & Kamber, 2006) as:

Let I = {i1,i2,….in}{\displaystyle I=\{i_{1},i_{2},\ldots, i_{n}\}} be a set of n {\displaystyle n}nnnbinary attributes called items.Let D = {t1,t2,…,tn}{\displaystyle D=\{t_{1},t_{2},\ldots, t_{m}\}} be a set of transactions called the database.

Each transaction in {\displaystyle D}DDD has a unique transaction identifier TID and contains a subset of the items in I. IA rule is defined as an implication of the form {\displaystyle X\Rightarrow Y}AB where A, B are the set of items i.e., A,BI . Also, AB = ϕ. For example, from BreadJam, the retailer could get an idea that whenever Bread is purchased, Jam is also purchased. Based on the rule generated, some discount could be offered or the store layout could be changed accordingly thereby boosting the sales. Association rules are represented by ‘if-then’ rules. In AB, A is the antecedent and B is the consequent and it implies ‘if A is purchased, then B is more likely to be purchased’. Antecedent is the set of items in the database and consequent is the set of items that are in relationship with the antecedent. The two important criteria in identifying a relationship and making a rule interesting are support and confidence. AR is considered interesting only when it satisfies minimum support and minimum confidence thresholds. The thresholds are set by the domain experts or users who have rich knowledge of the system. These play a key role in making any business decisions. A strong association rule is generated if the support and confidence of a rule are greater than minimum support and minimum confidence thresholds.

Support gives how frequent A and B occur in the database of transactions D. Support of the rule is the probability of the occurrence of items A and B in the database D.

Confidence is measured as the fraction of the transactions containing A also contain B. This is the conditional probability.

Here, the domain under discussion is market-basket analysis where, for a given database of customer transactions with each transaction containing set of items, the goal is to find groups of items which are frequently purchased together. A set of items is referred to as itemset. A rule that contains k-items and these items are occurring frequently together in the given set of transactions is referred to as k-frequent itemset. It is represented by Lk.

Complete Chapter List

Search this Book:
Reset