Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

A Taxonomy of Data Mining Problems

Nayem Rahman

Source Title: International Journal of Business Analytics (IJBAN) 5(2)

DOI: 10.4018/IJBAN.2018040105

OnDemand:

(Individual Articles)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

Much of the research in data mining and knowledge discovery has focused on the development of efficient data mining algorithms. Researchers and practitioners have developed data mining techniques to solve diverse real-world data mining problems. But there is no single source that identifies which techniques solve what problems and how, the advantages and limitations, and real-life use-cases. Lately, identifying data mining techniques and corresponding problems that they solve has drawn significant attention. In this paper, the author describes the progress made in developing data mining techniques and then classify them in terms of data mining problems taxonomy to help assist practitioners in using appropriate data mining techniques that solve business problems. This will allow researchers to expand the body of knowledge in this discipline. This article proposes a data mining problems taxonomy based on data mining techniques being used. Prominent data mining problems include classification, optimization, prediction, partitioning, relationship, pattern matching, recommendation, ranking, sequential patterns and anomaly detection. The data mining techniques that are used to solve these data mining problems in general fall under top 10 data mining algorithms.

Article Preview

Top

Introduction

In this early 21^st century organizations capture all business activities in some computer storage systems. Data are also gathered or used from the systems owned by others. Over a period of time data growth has increased significantly in organizations. Emergence of the Internet, social networking tools (e.g., Twitter, Facebook and LinkedIn), and online shopping sites allows for capturing huge data volume related to business. In 2014 the US government had mandated release of huge volume of privacy-protected Healthcare and Medicare data which could be used by researchers, policy makers, business organizations and general public for analysis and decision making (US Govt. Health and Human Services Office, 2014). With the advent of commodity hardware (to process big data), computer processing power (thanks to Moore’s Law), maturity of computer engineering, software engineering, network bandwidth and increasingly low cost of data storage companies are able to capture, process, transform and store large volume of data. Organizations find business value in the data and have come to rely on with their decision-making process.

Traditional Business Intelligence (BI) tools consists of reports, interactive query and Online Analytical Processing (OLAP) all of which can provide intelligence as to what happened in the past. In the past reporting was based on what happened. These days business would like to understand what is going to happen now and in the future using predictive analytics (for example). With the increase in fraudulent activities there is a desire to detect it immediately – using anomaly detection (credit card fraud). Here data mining techniques and algorithms come into picture and play a prominent role in providing solutions to complex business problems.

Data mining techniques are used to discover previously unknown and valuable interesting patterns and relations in large data sets (Phyu, 2009). Given that the amount of data has been growing in enterprise data warehouses, it is apparent that data-driven decision making will help organizations achieve competitive advantage. However, the challenge of using this data to achieve business success is dependent on efficient data mining methods (Wu et al., 2014) that help in extracting hidden knowledge and translating that into business values.

In this study, the author presents the data mining problems that are aligned with providing solution to business problems. These data mining problems include anomaly detection (Ogwueleka et al., 2011), prediction, classification, pattern recognition (Jain, 2010), sequence discovery, data visualization (Shaw et al., 2001) and recommendation system. The author also presents the data mining techniques used to solve data mining problems. These are Bayesian networks, neural networks, decision trees, association rules, clustering, support vector machines, logistic regression, and k-nearest neighbors.

Last two decades most research was conducted on the theoretical and computational process of data mining and knowledge discovery (Shaw et al., 2001). Now is the time to evaluate those data mining techniques and classify them from data mining problem taxonomy perspective. This will allow users to choose an optimal algorithm to solve a particular data mining problem. This paper discusses data mining problem taxonomy issues by presenting a complete taxonomy of data mining problems from the context of real world applications.

Complete Article List

Search this Journal:

Reset

Volume 11: 1 Issue (2024)

Volume 10: 1 Issue (2023)

Volume 9: 6 Issues (2022): 4 Released, 2 Forthcoming

Volume 8: 4 Issues (2021)

Volume 7: 4 Issues (2020)

Volume 6: 4 Issues (2019)

Volume 5: 4 Issues (2018)

Volume 4: 4 Issues (2017)

Volume 3: 4 Issues (2016)

Volume 2: 4 Issues (2015)

Volume 1: 4 Issues (2014)

View Complete Journal Contents Listing

MLA

APA

Chicago

Export Reference

A Taxonomy of Data Mining Problems

Abstract

Introduction

Complete Article List