Quality and Effectiveness of ERP Software: Data Mining Perspective

Quality and Effectiveness of ERP Software: Data Mining Perspective

Stephen Makau Mutua (Meru University of Science and Technology, Kenya) and Raphael Angulu (Masinde Muliro University of Science and Technology, Kenya)
DOI: 10.4018/978-1-5225-7678-5.ch002
OnDemand PDF Download:
No Current Special Offers


Over time, the adoption of ERP systems has been wide across many small, medium, and large organizations. An ERP system is supposed to inform the strategic decision making of the organization; therefore, the information drawn from the ERP system is as important as the data stored in it. Poor data quality affects the quality information in it. Data mining is used to discover trends and patterns of an organization. This chapter looks into the way of integrating these data mining into an ERP system. This is conceptualized in three crucial views namely the outer, inner, and the knowledge discovery view. The outer view comprises of the collection of various entry points, the inner view contains the data repository, and the knowledge discovery view offers the data mining component. Since the focus is data mining, the two strategies of supervised and unsupervised are discussed. The chapter then concludes by presenting the probable problems within which each of these two strategies (classification and clustering) can be put into place within the mining process of an ERP system.
Chapter Preview


At the end of this chapter, the reader is expected to;

  • 1.

    Explain the value and place of data in an ERP system

  • 2.

    Describe the metrics of data quality desirable in an ERP system

  • 3.

    Explain the importance of data quality in ensuring the quality and effectiveness of an ERP System

  • 4.

    Understand the various ways in which data mining can be integrated into an ERP System

  • 5.

    Understand the different approaches to extract information from the data collected in an ERP system



Enterprise Resource Planning (ERP) is a software which an organization uses to integrate all its data and processes into one single system. This has several implications. Since all the data is centralized, it is much easier to draw data insights from multiple business processes at once. Similarly, new mechanisms are required to analyze and understand the data in order to draw some intelligence from them. Given their myriad advantages, ERP adoption rates have been constantly on the increase since their inception. Even though they were initially viewed as majorly applicable in only large organizations; the narrative has since changed and their adoption has been witnessed in both small and medium organizations including learning institutions, non-governmental organizations and even health facilities. This in effect has proliferated the demand for ERP systems across the various domains in which they are expected to operate while meeting their anticipated expectations. Whereas it is greatly acceptable and desirable for an organization to streamline information flow and control, reduce labor and operations costs, and enhance efficiency; the success of an ERP system is greatly reliant on the quality of its data and its interaction with the various organization’s data points.

Like any other software system, an ERP will automatically fall victim to the computer adage of “Garbage in Garbage Out”. This is so because, in its design and implementation, an ERP software is typically an integrated collection of applications that collect, process, store, manage and interpret data from multiple points spread across the organization. This data is centrally stored in a database or a repository from which each of the business applications draws its lifeline. Consequently, in its salient nature, the effectiveness of an ERP system can greatly be measured by the collective efficiency of individual applications which in turn rely on the quality of data collected.

This chapter discusses in detail the value of data quality in an ERP system and the various metrics that measure it. The importance of integrating data mining approaches into the design and implementation of an ERP system are then discussed. To further elaborate the integration, a number of these specific approaches that can be amalgamated into the system are discussed in detail.

Key Terms in this Chapter

Data Mining: Refers to the process of extracting patterns, trends, and knowledge from a pool of an organization’s data using algorithms.

Classification: A data mining category of data mining challenges that seek to group data into already known sets (classes); hence, the training of the algorithms is considered to have been supervised before the actual task is executed.

Effectiveness: The extent to which a system functions as intended offering the expected results.

Enterprise Resource Planning (ERP): An integrated software system comprising of all the organization’s core processes and backed up by an appropriate information and communication technologies (ICT).

Business Intelligence: A term used to refer to collective technologies, infrastructure, algorithms and visualization techniques that are used in collecting, organizing, and storing data, knowledge extraction, and presentation of information for business strategic decision-making process.

Data Quality: Refers to the feature that data can be relied upon for accurate decision-making process, planning, and projections.

Clustering: A closely related term to classification. However, unlike classification whose probable data sets are known prior to the actual execution, clustering is blind and learns from the provided data sets without any knowledge; hence, training is unsupervised.

Complete Chapter List

Search this Book: