Requirements Formulation Neglect as a Major Cause of Poor Data Quality in Information Systems

Requirements Formulation Neglect as a Major Cause of Poor Data Quality in Information Systems

Jenghyearn Moon, Kecheng Liu
Copyright: © 2022 |Pages: 20
DOI: 10.4018/IJOCI.304886
OnDemand:
(Individual Articles)
Available
$37.50
No Current Special Offers
TOTAL SAVINGS: $37.50

Abstract

This is a report on how much lack or neglect of requirements formulation affects data quality in information systems. The issue of data quality in information systems is centred around their conceptual data models as they capture all the data perspectives that the systems manage. Investigations on case studies in a way of triangulation were attempted to find out that there are over 30% (near 40%) of unnecessary data replication or redundancy when elicitation of requirements is poor, which is serious enough to impel the success or failure of information systems.
Article Preview
Top

1. Introduction

1.1 Importance of Data Quality Issue in Information Systems

Quality has a link to the notion of zero defects in process modelling (Crosby 1995, Wing 2019), and data quality is no exception to this observation (Nagle 2018). Information systems can be viewed from at least perspectives: the data perspectives and the application process. Of these two, the data perspective is more important as many application domains are data intensive and changes in data influence the application process logic, but not necessarily vice versa.

Data quality problem undermines the usability of data. Data only matters if it is correct and useful (Webster 2014). Research or business based on data with errors do not meet the requirements of good research or business in term of accuracy and consistency (Anderson 2008), and this phenomenon will likely incur biased or wrong conclusions and vulnerability of causing ‘big errors’ brought by big data. The grandiosity of data needs to not only refer to its large volume, complex data structure and fine granularity but also the significance of data quality and usage in data (Firmani 2016, Liu 2016).

Data modelling techniques were developed for the specification of the data perspective in a way of providing concepts for the description of data types and structural relations between them (Frederiks 1997). Although many data modelling techniques have been proposed in the literature (Allemang 2008, Batini 2018, Hingorani 2017, Mancas 2021, Mkhinini 2020, Poblet 2019), the question of how to achieve a good data quality remains yet to be answered. There has been speculation that the lack of a management science for data quality may contribute to low data quality levels (Fox 1994, Redman 1998).In fact, the lack of a management science contributed to data quality issues adversely.

According to HBR study (Nagle 2017), only 3 percent of companies meet data quality standards, 47 percent of newly created data contain at least one critical error and half of the data quality scores are below 57 percent. It is important to realise that data are different from other resources and require different management techniques, thus they need a science, at a minimum, that can provide a solid foundation for addressing data quality issue. Nevertheless, even data science failed to address the issue or guideline of data quality correctly (Cao 2017, Newman 2016, Priestley 2019), since it instead mostly focuses on extracting useful information from datasets. Unlike vast array of literature on software-related requirement engineering within the context of software engineering, literature devoted to data-related conceptual modelling quality is limited regardless of ER model, object-oriented model or whatever (Cherfi 2002).

Complete Article List

Search this Journal:
Reset
Volume 14: 1 Issue (2024): Forthcoming, Available for Pre-Order
Volume 13: 1 Issue (2023)
Volume 12: 4 Issues (2022)
Volume 11: 4 Issues (2021)
Volume 10: 4 Issues (2020)
Volume 9: 4 Issues (2019)
Volume 8: 4 Issues (2018)
Volume 7: 4 Issues (2017)
Volume 6: 4 Issues (2016)
Volume 5: 4 Issues (2015)
Volume 4: 4 Issues (2014)
Volume 3: 4 Issues (2012)
Volume 2: 4 Issues (2011)
Volume 1: 4 Issues (2010)
View Complete Journal Contents Listing