Integration of Data Mining and Business Intelligence in Big Data Analytics: A Research Agenda on Scholarly Publications

Integration of Data Mining and Business Intelligence in Big Data Analytics: A Research Agenda on Scholarly Publications

Atik Kulakli
DOI: 10.4018/978-1-7998-5781-5.ch002
(Individual Chapters)
No Current Special Offers


The purpose of this chapter is to analyze and explore the research studies for scholarly publication trends and patterns related to the integration of data mining in particular business intelligence in big data analytics domains published in the period of 2010-2019. Research patterns explore in highly prestigious sources that have high impact factors and citations counted in the ISI Web of Science Core Collection database (indexes included SCI-Exp and SSCI). Bibliometric analysis methods applied for this study under the research limitations. Research questions formed based on bibliometric principles concentrating fields such as descriptive of publication, author productivity, country-regions distribution, keyword analysis with contribution among researchers, citation analysis, co-citation patterns searched. Findings showed strong relations and patterns on these important research domains. Besides this chapter would useful for researchers to obtain an overview of publication trends on research domains to be concerned for further studies and shows the potential gaps in those fields.
Chapter Preview


Data management becomes the center of the decision-making process; Business Intelligence and Analytics (BI&A) has emerged as accessible (Buhl et al., 2013) research area to be studied in academia and industry. With the movement of Big Data (BD) initiatives, these popular fields become more attractive for researchers (Maté et al., 2015). Big data could be defined as the most unstructured and extremely massive data sets to be analyzed to reveal trends, associations, and patterns, related to human interactions (e.g., Social media data). In comparison, data mining is structured data in the databases to generate new information by mining the relations in data patterns. The data warehousing platforms should be established incorporation level with necessary data marts to analyze the large data sets. Moreover, big data analytics (as proper business decision tool); is a complex process of examining large and various data sets to uncover information in unstructured platforms with a variety of data sources and systems to discover hidden patterns, correlations, customer insights, and decisions as well as behaviors.

Data Mining (DM) with Big Data relations studied by Wu et al. (2014), it has outlined that Big Data concentrates large-volume, multiple sourced, complex, unstructured and autonomous (various) sources from the entire data platforms. In the research, the HACE theorem studied and proposed the Big Data processing model with mining techniques. While Big Data and traditional Business Intelligence discussed in different platforms, researchers studied the skill set and knowledge requirements for Big Data and Business Intelligence (Debortoli, Müller & vom Brocke, 2014) and conducted latent semantic analysis (LSA). According to findings, business knowledge is a crucial indicator of successful development and execution. Followed by Business Intelligence needs bigger than Big Data competencies, whereas Big Data is much more “human-capital-intensive” than Business Intelligence projects. Sun, Sun & Strang (2018) examined to use of Big Data Analytic services (service-oriented architecture-BASOA) to enhance Business Intelligence. Big Data ontology has three-layer analyses, such as descriptive, predictive, and prescriptive.

Since the number of academic publications has been increased and become very complex to follow, the trends would be a critical point for researchers and audiences. The impact of outcomes has resulted in directing research efforts to be concentrated on the most recent development fields in parallel. Scholars conduct different qualitative and quantitative studies to explore, discover, analyze, and organize the information to provide through wider scientific communities. Therefore, bibliometric methods as a usual proven approach provide systematic, transparent, process-based on the statistical measurement of science, and scientific activities (Aria & Cuccurullo, 2017).

This chapter aims to analyze and explore the research studies on big data, data mining in particular business intelligence, and big data analytics domains published in a recent research period (a decade). The study uses the Web of Science Core Collection database as a primary research platform and analysis the data for the period of 2010 to 2019. In this chapter, the author used a science mapping method with open-source Bibliometrix R-package that conducting bibliometric studies on scholarly high ranked sources. The chapter organized based on the IGI Global’s structured template which devoted in seven main sections as Introduction, Background (literature review), Main focus (issues, controversies, problem statement, research methodology, bibliometric study definitions, and data collection strategy), Results and Discussions (analysis, findings, addressing the research questions), Solutions and Recommendations followed by Future research directions (Theoretical implications and research needs, Practical implications, and Limitations and future research), and Conclusion. The study also shows the research patterns and concentration areas in scholarly publications to find out the gap to highlight future research direction needs.

Key Terms in this Chapter

Big Data: Is enormous data set (mostly unstructured data) that may be analyzed computationally to reveal patterns, trends, and associations, especially relating to interactions.

Data Mining: Is the practice of examining large pre-existing (structured) databases in order to generate new information. The data warehousing platforms should be established incorporation level with necessary data marts to analyze the large data sets.

Bibliometric Analysis: Bibliometric is defined as the use of statistical methods to analyze the bibliometric publications data such as peer-reviewed journal articles, books, conference proceedings, periodicals, reviews, reports, and related documents. It has been widely used to present the relations of research domains with quantitative methods.

Business Intelligence: It is a collection of processes, architectures, and technologies that turn raw data into useful knowledge driving productive business behavior. To turn data into actionable knowledge it is a suite of tools and services.

Citation Analysis: Is a way of measuring the relative importance or impact of an author, an article, or a publication by counting the number of times that author, article, or other works have cited publication.

Big Data Analytics: Is the often complicated process of examining large and varied data sets, or big data, to uncover information—such as hidden patterns, unknown correlations, market trends, and customer preferences—that can help organizations make informed business decisions.

Co-Citation Analysis: Provides a forward-looking assessment on document similarity in contrast to Bibliographic Coupling, which is retrospective. The citations a paper receives in the future depend on the evolution of an academic field; thus, co-citation frequencies can still change.

Complete Chapter List

Search this Book: