Knowledge Discovery and Big Data Analytics: Issues, Challenges, and Opportunities

Knowledge Discovery and Big Data Analytics: Issues, Challenges, and Opportunities

Vinoth Kumar Jambulingam (VIT University, India) and V. Santhi (VIT University, India)
Copyright: © 2019 |Pages: 16
DOI: 10.4018/978-1-5225-7501-6.ch011
OnDemand PDF Download:
No Current Special Offers


The era of big data has come with the ability to process massive datasets from heterogeneous sources in real-time. But the conventional analytics can't be able to manage such a large amount of varied data. The main issue that is being asked is how to design a high-performance computing platform to effectively carry out analytics on big data and how to develop a right mining scheme to get useful insights from voluminous big data. Hence this chapter elaborates these challenges with a brief introduction on traditional data analytics followed by mining algorithms that are suitable for emerging big data analytics. Subsequently, other issues and future scope are also presented to enhance capabilities of big data.
Chapter Preview


As the era of data communication and expertise reaches across several fields quickly, most of the information has its origin in digital communication in addition internet nowadays. Lyman, P. and Varian, H. (2002) showed in a study that the new knowledge present in digital devices have crossed already over ninety percent all through the 21st millennium, whereas the scale of that new knowledge was additionally over hundreds of petabytes. In fact, the issues of analyzing the massive information did not rise abruptly, however, are there for many years as it has been that the data creation is felt easier than finding hidden knowledge or useful patterns from that information. Albeit personal computers nowadays are loT more quickly than those in the early 1960’s, the massive size of information is a pitfall to perform research on the data we've got nowadays. As an answer to the issues of analyzing high volume data, Xu, R. & Wunsch, D (2009) proposed some effective techniques like sampling, density-dependent methods, data condensation, grid-dependent methods, divide and conquer, progressive learning, and distributed computing, are being offered. Obviously, these ways are perpetually accustomed to enhance the efficiency of the mechanisms of data analysis method (Lyman, P. et al., 2002).

Complete Chapter List

Search this Book: