It is an immutable distributed collection of objects. Each dataset in RDD is divided into logical partitions, which may be computed on different nodes of the cluster.
Published in Chapter:
Insight Into Big Data Analytics: Challenges, Recent Trends, and Future Prospects
Mohd Vasim Ahamad (Aligarh Muslim University, India), Misbahul Haque (Aligarh Muslim University, India), and Mohd Imran (Aligarh Muslim University, India)
Copyright: © 2018
|Pages: 13
DOI: 10.4018/978-1-5225-3870-7.ch005
Abstract
In the present digital era, more data are generated and collected than ever before. But, this huge amount of data is of no use until it is converted into some useful information. This huge amount of data, coming from a number of sources in various data formats and having more complexity, is called big data. To convert the big data into meaningful information, the authors use different analytical approaches. Information extracted, after applying big data analytics methods over big data, can be used in business decision making, fraud detection, healthcare services, education sector, machine learning, extreme personalization, etc. This chapter presents the basics of big data and big data analytics. Big data analysts face many challenges in storing, managing, and analyzing big data. This chapter provides details of challenges in all mentioned dimensions. Furthermore, recent trends of big data analytics and future directions for big data researchers are also described.