Emerging Trends of Big Data in Cloud Computing

Emerging Trends of Big Data in Cloud Computing

Poonam Nandal, Deepa Bura, Meeta Singh
Copyright: © 2021 |Pages: 18
DOI: 10.4018/978-1-7998-6673-2.ch003
OnDemand:
(Individual Chapters)
Available
$33.75
List Price: $37.50
10% Discount:-$3.75
TOTAL SAVINGS: $3.75

Abstract

In today's world where data is accumulating at an ever-increasing rate, processing of this big data was a necessity rather than a need. This required some tools for processing as well as analysis of the data that could be achieved to obtain some meaningful result or outcome out of it. There are many tools available in market which could be used for processing of big data. But the main focus on this chapter is on Apache Hadoop which could be regarded as an open source software based framework which could be efficiently deployed for processing, storing, analyzing, and to produce meaningful insights from large sets of data. It is always said that if exponential increase of data is processing challenge then Hadoop could be considered as one of the effective solution for processing, managing, analyzing, and storing this big data. Hadoop versions and components are also illustrated in the later section of the paper. This chapter majorly focuses on the technique, methodology, components, and methodologies adopted by Apache Hadoop software framework for big data processing.
Chapter Preview
Top

Introduction

The ceaseless increment in the volume and detail of information caught by associations, for example multimedia, Internet of Things (IoT) and the ascent of web-based social networking which has delivered an overpowering stream of information in either organized or unstructured arrangement. Information creation is happening at are line rate, alluded to here in as large data, and has developed as a broadly perceived pattern. Enormous information is evoking consideration from the Academics, governmental organizations, and industries. Enormous information is portrayed by three aspects: (a) data which is used is huge, (b)data which is in use could not be handled with the existing traditional databases, and (c) frequency of data generation, storage, organization and management. However, this enormous information which is generated by big data is changing the way one use to handle business, science, finance, engineering, healthcare, and in the end, the general public. The main focus of Big Data concepts deals with the information of stockpiling the data and using various data mining innovations which have changed the way of information was held by various organizations Olofson and Eastwood (2011). The frequency at which Big Data is increasing is immense. A noteworthy test for data scientists and enthusiasts is that the frequency of data generation is surpassing the capacity to configure as well as implement Big Data in Cloud Computing platform. Its role is no more restricted for managing workloads only, but also for the analyzing the data stored in it.

Cloud computing is a standout amongst one of the most efficient technologies which are used for administration for big business applications and with time has turned itself into a capable design to perform extensive as well as massive scale complex computation. The benefits of Cloud computing incorporate virtualized resources, security, parallel processing capabilities and data administration and its combination with the Data storage. The benefits of Cloud computing platform also comprise cost reduction as the hardware resources are being virtualized and managed effectively, also this helps in providing provisioning, automated task functioning and various other benefits which imparts to efficient administration and better client access Lu and Yang (2013).

With the increase in computational resources to handle and process big data, the data is not only becoming critical for making business decisions but also more valuable in terms of being more comprehendible to the computer. TeraBytes of data is being generated by social networking sites daily. With the increase in shift from traditionally storing data to using cloud platform and paying as per use. The application we use today makes use of various tools which enhances its capacity to store, apprehend, manage and process data that is generated by each one of us, be it of any form that is structured, unstructured or semi structured. There have been remarkable efforts by researchers, data scientists and enthusiasts in the field of big data managing which has resulted in where we are standing today in terms of processing data. Researches in data handling technologies, data mining platforms, file processing system, artificial intelligence, machine learning all are enhancing the way we treat and manage the data we generate daily.

A portion of the main adopters of big data in cloud computing are clients, who used Hadoop technology for processing and storing large data sets which is exceptionally adaptable and versatile platform provided by the cloud vendors like Amazon AWS, IBM and Microsoft Azure, Liu and Chen (2013). Virtualization is one of the base advances material which is extensively used in cloud. It is known as basically creating a logical layer of abstraction which could be hardware related, computer networks or storing data, which helps in enhancing the efficiency of cloud. Singh et al. (2017) compared various load balancing techniques and gave the advantages and disadvantages of each technique.

Complete Chapter List

Search this Book:
Reset