Future Expectations About Big Data Analytics

Future Expectations About Big Data Analytics

Copyright: © 2024 |Pages: 15
DOI: 10.4018/979-8-3693-0413-6.ch019
OnDemand:
(Individual Chapters)
Available
$37.50
No Current Special Offers
TOTAL SAVINGS: $37.50

Abstract

There is no future without big data as users, applications are continuously increasing and processed information expected to use in Healthcare, e-Commerce, Aviation, Education, etc. One can say it is just the beginning of big-data-era (BDE) computing. Everyone expects that ‘ready to use' data must be available instantly and hence new techniques and algorithms must be developed to store handle and find the required ‘relevant' data. However, to predict and find the relevant data many researchers suggested taking care about handling the ‘context'. Many people process big datasets with missing contexts, and considering the datasets or attributes as per their convenience. In future it is also expected to focus on data quality because many applications will use this ready to process data as an input. Decision making is possible with recent machine learning applications and selection of data. It involves carefully selecting the data and removing incomplete, invalid, inaccurate data. Therefore, it is suggested to get ready for the future of big data with innovative ways, techniques, and algorithms.
Chapter Preview
Top

1. Introduction

In many applications big data can be integrated from a variety of sources and termed heterogeneous data (Heterogeneous big data). However, in the future, users ‘u’, and devices ‘d’ will increase which generates heterogeneous data (text, audio, images, video etc.) that requires handling & retrieving important information. Future expectation from big-data processing applications requires developing applications based on speedy algorithms representing this heterogeneous data in aggregate form. However, a centralized coordinator will be necessary if the system involves multiple users, multiple systems etc. In the future, proper coordination is the key challenge in handling and processing the big-data. In the future focus should be on developing applications to handle ‘contextual data’. These applications can be termed as Next-Generation-big-data (NGBD) processing contextual data. Applications of context data are traffic handling, weather forecasting etc. Therefore processing historical as well as current data requires developing NGBD applications. We strongly believe these types of systems will serve the next generation of businesses.

If observing the trend of digitization, one can conclude that ‘n’ users are increasing and the simultaneous increase in the ‘n’ users added large volume of data in the system. However, this large volume of data is heterogeneous because it is integrated from different sources. Transformation of the data in unique representation is necessary for analysis purposes. Big data analytics helps users fulfilling their daily business needs, identify the customer segment and forecast business strategies. Traditional learning does not support in-depth learning whereas recent machine learning (ML), deep learning (DL) uses supervised & unsupervised learning techniques. Though the datasets are large & consist of ‘n’ records/values using classification methods one can classify the features of the datasets. Among those records/values, one can select a few records as a representative value instead of considering all the records from the given dataset which is known as sampling.

To store this large volume of data on a bigger platform ‘cloud’ service is necessary and one can go for open source cloud service providers. Though, this big data resides on the cloud there could be some risks associated with the storage. These risks can be mitigated using the various applications such as Cassandra service if there is any issue with the data and safe location can be determined by querying the measurable property such as write/read errors with the disks. A future expectation about big data gives more suggestions like:

  • User must use the data more efficiently & effectively

  • To process the large volume of data more sophisticated algorithms are required

  • Along with these parameters there is a more requirement of more intelligent and fast machines.

Key Terms in this Chapter

Homogeneous Data: Data integrated in same type or format.

Regression: It is a method used in machine learning to predict the relationship between independent and dependent variables.

Heterogeneous Data: Data from various sources and may be in different formats.

Big Data: It refers to large volume of data and has characteristics such as volume, variety and veracity etc.

Contextual Data: Data refers to specific events, thing or related to any person.

Classification: It is a supervised method used in machine learning to classify the given input data.

Pattern Mining: Knowing the trends during analysis of the data used for future predictions.

Complete Chapter List

Search this Book:
Reset