Machine Learning for Big Data

Machine Learning for Big Data

Supriya M. S. (Ramaiah University of Applied Sciences, India) and Keerthana Sasidaran (Ramaiah University of Applied Sciences, India)
Copyright: © 2021 | Pages: 21
DOI: 10.4018/978-1-7998-6673-2.ch004

Abstract

Big data and machine learning currently play an important role in various applications and in research. These approaches are explored in depth in this chapter. The chapter starts with a summary of big data and its implementation in a number of fields, and then deals with the problems that big data presents and the need for other technology to resolve these issues/challenges. Big data can best be used with the aid of the machine learning model, even though they are not directly related. Thus, the paradigms of machine learning that support big data can be combined with big data technology, thus providing insight into a range of big data machine learning approaches and techniques. Although big data cannot rely solely on the few paradigms of machine learning, the underlying problems are addressed. New machine learning algorithms are needed that can explore the full scale of the big data process and enable software engineering firms to come up with better solutions.
Chapter Preview
Top

Introduction And Background

The proper investigative frameworks can tackle complex issues, settle on discerning choices, improve application interface and furthermore produce financially savvy execution. Enormous information is a terminology used for big data indexes having increasingly differed and compound structure through the challenges of setting away, dissecting and foreseeing for additional processes. The process of analysis into huge methods of data to reveal hidden examples and mysterious associations named as big data analytics. These valued data for organizations with the assistance of cumulative extravagant and additional thoughtful little of knowledge and in receipt of a favourable position over the opposition. Thus, usage of big data has to break down and implemented as exactly as might be anticipated underneath the environments. Calculation and stowage, robotics and sensor technology are therefore hefty. It is therefore important to develop intelligence systems that deals with real-time and past data in order to maintain the performance of these technologies. Thus, big data analytics research play a vital role in data mining and data processing (Bendre & Thool, 2016).

The Internet of Things (IoT), maintains gathered data over the internet after various devices along with sensor network is on the growth. RFID tags captures transaction data that perform on products such as goods conveyed via supply chain. Big data also applies in dynamic and bursting knowledge on social media platforms. According to the International Data Corporation (IDC) report, the innovation in big data and administration industry is quickly rising field wherein billions of dollars will be put resources into the not so distant future on the general business. According to IDC director's view, the growth through IT (Information Technology) companies will increase rapidly in means of clients and their offer market. It predicts that the technology like big data and its service industry will rise at a combined annual growth rate of 26.24 per cent, to hit $23.8 billion by 2016, in 2018 it may reach $41.52 billion and up to $48.6 billion in 2019 (Bendre & Thool, 2016).

Big data should be capable of providing certain resources, methods and processes for loading, extracting and improving handling using equal computing capacity for widespread compound operations and surveillance. Designing a big data framework for examining, handling and keeping missing information will bring about a development of particular problems since multifaceted nature of huge data. Mainly, gathering and integrating data with dispersed locations is a difficult task because of the several independent data sources and a vast ability. Hence, there is a need for system that can possibly deal with handling and big data processing and similarly trail the reassurance of the performance properties in terms of accessing speed, scalability, recovery, and privacy. The big data must effectively extract information from a large data-sets at various levels, so that features can be exposed in order to improve decisions taken and obtain more benefits in realistic and non-realistic environments.

The revolutionary steps of big data is characterised into three components (Sagiroglu & Sinanc, 2013):

  • Variety: The classification of big data is categorised into structured, semi-structured and unstructured. The type of inserting the data into a tagged warehouse can be called as structured data that can be easily sorted whereas, the process is random and difficult to analyse in unstructured data. On the other hand, semi-structured has tags that can separate data element though has no fixed fields

  • Volume: At present, the size of the data is usually in terabytes and petabytes. The fabulous scale and ascent of information surpasses customary store and examination methods

  • Velocity: Not only big data uses this component but also other processes. For time fewer processes, as big data enhances that organizational value by streaming

Complete Chapter List

Search this Book:
Reset