Mining Big Data and Streams

Mining Big Data and Streams

Hoda Ahmed Abdelhafez (Suez Canal University, Egypt)
DOI: 10.4018/978-1-5225-7598-6.ch008

Abstract

Mining big data is getting a lot of attention currently because businesses need more complex information in order to increase their revenue and gain competitive advantage. Therefore, mining the huge amount of data as well as mining real-time data needs to be done by new data mining techniques/approaches. This chapter will discuss big data volume, variety, and velocity, data mining techniques, and open source tools for handling very large datasets. Moreover, the chapter will focus on two industrial areas telecommunications and healthcare and lessons learned from them.
Chapter Preview
Top

Challenges Of Big Data Systems

Big data has five key elements: Volume, Velocity, Variety, Veracity and value. These 5 V’s are considered challenges of Big Data systems (Yin & Kaynak, 2015; Ishwarappa & Anuradha, 2015; Marr, 2015).

Volume refers to the huge amount of data. Many companies have large archived data in the form of logs but do not have the capacity to manipulate and analyze that data using traditional database technology. Now big data technology can help store and use these datasets in order to gain benefits from them.

Complete Chapter List

Search this Book:
Reset