Intelligent Big Data Analytics in Health

Intelligent Big Data Analytics in Health

Ebru Aydindag Bayrak (Istanbul University – Cerrahpaşa, Turkey) and Pinar Kirci (Istanbul University – Cerrahpaşa, Turkey)
DOI: 10.4018/978-1-5225-8567-1.ch014
OnDemand PDF Download:
No Current Special Offers


Intelligent big data analytics and machine learning systems have been introduced to explain for the early diagnosis of neurological disorders. A number of scholarly researches about intelligent big data analytics in healthcare and machine learning system used in the healthcare system have been mentioned. The authors have explained the definition of big data, big data samples, and big data analytics. But the main goal is helping researchers or specialists in providing opinion about diagnosing or predicting neurological disorders using intelligent big data analytics and machine learning. Therefore, they focused on the healthcare systems using these innovative ways in particular. The information of platform and tools about big data analytics in healthcare is investigated. Numerous academic studies based on the detection of neurological disorders using both machine learning methods and big data analytics have been reviewed.
Chapter Preview


The concept of big data was first used by Michael Cox and David Ellsworth at Proceedings of the 8th Conference on Visualization held in 1997, entitled “Application Controlled Demand Paging for Out-of-core Visualization”. In the same study, it was mentioned that the datasets were too big and the computer system filled up the memory, disks and even external disks, and this problem was called “Big Data Problem” (Aktan, 2018).

The term big data was used for using larger volumes of scientific data for visualization. Although there are a large number of definitions of big data in the literature, the most popular definition comes from IBM. Big data could be characterized by any or all of three “V” words as suggested by IBM. V means that volume, variety, and velocity (O’Leary, 2013).

2.5 quintillion bytes of data was created by people, that is to say ninety percent of data (%90) has just been created in the last two years. This data is generated social media posts, videos, cell phone GPS signals or sensors. Here it is, this data is called Big Data (IBM, n.d.a.).

According to Gartner Incorporation “Big data is high-volume, high-variety and/or high-velocity information assets that demand cost-effective, innovative forms of information processing that enable enhanced insight, decision making, and process automation” (Gartner IT Glossary, n.d.).

The concept of big data; can be defined as a problem that occurs when traditional database management systems are inadequate when the data is stored, analyzed and managed (Sağıroğlu, 2017).

Big data indicate to growing dataset that involves unstructured, structured and semi-structured data by contrast with traditional data. The term big data was defined using the three main characteristics (3V) by most scientists and experts (Oussous, Benjelloun, Lahcen and Belfkih, 2017).

  • Volume: It means the size of data which is varying from different data unit (terabyte, petabyte). Digital devices and applications (smartphones, IoT, social networks, logs,…) are generate big volumes of digital data. According to the report of International Data Corporation (IDC) the volume of data will increase from 898 exabytes to 6.6 zettabytes between 2012 and 2020. In other words, data will grow more than % 25 per a year.

  • Variety: Big data is a variety of different formats (logs, videos, sensors,…) and sources. So it means the diversity of datasets.

  • Velocity: Data is generated in a fast way that is means speed of data change.

The three components of big data can be summarized as in Figure 1. In addition to the three 3V’s, other dimensions of big data have also been mentioned. These include (Gandomi and Haider, 2015):

  • Veracity: This concept was coined by IBM to represent the uncertainty in some sources of data. We can give example such as customer sentiments in social media that are uncertain and include personal opinion. Even so they are valuable for analyzing information.

  • Variability: It refers to the variation in the data flow rates, was introduced by SAS (Statistical Analysis Software).

  • Value: It is defined by Oracle to define attribute of big data. Clearly it can be explained creating a value to organizations using big data analysis in the decision-making.

When research is done in both academic and business literature Big Data has been identified four key themes to which refers: Information, Technologies, Methods and Impact (De Mauro, Greco & Grimaldi, 2015).

Key Terms in this Chapter

Neurological Disorder: Refers to any disorder on the nervous system. Alzheimer’s disease, Parkinson’s disease, autism, stroke, etc. are some neurological disorders explained in this study.

Big Data Samples: Can be counted as social networks, health records, web logs, mobile phones, academic studies, sensors, and call records that surround us.

Machine Learning: Basically refers to the techniques for extracting useful information from hidden patterns. It can be defined a system consist of many methods that learn and improve from data.

Machine Learning Methods: Can be grouped as supervised learning, unsupervised learning, and reinforcement learning.

Alzheimer’s Disease Neuroimaging Initiative (ADNI): Provides database to researchers about the patient of Alzheimer’s disease. The biomarkers are collected and analyzed for early diagnose and following the progression of Alzheimer’s.

Big Data: Can be described large volume of data that are structural, semi-structural, and non-structural, and it provides valuable information for lots of research areas. It is often characterized by the 3V (volume, variety, and velocity), but it has continued to grow up with other characteristic components.

Big Data Analytics: Can be defined basically using analytics techniques on big data to explore worthful information. The analyzing of big data is pretty important for both prediction of future and decision making in all working areas.

Complete Chapter List

Search this Book: