Data Visualization: Creating Mind’s Eye

Data Visualization: Creating Mind’s Eye

Ravishankar Palaniappan
DOI: 10.4018/978-1-4666-5864-6.ch013
OnDemand:
(Individual Chapters)
Available
$37.50
No Current Special Offers
TOTAL SAVINGS: $37.50

Abstract

Data visualization has the potential to aid humanity not only in exploring and analyzing large volume datasets but also in identifying and predicting trends and anomalies/outliers in a “simple and consumable” approach. These are vital to good and timely decisions for business advantage. Data Visualization is an active research field, focusing on the different techniques and tools for qualitative exploration in conjunction with quantitative analysis of data. However, an increase in volume, multivariate, frequency, and interrelationships of data will make the data visualization process notoriously difficult. This necessitates “innovative and iterative” display techniques. Either overlooking any dimensions/relationships of data structure or choosing an unfitting visualization method will quickly lead to a humanitarian uninterpretable “junk chart,” which leads to incorrect inferences or conclusions. The purpose of this chapter is to introduce the different phases of data visualization and various techniques which help to connect and empower data to mine insights. It exemplifies on how “data visualization” helps to unravel the important, meaningful, and useful insights including trends and outliers from real world datasets, which might otherwise be unnoticed. The use case in this chapter uses both simulated and real-world datasets to illustrate the effectiveness of data visualization.
Chapter Preview
Top

Introduction

Having we already entered the new world of the ‘Internet of things’ where Facebook being used for long-term customer relationships and Twitter being used for building momentum for new products, it becomes inevitable for every global organization / governments to prep themselves to tame and manage these mountainous, multivariate data for business/social advantage. Today, every global organization is grappling to find ways for “business transformation”, not only to reduce costs and improve the serviceability to their customers but also for their very existence.

Data in the real-world may represent one or more attributes of entities like customers, markets, weather, product feedback, sales or even geospatial information without which 360 degree view of customer or business cannot be examined.

Analyzing and visualizing the connectedness of different attributes and relationships of the “data” is the first & paramount step for business transformation. (Chambers, Cleveland, Tukey, & Kleiner, 1983)

But the reality is that, a cornucopia of electronic data has emerged over decades which are dispersed across the business applications/repositories, leaving every sector and industry with very limited choice for discovery, analysis and visualization of data.

2012 Digital Universe study by International Data Corporation (Gantz & Reinsel, 2012) reveals surprising facts about the growth of the multivariate data in the universe.

  • 1.

    From 2005 to 2020 - the digital universe will grow by a factor of 300 (from 130 exabytes to 40,000 exabytes, or 40 trillion gigabytes). 90% percent of the world data today is created in last 2 years.

  • 2.

    The amount of information in the Digital Universe is doubling every 2 years, currently growing at a rate of more than 7,600 petabytes per day. (see Figure 1)

  • 3.

    By 2020, there will be 7.6 billion people and 200 billion interconnected “things” in internet. These are scanners, ATMs, SMART devices/buildings, security cameras which are empowered to communicate with each other through internet. (Internet of Things).

  • 4.

    Having every industry is enabling their enterprise with sensors and devices for the business effectiveness, machine generated data will account for 40% of the Digital Universe by 2020, up from just 11% in 2005

  • 5.

    But, ONLY less than 1% of the world’s data is being analyzed and put for use. In this, there were secured data which are limited to corporates.

Figure 1.

IDC’s-Digital Universe in 2020 (Gantz & Reinsel, 2012)

978-1-4666-5864-6.ch013.f01

The exponential growth of semi-structured, unstructured and poly-structured data from sensors, logs, web, geospatial, temporal events and customer sentiments texts in social applications like twitter, Facebook, blogs, product reviews etc. aggravate this situation further.

While these statistics show the enormous increase in the data being generated, it brings along the opportunity and challenge of how they can be tamed and analyzed for the betterment of social and business improvement. Currently global corporations, governments and researchers are finding “new & innovative” ways to reap this opportunity to develop “data products” and Big Data analytical solutions for either to reduce operational costs or to improve the customer experience.

In the actual field, every data analyst team (a.k.a. data scientists) sift through the different algorithms not only to acquire and clean the “polluted + raw” datasets from real-world but also to visualize [refine + bin + summarize + filter + smooth] them for effective and timely decisions.

Either overlooking any dimensions/inter-relationships of data structure or choosing an unfitting visualization method will quickly lead to a humanly un-interpretable clutter or “junk chart”, which in turn leads to incorrect inferences or conclusions

Some use cases where Data Visualization playing vital role in Big Data Analytics destine to solve today’s most multifaceted problems of human kind. (see Figure 2)

Complete Chapter List

Search this Book:
Reset