Spatial Data Mining: A Perspective of Big Data

Spatial Data Mining: A Perspective of Big Data

Shuliang Wang, Hanning Yuan
Copyright: © 2014 |Pages: 21
DOI: 10.4018/ijdwm.2014100103
(Individual Articles)
No Current Special Offers


Big data brings the opportunities and challenges into spatial data mining. In this paper, spatial big data mining is presented under the characteristics of geomatics and big data. First, spatial big data attracts much attention from the academic community, business industry, and administrative governments, for it is playing a primary role in addressing social, economic, and environmental issues of pressing importance. Second, humanity is submerged by spatial big data, such as much garbage, heavy pollution and its difficulties in utilization. Third, the value in spatial big data is dissected. As one of the fundamental resources, it may help people to recognize the world with population instead of sample, along with the potential effectiveness. Finally, knowledge discovery from spatial big data refers to the basic technologies to realize the value of big data, and relocate data assets. And the uncovered knowledge may be further transformed into data intelligences.
Article Preview


Big data is a complex data set with the characteristics such as Volume, Variety, Velocity and Veracity (Meyer-Schoenberger, & Cukier, 2013). It exceeds the capacity of current computing techniques and tools (Wu et al., 2014; Rajaraman, & Ullman, 2011). In big data, spatial data specifically accounts for the vast majority. About 80% of data is associated with the spatial position (Grossner, Goodchild, & Clarke, 2008; Shekhar, Xiong, 2007). Any kind of object or structure may be related to a geo-spatial location, such as buildings, roads, places, bridges, or rivers. Spatial data is the basis of data and source of wisdom for people to understand the real-world through the information world by better addressing social, economic, and environmental issues of pressing importance (Hackeloeer et al., 2014).

Big data is closely associated with various applications, in which spatial data mining is principal. Because of the complex types, intrinsic relationships, and implicit autocorrelation in spatial big data, it is much more difficult to extract the useful patterns from spatial datasets than from conventional numeric and categorical datasets (Vatsavai et al., 2012).

From the Real World to the Information World

Human civilization is a process from understanding the real world to creating the information world. It has gone through the following stages: information being sensed from the world, information assisting in memorizing the world, information being recorded and inherited, exchange and communication by information, and understanding the world once again by information (Li, Wang, & Li, 2013). Initially, human take advantage of stones and shells to count according to the principle of one-to-one. And they keep records by tying knots. Later, picture notes are invented to inherit more accurate memory by using simple graphics along with perceptual cue. When the picture becomes relatively fixed by shape symbols, associating with the words in the language, it produces texts. Texts abstract and generalize the world, promote cultural understanding, and prepare the necessary foundation for the development of science. In order to break through the restrictions that the written symbols depend on artificial transcription, carving or engraving, human use machines after industrial revolution to batch mechanization production, which promotes the efficiency of the cultural transmission. Computer pays attention to high-speed computing, and spins off the software from the hardware, contributing to the dissemination of information “electronically” and “automatically”. Internet focuses on network by interrelating computers, and it breaks local information limitation. Mobile communication unbounds human from the machine by making the machine follow user’s movements. The Internet of Things cares for the application of human-machine interoperation by automatically identifying objects, enabling the information sharing between the human and things. Cloud computing centralizes service by consolidating expertise and optimizing the allocation of resources. Big data aims at population data, and takes good advantage of the entire data, breaking the randomness of the samples (Burstein, Holsapple, 2008; Haining, 2003), and demonstrating on big data center and mobile terminal. The abovementioned information technologies ultimately serve for the understanding and transforming of the real world.

Complete Article List

Search this Journal:
Volume 20: 1 Issue (2024)
Volume 19: 6 Issues (2023)
Volume 18: 4 Issues (2022): 2 Released, 2 Forthcoming
Volume 17: 4 Issues (2021)
Volume 16: 4 Issues (2020)
Volume 15: 4 Issues (2019)
Volume 14: 4 Issues (2018)
Volume 13: 4 Issues (2017)
Volume 12: 4 Issues (2016)
Volume 11: 4 Issues (2015)
Volume 10: 4 Issues (2014)
Volume 9: 4 Issues (2013)
Volume 8: 4 Issues (2012)
Volume 7: 4 Issues (2011)
Volume 6: 4 Issues (2010)
Volume 5: 4 Issues (2009)
Volume 4: 4 Issues (2008)
Volume 3: 4 Issues (2007)
Volume 2: 4 Issues (2006)
Volume 1: 4 Issues (2005)
View Complete Journal Contents Listing