Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

High-Level Languages for Geospatial Analysis of Big Data: Strengths and Weaknesses

Symphorien Monsia, Sami Faiz

Source Title: Interdisciplinary Approaches to Spatial Optimization Issues

DOI: 10.4018/978-1-7998-1954-7.ch004

OnDemand:

(Individual Chapters)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

In recent years, big data has become a major concern for many organizations. An essential component of big data is the spatio-temporal data dimension known as geospatial big data, which designates the application of big data issues to geographic data. One of the major aspects of the (geospatial) big data systems is the data query language (i.e., high-level language) that allows non-technical users to easily interact with these systems. In this chapter, the researchers explore high-level languages focusing in particular on the spatial extensions of Hadoop for geospatial big data queries. Their main objective is to examine three open source and popular implementations of SQL on Hadoop intended for the interrogation of geospatial big data: (1) Pigeon of SpatialHadoop, (2) QLSP of Hadoop-GIS, and (3) ESRI Hive of GIS Tools for Hadoop. Along the same line, the authors present their current research work toward the analysis of geospatial big data.

Chapter Preview

Top

Introduction

Over the last few years, mega-data or big data has become a major concern for many organizations. The term ’Big Data’ refers to data sets that become so large that they become difficult to work with conventional database management systems. These massive data come from several sources among them the Web, sensor networks, satellites, drones, radars, cameras, connected devices (such as smartphones, tablets, etc.), geolocation practices and social networks (such as Twitter, Facebook, Google+, LinkedIn, etc.) online that bring together billions of users.

These phenomena considerably add to the challenges of big data for many organizations and have led to the emergence of Geospatial Big Data, which represents the application of big data issues to geographic data. Geospatial Big Data is therefore an essential component of the larger phenomenon of big data in that geographic data is an important part of the data collected and processed (Lee and Kang 2015). Franklin (1992) estimates that 80% of business data is geographic. An illustrative example is the LP DAAC (Land Processes Distributed Active Archive Center), an archive of terrestrial information originating from space borne sensors aboard NASA (National Aeronautics and Space Administration) satellites, which contains more than 1 petabyte of data and increases every day with new data.

This explosion of geographic data compels the community of researchers and developers of the geospatial domain to store and process them using traditional Big Data frameworks such as Spark (Zaharia et al., 2010), Flink (Carbone et al., 2015), MapReduce (Dean and Ghemawat 2004), Dryad (Isard et al., 2007), Hyracks (Borkar et al., 2011) and Hadoop (White 2015). Although these conventional Big Data systems can handle both geographic and non-geographic data, they display significantly lower performance compared to Geospatial Big Data processing. In fact, the only way to have Geospatial Big Data processed by traditional Big Data platforms is to either treat it as non-spatial data or to write a set of methods or functions as wrappers around existing non-spatial systems. However, doing so does not take any advantage of the properties of spatio-temporal data, which will lead to performance degradation (Eldawy and Mokbel 2016).

As a result, several extensions of traditional Big Data frameworks have emerged in recent years, many of which overcome this limitation by integrating geospatial functionality in a variety of ways among them HadoopGIS (Aji et al., 2013a,b), SpatialHadoop (Eldawy and Mokbel 2015), ESRI GIS Tools for Hadoop (Whitman et al., 2014), STARK (Hagedorn et al., 2017), SpatialSpark (You et al., 2015), GeoTrellis (Kini and Emanuele 2014), Simba (Xie et al., 2016), MD-HBase (Nishimura et al., 2013), GeoSpark (Yu et al., 2015), and GeoMesa (Hughes et al., 2015). In addition, some Geospatial Big Data frameworks are also implemented from-scratch among them BRACE (Wang et al., 2010), SciDB (Stonebraker et al., 2013), RasDaMan (Baumann et al., 1997) and Paradise (DeWitt et al., 1994).

An essential component of these Geospatial Big Data systems that the researchers are particularly interested in is the data query language that provides high-level access to the data in order to free users from any complexity of these systems. This chapter proposes to examine, among the high-level languages proposed in the literature, three open source and popular implementations of SQL on Hadoop intended for the interrogation of Geospatial Big Data: (1) Pigeon of SpatialHadoop, (2) QLSP of Hadoop-GIS and (3) ESRI Hive of GIS Tools for Hadoop. The chapter mainly presents an overview of the contributions and the shortcomings of these query languages. In addition, it presents several possible solutions to overcome the shortcomings mentioned.

The remainder of this chapter is structured as follows. Section 2 briefly describes the MapReduce programming model (including its advantages and disadvantages). Section 3 reviews, among the proposed languages, three open source and popular implementations of SQL on spatial extensions of the Hadoop framework for querying Geospatial Big Data. Section 4 provides a summary of the contributions and limits of the presented languages and briefly describes the authors’ planned research to possibly overcome the challenges raised. Finally, Section 5 lists their conclusion and next steps.

Complete Chapter List

Search this Book:

Reset

MLA

APA

Chicago

Export Reference

High-Level Languages for Geospatial Analysis of Big Data: Strengths and Weaknesses

Abstract

Introduction

Complete Chapter List