A Distance-Window Approach for the Continuous Processing of Spatial Data Streams

A Distance-Window Approach for the Continuous Processing of Spatial Data Streams

Salman Ahmed Shaikh, Akiyoshi Matono, Kyoung-Sook Kim
DOI: 10.4018/IJMDEM.2020040102
OnDemand:
(Individual Articles)
Available
$37.50
No Current Special Offers
TOTAL SAVINGS: $37.50

Abstract

Real-time and continuous processing of citywide spatial data is an essential requirement of smart cities to guarantee the delivery of basic life necessities to its residents and to maintain law and order. To support real-time continuous processing of data streams, continuous queries (CQs) are used. CQs utilize windows to split the unbounded data streams into finite sets or windows. Existing stream processing engines either support time-based or count-based windows. However, these are not much useful for the spatial streams containing the trajectories of moving objects. Hence, this paper presents a distance-window based approach for the processing of spatial data streams, where the unbounded streams can be split with respect to the trajectory length. Since the window operation involves repeated computation, this work presents two incremental distance-based window approaches to avoid the repetition. A detailed experimental evaluation is presented to prove the effectiveness of the proposed incremental distance-based windows.
Article Preview
Top

Introduction

Smart cities development and management require real-time processing and analysis of citywide data. Many smart city applications require real-time integration, processing and analysis of spatial stream, for-instance, real-time road traffic analysis, people flow analysis, etc. To support real-time continuous processing of data streams, Continuous Queries (CQs) are used. CQs utilize windows to split the unbounded data streams into finite sets or windows. Precisely, stream tuples arrive continuously, however blocking operators, i.e., aggregation operators, join operator, etc. cannot generate results on an infinite stream. To cope with this problem, windows are used by stream processing engines (Apache Spark Foundation, 2017) (Apache Kafka Foundation, 2018). Windows contain a subset of data streams or in other words bounds the infinite data streams into smaller chunks based on specified criteria to enable the processing of stateful operators. There are mainly two types of windows supported by existing stream processing engines: 1) Time-based window: Groups tuples by time. E.g., Events/Tuples generated in last 24 hours, 2) Count-based window: Groups tuples by count. E.g., A window consisting of recent 1 million tuples. Depending upon how the events are assigned to windows, a window can be classified as sliding or tumbling. 1) Sliding window: A window which slides continuously as the new data arrives or as the time passes. A tuple in sliding window may belong to more than one window, 2) Tumbling window: Tuples are grouped in a single window based on time or count. A tuple in the tumbling window belongs to exactly one window. Besides, different stream processing engines (SPEs) have defined different assigners for different application, for instance, session windows, global windows, etc.

Since the spatial streams contain trajectories of moving objects, bounding the trajectories with respect to time and count is not logical and does not make sense. For instance, consider a moving object, where users are interested in finding the number of traffic signals it has encountered in last x meters. This cannot be done with the help either the count-based or time-based window. Thus, users need a window which can keep track of distance of a moving object and can answer such queries.

  • Example 1: Assume a mobile platform m scanning a road segment through 3D scanner and generating a stream of 3D spatial data. As the data stream arrives, the user is interested in detecting the different objects (people, cars, trees, etc.) in it and execute continuous queries on it by bounding the stream for a particular length of trajectory. Let the user would like to know the number of different objects detected in last x distance units. Since the speed of m is not uniform, using the existing window semantics, one can obtain the number of objects detected in last t time units however cannot obtain the number of objects detected in last x distance units. The window capable of bounding or tracking the length of trajectory to answer the continuous queries is called the distance-based window.

Keeping in view the importance of the distance-based window, this paper presents a distance-window based approach for the continuous processing of spatial data streams to perform operations on a fixed trajectory length. Since the window operation involves repeated computation, especially in case of sliding window, this work presents two efficient incremental distance-based window approaches, namely ITL (Incremental Trajectory Length) and IDA (Incremental Distance Array).

This paper is an extended version of our earlier work (Salman et al, 2019). In (Salman et al, 2019), authors presented the semantics of a Naive distance-based window. However, the Naive window implementation is too expensive to be practical, hence more efficient and incremental approaches are desired for it. The main contributions of this work can be summarized as follows:

  • Incremental distance-based window approaches

    • ITL (Incremental Trajectory Length)

    • IDA (Incremental Distance Array)

  • Detailed experimental evaluation to prove the effectiveness of the proposed incremental distance-based window

Complete Article List

Search this Journal:
Reset
Volume 15: 1 Issue (2024)
Volume 14: 1 Issue (2023)
Volume 13: 4 Issues (2022): 1 Released, 3 Forthcoming
Volume 12: 4 Issues (2021)
Volume 11: 4 Issues (2020)
Volume 10: 4 Issues (2019)
Volume 9: 4 Issues (2018)
Volume 8: 4 Issues (2017)
Volume 7: 4 Issues (2016)
Volume 6: 4 Issues (2015)
Volume 5: 4 Issues (2014)
Volume 4: 4 Issues (2013)
Volume 3: 4 Issues (2012)
Volume 2: 4 Issues (2011)
Volume 1: 4 Issues (2010)
View Complete Journal Contents Listing