Improved Fall Detection Model on GRU Using PoseNet

Improved Fall Detection Model on GRU Using PoseNet

Hee-Yong Kang, Yoon-Kyu Kang, Jongbae Kim
Copyright: © 2022 |Pages: 11
DOI: 10.4018/IJSI.289600
OnDemand:
(Individual Articles)
Available
$37.50
No Current Special Offers
TOTAL SAVINGS: $37.50

Abstract

This paper investigate an improved detection method that estimates the acceleration of the head and shoulder key point position and position change using the skeleton key point information extracted using PoseNet from the image obtained from the low-cost 2D RGB camera, and improves the accuracy of fall judgment. This paper propose a fall detection method based on the post-fall characteristics of the post-fall, the speed of changes in the main point of the human body, and the change in the width and height ratio of the body's bounding box. The public data set was used to extract human skeletal features and train deep learning, GRU, and as a result of experiments, this paper find the following feature extraction methods. High classification accuracy can be achieved, and the proposed method showed a 99.8% fall detection success rate more effectively than the conventional method using raw skeletal data.
Article Preview
Top

A vision-based fall detection system acquires images through video equipment, detects and uses images to detect a fall. Deleting the background from the image (Abobakr et al., 2018; Chen et al., 2020) by extracting and characterizing the human skeleton data from the video, it detects a fall by characterizing the shape or silhouette of a person in the video (Abobakr et al., 2018; Kingma & Ba, 2015). Recently, the method of recognizing human activities based on skeletal data has focused on detecting falls (Lie et al., 2018). Skeleton data is extracted using a 3D depth camera such as Kinect developed by Microsoft or 2D RGB video through CNN-based technology such as PoseNet (Adhikari et al., 2019) In other words, PoseNet can conveniently and quickly extract skeleton data from images captured by a camera. Based on the skeleton information of the human body, it has not only high accuracy, but also simple and inexpensive features (Chen et al., 2020). The extracted skeleton data is spatial-temporal data that changes with time. Recurrent Neural Network (RNN) is a powerful method of processing time-series data classification, but it takes a long time to learn and a part of the result is a problem in which the weight disappears (vanishing gradient problem) or an exploding gradient problem occurs. To solve this problem, the use of LSTM (Long-short term memory) and GRU (gated recurrent unit) was introduced (Chen et al., 2020)(Solbach & Tsotsos, 2017). LSTM does not have the same problem as RNN.

In general, 2D RGB cameras are installed and used by CCTV in hospital rooms and industrial sites of medical facilities. There may be concerns about the protection of some privacy, but it is believed that the camera will not be regulated by the security of privacy regulation because the purpose is to extract the skeleton of the body. GRU has the advantage of short learning time due to its simple structure as a modification of LSTM.

This propose a Fall Detection Method Based on Skeleton Key points Group(FDSG) that classifies and infers human poses from the extracted skeleton data from PoseNet to detects falls pose using artificial intelligence, GRU.

Complete Article List

Search this Journal:
Reset
Volume 12: 1 Issue (2024)
Volume 11: 1 Issue (2023)
Volume 10: 4 Issues (2022): 2 Released, 2 Forthcoming
Volume 9: 4 Issues (2021)
Volume 8: 4 Issues (2020)
Volume 7: 4 Issues (2019)
Volume 6: 4 Issues (2018)
Volume 5: 4 Issues (2017)
Volume 4: 4 Issues (2016)
Volume 3: 4 Issues (2015)
Volume 2: 4 Issues (2014)
Volume 1: 4 Issues (2013)
View Complete Journal Contents Listing