Human Action Recognition Based on YOLOv7


DOI: 10.4018/979-8-3693-1738-9.ch006

Abstract

Human action recognition is a fundamental research problem in computer vision, and its accuracy matters for many applications. In this book chapter, the authors use a YOLOv7-based model for human action recognition. To evaluate the model's performance, the action recognition results of YOLOv7 are compared with those of CNN+LSTM, YOLOv5, and YOLOv4. Furthermore, a small human action dataset suitable for YOLO model training is designed; it is composed of images extracted from the KTH, Weizmann, and MSR datasets. The authors use this dataset to verify the experimental results. The final results show that, compared with previous YOLO models, the YOLOv7 model is convenient and effective for human action recognition.

Introduction

Surveillance videos usually contain a series of actions (Yan, 2019). Recognizing the actions in these videos can provide substantial benefits, such as detecting a person who has fallen in time to assist them and avoid follow-up complications from the fall. Therefore, it is necessary to evaluate or analyse human actions in videos. Human action recognition generally refers to judging or analysing the classes of human actions in videos (Soomro et al., 2014); concisely, it is the task of correctly classifying human actions into known action classes.

Recognizing these actions manually, however, brings a huge workload, so a rapid and efficient action recognition method becomes very important. Relevant methods in deep learning can meet these requirements and solve this problem. As a machine learning approach, deep learning has been widely employed since it was proposed (Yan, 2021). Its purpose is to train computers to analyse and identify specific data (Gao et al., 2021).

Human action recognition has long been a topic of interest in the research community. In the past, a substantial body of work on human action recognition utilized traditional machine-learning techniques, such as the extraction of visual features or motion trajectories. Now, deep learning methods are more widely used. Deep learning is prevalent not only in computer vision but also in other fields, including Natural Language Processing (NLP) (Wiriyathammabhum et al., 2016). As more researchers apply deep learning methods to action recognition, recognition efficiency improves over time. To date, researchers have proposed several recognition algorithms, including CNN (Khan et al., 2020), Two-Stream networks (Simonyan et al., 2014), C3D (Convolution 3 Dimension) (Tran et al., 2015), and RNN (Du et al., 2017).

Similar to a Convolutional Neural Network (CNN), the You Only Look Once (YOLO) model has an input layer, convolutional layers, pooling layers, and fully connected layers. The study by Redmon et al. (2016) establishes this as a complete CNN architecture. However, YOLO differs clearly from a conventional CNN classifier: it uses a single CNN to achieve end-to-end object detection, predicting bounding boxes and class scores in one pass rather than through a multi-stage pipeline. This design improves the computational efficiency of the YOLO model and is one of the reasons why this study selects the YOLOv7 model for human action recognition.
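To make the single-pass idea concrete, the following is a minimal sketch of YOLO-style post-processing in plain Python: each raw prediction carries box coordinates, an objectness score, and per-class scores, and a detection is kept when objectness times the best class score clears a confidence threshold. The action names, threshold value, and prediction layout here are illustrative assumptions, not details taken from the chapter or the YOLOv7 implementation.

```python
# Illustrative sketch of YOLO-style post-processing (not the chapter's code).
# Each raw prediction: (x, y, w, h, objectness, class_score_0, class_score_1, ...).

ACTIONS = ["walking", "running", "waving"]  # hypothetical action classes

def decode_predictions(raw_preds, conf_thresh=0.25):
    """Keep predictions whose objectness * best class score clears the threshold."""
    detections = []
    for x, y, w, h, obj, *scores in raw_preds:
        best_idx = max(range(len(scores)), key=lambda i: scores[i])
        confidence = obj * scores[best_idx]
        if confidence >= conf_thresh:
            detections.append((ACTIONS[best_idx], confidence, (x, y, w, h)))
    return detections

# Two example predictions: one confident "running" box, one low-objectness box.
preds = [
    (0.5, 0.5, 0.2, 0.6, 0.9, 0.1, 0.8, 0.1),  # kept: 0.9 * 0.8 = 0.72
    (0.3, 0.4, 0.1, 0.3, 0.1, 0.5, 0.3, 0.2),  # dropped: 0.1 * 0.5 = 0.05
]
print(decode_predictions(preds))
```

A real YOLOv7 head produces such predictions for every grid cell and anchor in one forward pass, followed by non-maximum suppression; the point of the sketch is only that detection and classification are decoded together from a single network output.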

This study employs the YOLOv7 framework to construct a network for human action recognition. The YOLO algorithm, which stands for “You Only Look Once,” is a convolutional-neural-network-based object detection method first introduced by Redmon et al. in 2016. One of its key benefits lies in its simplicity and efficiency, which allow for fast execution. According to Cao et al. (2023), the YOLOv7 model exhibits notable improvements in both running speed and structure. This research focuses on fundamental human actions and contributes to the existing body of knowledge by evaluating whether the YOLOv7 model is effective for human action recognition.
