Study and Analysis of Visual Saliency Applications Using Graph Neural Networks


Gayathri Dhara, Ravi Kant Kumar
Copyright: © 2023 | Pages: 24
DOI: 10.4018/978-1-6684-6903-3.ch008

Abstract

GNNs (graph neural networks) are deep learning algorithms that operate on graphs. A graph's unique ability to capture structural relationships among data offers more insight than analyzing data points in isolation. GNNs have numerous applications in different areas, including computer vision. In this chapter, the authors investigate the application of graph neural networks (GNNs) to common computer vision problems, specifically visual saliency, salient object detection, and co-saliency. A thorough overview of the visual saliency problems that have been solved using graph neural networks is provided, and the different research approaches that use GNNs to find saliency and co-saliency between objects are analyzed.

Introduction

Overview of Visual Attention

The human brain is extremely efficient at assembling information about the environment in real time. We constantly collect information about our surroundings through our five senses, but the deeper layers of the brain do not process all the inbound sensory information. Because most arriving sensory information is filtered away, we perceive information with varying levels of attention and involvement; based on external visual stimuli, humans can quickly identify the most interesting points in a scene. Even a highly sophisticated biological brain would find it challenging to positively identify every interesting target in its visual field. The solution used by humans is to break the visual field into smaller regions, each of which is easier to analyze and can be processed separately. This serialization of visual scene analysis is facilitated by visual attention mechanisms. A pixel, object, or person with high visual saliency captures our attention relative to its neighbors; identifying the most salient pixels or regions in an image is, likewise, a critical task in computer vision.

“Visual attention” is a cognitive process that selects relevant information from cluttered visual scenes and filters out the irrelevant. It has two sources: fast, bottom-up, pre-attentive saliency of the retinal input, and slower, top-down processing driven by memory, volition, and the task at hand.

Visual Salience

Visual salience (or visual saliency) is the distinct, subjective perceptual quality that makes some items in the world stand out from their neighbors and immediately grab our attention; it measures how likely human eyes are to fixate on a given area. Humans can determine salient objects (attention centers) more accurately and quickly than any machine. Salient object detection (SOD) is how machines approach this problem.

What Does Salient Object Detection (SOD) Mean?

“A technique used to analyze image surroundings and to extract the impressive parts from the background is termed as saliency detection.” Salient object detection, inspired by the human visual attention mechanism, is how machines tackle the task that visual attention solves for humans. Its significance in computer vision applications stems from its ability to minimize computing complexity (Ahmed et al., 2022).
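As a concrete illustration of how SOD can minimize downstream computation, the saliency map produced by a detector can be thresholded so that later stages process only the salient crop. The sketch below is illustrative, not the chapter's method; the function name and threshold are assumptions.

```python
import numpy as np

def salient_bbox(saliency_map, thresh=0.5):
    """Binarize a saliency map and return the bounding box
    (y_min, x_min, y_max, x_max) of the salient region, so later
    stages can process only that crop instead of the full image."""
    mask = saliency_map >= thresh          # keep only salient pixels
    ys, xs = np.nonzero(mask)
    if ys.size == 0:
        return None                        # nothing salient at this threshold
    return ys.min(), xs.min(), ys.max(), xs.max()
```

A recognizer or segmenter run on the returned crop touches far fewer pixels than one run on the whole frame, which is the complexity saving the paragraph refers to.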

What Does Co-Saliency Detection (Co-SOD) Mean?

Co-salient object detection (Co-SOD) is a recently developed and flourishing branch of SOD. Instead of computing the saliency of a single image, Co-SOD algorithms detect the salient objects common to multiple input images: detecting co-saliency between associated images means finding the salient regions they share. Traditional salient object detection requires only one input image, whereas co-salient detection techniques require a group of images (Zhang et al., 2018a). The main challenge in co-saliency detection is to exploit intra-image and inter-image saliency cues simultaneously, whereas traditional saliency detection considers only intra-image saliency.
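The intra-/inter-image distinction can be sketched in code. The toy function below is an illustrative assumption, not any published Co-SOD algorithm: it weights each image's intra-image saliency map by how similar its pixel features are to the salient content of the other images in the group, so a region scores highly only if it is both salient in its own image and shared across the group.

```python
import numpy as np

def co_saliency(intra_maps, features):
    """Toy co-saliency: fuse each image's intra-image saliency map with
    an inter-image cue derived from the other images in the group.

    intra_maps: list of (H, W) saliency maps with values in [0, 1]
    features:   list of (H, W, D) per-pixel feature maps
    """
    co_maps = []
    for i, (smap, feat) in enumerate(zip(intra_maps, features)):
        # Inter-image cue: mean feature of salient pixels in the OTHER images.
        proto = np.mean([f[intra_maps[j] > 0.5].mean(axis=0)
                         for j, f in enumerate(features) if j != i], axis=0)
        # Cosine similarity of every pixel feature to that shared prototype.
        sim = feat @ proto
        sim = sim / (np.linalg.norm(feat, axis=-1) * np.linalg.norm(proto) + 1e-8)
        inter = (sim - sim.min()) / (sim.max() - sim.min() + 1e-8)
        co_maps.append(smap * inter)  # co-salient = salient AND shared
    return co_maps
```

Real Co-SOD methods learn both cues jointly (for instance with a GNN over region nodes drawn from all images), but the fusion of an intra-image term with an inter-image consistency term follows the same pattern.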

Key Terms in this Chapter

Visual Saliency: The degree to which a specific location or region in an image or video stands out and attracts attention.

Node/Vertex: A representation of an object in a graph.

Co-Saliency Detection: A process of detecting and segmenting common salient objects or regions in multiple images.

Top-Down Saliency: Saliency that is driven by high-level factors such as task demands and prior knowledge.

Attention: A mechanism that allows a neural network to focus on specific parts of an input.

Graph Neural Network (GNN): A type of neural network designed to work with graph data structures, where the nodes and edges in a graph are used as input and output.

Graph Convolutional Network (GCN): A type of GNN that uses convolutional layers to learn features of the nodes in a graph.

Co-Saliency Dataset: A collection of multiple images that share common salient objects or regions, used for training and evaluation of co-saliency models.

Edge: A representation of the relationship between two nodes in a graph.

Co-Saliency Pooling: A technique for aggregating information from multiple images to generate a co-saliency map.

Saliency: A property of visual stimuli that makes them stand out from their surroundings and attract attention.

Co-Saliency: A property of multiple images that makes them share common salient objects or regions.

Co-Saliency Integration: A technique for integrating co-saliency information with other computer vision tasks, such as object recognition and segmentation.

Graph: A data structure that represents objects (nodes) and their relationships (edges).

Message Passing: A process by which information is passed between nodes in a graph.

Saliency Map: A map that represents the degree of saliency of each location or region in an image or video.

Visual Attention: A type of attention that focuses on specific parts of an image or video.

Co-Saliency Map: A map that represents the degree of co-saliency of each location or region in multiple images.

Bottom-Up Saliency: Saliency that is driven by low-level features of the input, such as color, brightness, and orientation.
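Several of the terms above (graph, node, edge, message passing, GCN) can be tied together in a minimal sketch of one graph-convolution layer. This is a generic mean-aggregation layer written in NumPy as an illustrative assumption, not a specific architecture from the chapter; in a saliency setting, the nodes would be image regions, the node features their appearance descriptors, and the edges spatial or visual similarity between regions.

```python
import numpy as np

def gcn_layer(adj, feats, weight):
    """One graph-convolution step: each node averages messages from its
    neighbours (plus itself), then applies a shared linear map and ReLU.

    adj:    (N, N) adjacency matrix; adj[i, j] = 1 if an edge joins i and j
    feats:  (N, D_in) node feature matrix
    weight: (D_out from D_in) learnable weights, random in practice
    """
    a_hat = adj + np.eye(adj.shape[0])        # add self-loops
    deg = a_hat.sum(axis=1, keepdims=True)    # node degrees
    messages = (a_hat / deg) @ feats          # message passing: mean-aggregate
    return np.maximum(messages @ weight, 0.0) # shared linear map + ReLU
```

Stacking a few such layers lets information propagate between nodes several hops apart, which is how GNN-based saliency models spread salient-region evidence across an image (or, for co-saliency, across a group of images).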
