Improving Live Augmented Reality With Neural Configuration Adaptation

Copyright: © 2024 | Pages: 28
DOI: 10.4018/979-8-3693-0230-9.ch007

Abstract

Instead of relying on remote clouds, today's augmented reality (AR) applications send videos to nearby edge servers for analysis to optimize users' quality of experience (QoE). Many studies have been conducted to help adaptively choose the best video configuration, e.g., resolution and frames per second (fps). However, prior works consider only network bandwidth and ignore the video content itself. In this chapter, the authors design Cuttlefish, a system that generates video configuration decisions using reinforcement learning (RL) based on the network condition as well as the video content. Cuttlefish does not rely on any pre-programmed models or specific assumptions about the environment. Instead, it learns to make configuration decisions solely by observing the resulting performance of its historical decisions. Cuttlefish automatically learns an adaptive configuration policy for diverse AR video streams and obtains a gratifying QoE. The experimental results show that Cuttlefish achieves an 18.4%-25.8% higher QoE than prior designs.

1. Introduction

Augmented reality (AR) is a technology that overlays virtual objects on the real world. With the increasing demand for intelligent mobile devices, AR is becoming more popular among users with diverse requirements. According to Azuma et al. (2001), an AR system should have the following attributes: the ability to combine real and virtual objects in a real environment, to geometrically align virtual objects with real ones in the real world, and to run interactively and in real time. AR technology has been applied to a wide range of fields, including tourism, entertainment, marketing, surgery, logistics, manufacturing, and maintenance (Westerfield et al., 2015; Akçayır & Akçayır, 2017). One report forecast that shipments of AR/VR devices would reach 99 million in 2021 (Virtual Reality and Augmented Reality Device Sales to Hit 99 Million Devices in 2021, 2017), with the market reaching 108 billion dollars by then (The reality of VR/AR growth, 2017). Existing mobile AR systems, such as ARKit, Microsoft HoloLens (Microsoft HoloLens, 2020), and the announced Magic Leap One (Magic Leap One, 2020), facilitate interaction between humans and the virtual world.

With the emergence of mobile edge computing (MEC) (Shi et al., 2016; Satyanarayanan, 2017; Roman et al., 2018), object detection in AR applications has shifted from remote clouds to edge servers, benefiting from reduced latency and increased reliability. In this approach, the AR device encodes and uploads the video to the edge server for detection and rendering, then downloads the processed video. State-of-the-art object detection algorithms, such as YOLO (Redmon et al., 2016; Redmon & Farhadi, 2017; Farhadi & Redmon, 2018), are deployed on the edge; these adopt a single-stage detector strategy that regresses bounding-box coordinates and the corresponding class probabilities in one pass.
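
To make this pipeline concrete, below is a minimal, self-contained Python sketch of the encode-upload-detect loop. The EdgeServer class, its detect method, and the byte-concatenation "encoding" are hypothetical stand-ins for illustration, not the chapter's actual system.

from dataclasses import dataclass
from typing import List, Tuple

@dataclass
class Detection:
    box: Tuple[float, float, float, float]  # (x, y, w, h), normalized to [0, 1]
    label: str
    score: float

class EdgeServer:
    """Hypothetical stand-in for an edge server running a single-stage detector."""
    def detect(self, encoded_slot: bytes) -> List[Detection]:
        # A real server would decode the slot and run, e.g., YOLOv3, which
        # regresses box coordinates and class probabilities in a single pass.
        return [Detection((0.4, 0.4, 0.2, 0.2), "person", 0.93)]

def offload_slot(frames: List[bytes], server: EdgeServer) -> List[Detection]:
    encoded = b"".join(frames)      # stand-in for real video encoding (e.g., H.264)
    return server.detect(encoded)   # upload, detect on the edge, download results

print(offload_slot([b"frame0", b"frame1"], EdgeServer()))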

Current AR systems are not equipped to handle the performance gap caused by several factors. First, network throughput fluctuates over time, causing inconsistent performance. Second, quality of experience (QoE) requirements, such as detection accuracy, detection latency, and video playback fluency, often conflict with one another. Finally, the time-varying moving velocities of target objects pose a challenge. To illustrate the impact of AR video configuration on user QoE, we take fps and resolution selection as an example. We divide the total time of interest into multiple slots of equal length and define fps as the number of frames per slot. Higher-resolution images, which YOLOv3 divides into more grid cells, improve detection accuracy but cause longer transmission delays. Similarly, videos encoded at a high fps yield better fluency but incur larger uploading and detection delays. Encoding videos with an excessive configuration may deteriorate QoE and degrade network status, while assigning a poor configuration underutilizes the network and hurts QoE as well. The moving trends of objects, in terms of velocity and direction, are also unknown in advance, which presents additional challenges: high-speed objects require a high fps to guarantee fluency, but a much lower fps suffices if the objects are almost static. Thus, the video configuration must match both the time-varying network bandwidth and the moving velocities of objects in the videos, as the sketch below illustrates. These challenges are described in greater detail in the next section.
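
The tension among these metrics can be made concrete with a toy per-slot QoE model. Everything below, including the linear delay model, the weights, and the target fps, is an illustrative assumption rather than the chapter's actual formulation.

def upload_delay_s(width: int, height: int, fps: int,
                   bandwidth_bps: float, bits_per_pixel: float = 0.1) -> float:
    # Transmission delay for one slot: encoded bits / current throughput.
    return width * height * fps * bits_per_pixel / bandwidth_bps

def qoe(accuracy: float, delay_s: float, fps: int, target_fps: int = 30,
        w_acc: float = 1.0, w_delay: float = 0.5, w_flu: float = 0.5) -> float:
    fluency = min(fps / target_fps, 1.0)  # higher fps -> smoother playback
    return w_acc * accuracy - w_delay * delay_s + w_flu * fluency

# A 720p/30fps slot vs. a 480p/15fps slot over a 20 Mbps link:
print(qoe(0.90, upload_delay_s(1280, 720, 30, 20e6), 30))
print(qoe(0.80, upload_delay_s(640, 480, 15, 20e6), 15))

Under a wide link the high configuration wins on accuracy and fluency; shrink bandwidth_bps and the delay penalty flips the ranking, which is exactly the adaptation problem the chapter targets.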

We propose a novel approach to adaptive configuration of AR video that does not rely on detailed analytical performance modeling but instead embraces learning-based inference. Our approach is inspired by recent successes of deep reinforcement learning (DRL) (Mnih et al., 2015, 2016; Henderson et al., 2018) in diverse fields such as the game of Go (Silver et al., 2017), video streaming (Mao et al., 2017), and job scheduling (Mao, Schwarzkopf, et al., 2019). To this end, we introduce Cuttlefish, an intelligent encoder that employs a learning-based approach to select the optimal video configuration without relying on any pre-programmed models or specific assumptions.
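
As a rough illustration of learning a policy purely from observed outcomes, the toy tabular sketch below picks a (width, height, fps) configuration each slot and nudges its value estimate toward the measured QoE. Cuttlefish itself uses deep RL over a richer state (network condition plus video content); the discretized state, configuration list, and constants here are all assumptions made for this sketch.

import random
from collections import defaultdict

CONFIGS = [(640, 480, 15), (1280, 720, 30), (1920, 1080, 30)]  # (w, h, fps)

q = defaultdict(float)      # value estimate per (state, action)
alpha, epsilon = 0.1, 0.2   # learning rate, exploration rate

def choose(state):
    # Epsilon-greedy: explore occasionally, otherwise exploit the
    # best-known configuration for this state.
    if random.random() < epsilon:
        return random.randrange(len(CONFIGS))
    return max(range(len(CONFIGS)), key=lambda a: q[(state, a)])

def update(state, action, reward):
    # One-step update toward the QoE observed for this slot.
    q[(state, action)] += alpha * (reward - q[(state, action)])

# Per slot: discretize (bandwidth, object motion) into a state, pick a
# configuration, stream the slot, then feed back the measured QoE.
state = ("bw_high", "motion_low")
a = choose(state)
update(state, a, reward=0.8)  # 0.8 = hypothetical measured QoE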
