Adaptive Edge-Preserving Smoothing and Detail Enhancement for H.263 and H.264 Video

Adaptive Edge-Preserving Smoothing and Detail Enhancement for H.263 and H.264 Video

Ji-Hye Kim (Sogang University, Korea), Ji Won Lee (Sogang University, Korea), Rae-Hong Park (Sogang University, Korea), Min-Ho Park (PIXTREE, Korea) and Jae-Seob Shin (PIXTREE, Korea)
DOI: 10.4018/978-1-4666-3958-4.ch005
OnDemand PDF Download:
$30.00
List Price: $37.50

Abstract

For removing undesirable artifacts in video coding, a large number of filtering methods have been proposed as post-processing and in-loop processing. This chapter proposes a pre-processing method of motion-adaptive edge-preserving smoothing and detail enhancement for H.263 and H.264 video, in which temporal and spatial edges are used to define Region Of Interest (ROI). In the proposed pre-processing method, trilateral filtering with three types of weights (domain, range, and temporal weights) is used for smoothing non-ROI region while preserving temporal/spatial edges. In the proposed pre-processing method, the temporal weight preserves temporal edges within ROIs and smoothes details within non-ROI. The proposed pre-processing method can make video coding more efficient under the restricted bit-rate condition. The parameter values for weight functions of a trilateral filter are selected depending on the classification of motion and edginess, and proper filtering is performed with adaptive parameter values. Experimental results with a number of H.263 and H.264 test sequences show the effectiveness of the proposed method in terms of the visual quality, the peak signal-to-noise ratio, and the mean opinion score.
Chapter Preview
Top

Introduction

In recent years, rapid increase in the number of cellular customers has led to significant development of mobile network system. Expansion of high data-rate communication in multimedia and data services requires a wide range of bandwidth up to 50−100 Mbps. Applications combined with streaming multimedia services such as real-time conversational voice services, point to point interactive multimedia services including interactive real-time voice, video, and other media (video telephony and white boarding), and video conferencing over wireless have been increasingly becoming a significant part of services in mobile vender market (Etoh & Yoshimura, 2005). In particular, video coding technology is an important part of video streaming services for data transmission and storage.

To ensure video quality on these multimedia services, based on streaming with the encoder such as H.263, H.264, and moving picture experts group-4 (MPEG-4), an effective video enhancement algorithm is required for high-quality images/videos under the low bit-rate and restricted bandwidth. To achieve high quality, it is important to reduce various artifacts such as block artifacts, ringing artifacts, and false-color artifacts. In particular, a video coding algorithm with block-based transform, quantization, and motion prediction produces very annoying block artifacts with low bit-rate and high Quantization Parameter (QP) values (Kim & Jeong, 2007). These artifacts are amplified in multimedia services under the embedded mobile environment such as music video, which contains a large amount of motion and illumination variation between successive frames. Many image/video enhancement approaches have been developed to reduce the artifacts.

Most of deblocking algorithms are post-processing algorithms (Kim et al., 1999; Su et al., 1999; Yu et al., 2004; List et al., 2003). For example, overlapped block motion compensation (Su & Mersereau, 2000) and a post-processing algorithm which applies a smoothing filter according to the amount of detected block artifact (Yu & Zhang, 2004) were proposed. A deblocking algorithm was proposed by adaptively selecting a filter in boundaries (List, Joch, Lainema, Bjontegaard, & Karczewicz, 2003). The post-processing algorithm should be implemented in every decoder device such as mobile phone, thus it is not practical due to high cost and poor efficiency with regard to a large number of decoders. On the other hand, the pre-processing algorithms that are applied to encoder have been rarely studied and they require modifying encoder or parameter values. For example, re-allocation according to Region Of Interest (ROI) for efficient coding was proposed (Chai, Ngan, & Bouzerdoum, 2000).

This chapter proposes a pre-processing algorithm that can be used to improve the performance of video codec, especially focusing on reducing block artifacts without modifying encoder and any information on the encoder. Also to reduce ringing artifacts, many approaches have been proposed such as post-preprocessing based on the visible ringing measure (Oguz, Hu, & Nguyen, 1998) and the method combining deblocking and deringing (Wan, Mrak, Ramzan, & Izquierdo, 2007). Compared with modification or replacement of existing de-blocking filter or post-processing methods, the proposed algorithm has two features to enhance the quality of video. First, the proposed method is independent of codec. Most post-processing algorithms for reducing coding artifacts are dependent on the codec operation and their performances depend on parameter selection of codec. However, our algorithm is dependent only on the characteristics of the input video sequence which is to be compressed by codec and implemented without any modification of codec. Also, according to experimental results, the proposed algorithm with trilateral filtering is more suitable for multimedia streaming service which contains a lot of motion such as music video contents than other existing spatial or temporal filters.

Complete Chapter List

Search this Book:
Reset