Medical Video Summarization using Central Tendency-Based Shot Boundary Detection

Medical Video Summarization using Central Tendency-Based Shot Boundary Detection

G. G. Lakshmi Priya, S. Domnic
Copyright: © 2013 |Pages: 11
DOI: 10.4018/ijcvip.2013010105
OnDemand:
(Individual Articles)
Available
$37.50
No Current Special Offers
TOTAL SAVINGS: $37.50

Abstract

Due to the advancement in multimedia technologies and wide spread usage of internet facilities; there is rapid increase in availability of video data. More specifically, enormous collections of Medical videos are available which has its applications in various aspects like medical imaging, medical diagnostics, training the medical professionals, medical research and education. Due to abundant availability of information in the form of videos, it needs an efficient and automatic technique to manage, analyse, index, access and retrieve the information from the repository. The aim of this paper is to extract good visual content representatives – Summary of keyframes. In order to achieve this, the authors propose a new method for video shot segmentation which in turn leads to extraction of better keyframes as representative for summary. The proposed method is experimented and evaluated using publically available medical videos. As a result, better precision and recall is obtained for shot detection when compared to that of the recent related methods. Evaluation of video summary is done using fidelity measure and compression ratio.
Article Preview
Top

1. Introduction

Recent development in multimedia systems resulted in enormous availability and usage of digital videos. More specifically, in medical fields, the availability of multimedia data (videos) goes on increasing. Some developing countries suffer from lack of facilities in accessing this information and are in situation providing inadequate training to the medical professions. Since medical videos are mainly used for educational purposes, video content is usually recorded or edited using the style formats. In order to make use of this huge amount of data for various medical related applications there is a need for tools that can effectively analyse, categorize, search and retrieve the relevant video material. This paved way for various researches in video summarization.

In general, video summary is a sequence of still (representative frames) or motion pictures (video skimming) that provides concise information about the content of the video. Representative frames are the set of keyframes extracted from the video, whereas video skimming is the summary composed of set of shots. Compared to video skim, keyframe sets are more flexible in terms of browsing and navigation purpose.

Various works have been proposed for video summarization, which are categorized into sample based, cluster based and shot based approaches (Truong & Venkatesh, 2007). In the sampling based approaches, keyframes were extracted by randomly choosing of uniformly sampling from the original video. It is the straightforward and easy way to extract keyframes. It may fail to capture real video contents because of the random extraction of keyframes.

Most of video summarization approaches are based on clustering techniques (Hadi et al., 2006; Mundur et al., 2006; Chen et al., 2009; Herranz & Martinez, 2009; Furini et al., 2010). The basic idea of clustering approaches is to generate summary by grouping similar frames / shots and extracting frames from the groups (usually one frame per group). However, it is important to select the features in order to consider and group the similar frames. There are some techniques that produce summaries of accepted qualities but computationally expensive (Mundur et al., 2006, Furini et al., 2010, Amiri & Fathy, 2010). In Borth et al. (2008), Borth uses shot boundary detection to segment the video into shots and the k-means algorithm to determine cluster representatives for each shot that are used as keyframes. In paper Besiris et al. (2009), the author proposed an automatic video summarization technique based on graph theory methodology and the dominant sets clustering algorithm. Ali Amiri and Mahmood Fathy (2010) proposed a Hierarchical keyframe based video summarization system using QR-decomposition. Based on the dynamicity of the shots the number of keyframes is selected. Avila et al. (2011) presented VSUMM, a methodology for production of video summary, is based on color feature extraction from frames and k-means clustering.

In the shot-based approaches, the video is segmented into separated shots and one or more keyframes extracted from each shot. In most of the literature, shot boundary detection is widely used as first step in generation of video summaries. The shot boundary detection is segmentation of the video into its basic meaningful and manageable element, called a shot. A shot is defined as a sequence of frames captured uninterruptedly by a single camera in time and space. In general, Shot boundaries are classified into abrupt and gradual transitions. If an immediate transition takes place from one shot to another then it is considered as abrupt shot or cut transition. When the shot changes gradually with few intermediate frames, then it is gradual transition which includes fade-in, fade-out, dissolve and wipe.

Complete Article List

Search this Journal:
Reset
Volume 14: 1 Issue (2024): Forthcoming, Available for Pre-Order
Volume 13: 1 Issue (2023)
Volume 12: 4 Issues (2022): 1 Released, 3 Forthcoming
Volume 11: 4 Issues (2021)
Volume 10: 4 Issues (2020)
Volume 9: 4 Issues (2019)
Volume 8: 4 Issues (2018)
Volume 7: 4 Issues (2017)
Volume 6: 2 Issues (2016)
Volume 5: 2 Issues (2015)
Volume 4: 2 Issues (2014)
Volume 3: 4 Issues (2013)
Volume 2: 4 Issues (2012)
Volume 1: 4 Issues (2011)
View Complete Journal Contents Listing