MRF Model-Based Estimation of Camera Parameters and Detection of Underwater Moving Objects

MRF Model-Based Estimation of Camera Parameters and Detection of Underwater Moving Objects

Susmita Panda, Pradipta Kumar Nanda
DOI: 10.4018/IJCINI.2020100101
Article PDF Download
Open access articles are freely available for download
Add to Your Personal Library: Article


The detection of underwater objects in a video is a challenging problem particularly when both the camera and the objects are in motion. In this article, this problem has been conceived as an incomplete data problem and hence the problem is formulated in expectation maximization (EM) framework. In the E-step, the frame labels are the maximum a posterior (MAP) estimates, which are obtained using simulated annealing (SA) and the iterated conditional mode (ICM) algorithm. In the M-step, the camera model parameters, both intrinsic and extrinsic, are estimated. In case of parameter estimation, the features are extracted at coarse and fine scale. In order to continuously detect the object in different video frames, EM algorithm is repeated for each frame. The performance of the proposed scheme has been compared with other algorithms and the proposed algorithm is found to outperform.
Article Preview

1. Introduction

The problem of underwater video object detection has received considerable attention during last decades and appreciable progress has been made in this direction (Emberton, Chittka, & Cavallaro, 2018; Hossain, Alam, Ali, & Amin, 2016; Mohapatra, Mahapatra, Mahapatra, & Swain, 2015; Walther, Edgington, & Koch, 2004).The underwater moving object suffers from limited range of visibility, low contrast, non-uniform lighting, blurring, bright artefacts, colour diminished and noise (Ancuti, Ancuti, Haber, & Bekaert, 2012; Emberton et al., 2018; Zhang et al., 2017). An automated system for detection and tracking of underwater moving objects has been developed, which of interest to the oceanographic researchers ((Mohapatra et al., 2015). Variable lighting condition and the presence of noise from high contrast debris pose challenge for object detection and tracking. Walther et al. (Walther et al., 2004) have proposed a novel method to overcome the above issues. Negrea et al. (Negrea, Thompson, Juhnke, Fryer, & Loge, 2014) have presented an adaptive background subtraction algorithm for detection and motion prediction which is used for tracking. Design of this fully automated system removes the frames without any activity and hence there is cost reduction for fish monitoring.

This problem of underwater object detection can be of two types. In first case, the object moves while the camera is static, and in second case, both the object and camera are in motion. Often in real world scenario, the second case is more prevalent and challenging than the first one. This is valid in a real-world scenario, where neither the camera model parameters nor the object is known, this motivated us to address the issue in this research work.

In this paper, we have attempted to detect the underwater video objects under varying illumination condition. The problem is formulated as an incomplete data problem and the Expectation and Maximization (EM) approach has been adopted to solve the problem. Our main contributions are: (i) three new Spatio Temporal MRF models for classification of pixel labels in the E step, (ii) new features based model parameter estimation using pipelining approach in the M step, (iii) a continuous Underwater video object detection scheme using EM framework, and (iv) the EM algorithm in Multiresolution framework. In the proposed framework, no a priori knowledge of the camera model parameters is necessary. In E-step of the EM algorithm, the video object is segmented based on the video frame model. The problem of frame label estimation is formulated as a Maximum a posterior (MAP) estimation problem and these MAP estimates are obtained by an algorithm which is a combination of Simulated Annealing (SA) and the Iterated Conditional Model (ICM) algorithm. Subsequently, in M-step, the estimated frame labels are used to estimate the intrinsic and extrinsic parameters of the camera model. The proposed features are extracted from the labelled frames and weighted appropriately before being fed to the pipeline. These weighted corner features are used to estimate the camera intrinsic and extrinsic parameters using the 2D optimization method (Zhou, Cui, Peng, & Wang, 2012). E step and M step are repeated to continuously detect the video objects with the moving camera. The camera calibration errors have been computed and the estimated parameters are chosen based on the minimum calibration error. The segmentation accuracy has been validated by four quantitative measures. The advantage of the proposed multiresolution framework is that the execution time substantially reduced as compared to considering the fine scale images. The performance of the proposed algorithm has been compared with the Stolkin’s E-MRF algorithm (Liu, Dai, Wang, Zheng, & Zheng, 2016; Prabowo, Hudayani, Purwiyanti, Sulistiyanti, & Setyawan, 2017; Rustam Stolkin, Greig, Hodgetts, & Gilby, 2008) algorithm and found to be superior.

Complete Article List

Search this Journal:
Volume 17: 1 Issue (2023)
Volume 16: 1 Issue (2022)
Volume 15: 4 Issues (2021)
Volume 14: 4 Issues (2020)
Volume 13: 4 Issues (2019)
Volume 12: 4 Issues (2018)
Volume 11: 4 Issues (2017)
Volume 10: 4 Issues (2016)
Volume 9: 4 Issues (2015)
Volume 8: 4 Issues (2014)
Volume 7: 4 Issues (2013)
Volume 6: 4 Issues (2012)
Volume 5: 4 Issues (2011)
Volume 4: 4 Issues (2010)
Volume 3: 4 Issues (2009)
Volume 2: 4 Issues (2008)
Volume 1: 4 Issues (2007)
View Complete Journal Contents Listing