Ontological Random Forests for Image Classification

Ontological Random Forests for Image Classification

Ning Xu, Jiangping Wang, Guojun Qi, Thomas S. Huang, Weiyao Lin
Copyright: © 2018 |Pages: 16
DOI: 10.4018/978-1-5225-5204-8.ch031
OnDemand:
(Individual Chapters)
Available
$37.50
No Current Special Offers
TOTAL SAVINGS: $37.50

Abstract

Previous image classification approaches mostly neglect semantics, which has two major limitations. First, categories are simply treated independently while in fact they have semantic overlaps. For example, “sedan” is a specific kind of “car”. Therefore, it's unreasonable to train a classifier to distinguish between “sedan” and “car”. Second, image feature representations used for classifying different categories are the same. However, the human perception system is believed to use different features for different objects. In this paper, we leverage semantic ontologies to solve the aforementioned problems. The authors propose an ontological random forest algorithm where the splitting of decision trees are determined by semantic relations among categories. Then hierarchical features are automatically learned by multiple-instance learning to capture visual dissimilarities at different concept levels. Their approach is tested on two image classification datasets. Experimental results demonstrate that their approach not only outperforms state-of-the-art results but also identifies semantic visual features.
Chapter Preview
Top

1. Introduction

Most existing image classification algorithms treat categories as completely independent both visually and semantically. However, humans are believed to use semantic relations to classify categories (Collin, 2005). For example, it is unreasonable to distinguish “truck” from “vehicle” since “truck” is a kind of “vehicle”. In addition, it is common for humans to use different features to discriminate different objects. For example, “wheel” is a useful feature to distinguish “car” from “animal” while shape differences are more discriminative to distinguish “truck” from “sedan”.

Although having good performance on some easy image classification datasets such as Caltech 101 (Fei-Fei, 2007) and Caltech 256 (Griffin, 2007), the neglect of semantics makes most existing image classification algorithms (Shao, 2014; Wang, 2010; Zhang, 2014) not only have limited results on challenging problems such as fine-grained image classification (Deng, 2009; Welinder, 2010), but also are at odds with the human visual system.

An ontology is a hierarchical structure consisting of categories and high-level relations such as “is-a” and “part-of”. It encodes semantics in a hierarchical way that is very similar to human perception. Therefore it provides a useful tool to incorporate semantics into frameworks of traditional image classification approaches. However, traditional ontology based algorithms (Marszalek, 2007; Tsai, 2010; Xu, 2014) build ontological classifiers which have a classifier at every ontological node to discriminate the node's sub-categories. This simple framework leads to error propagation such that if an image is misclassified at any intermediate node along the path from the root concept to the leave concept, the prediction will be wrong. This issue is serious due to large intra-class variations of super-categories, i.e., it is difficult to train a good classifier for general concepts such as “animal” and “vehicle”. As a result, previous use of ontologies on image classification mainly aims at improving classification speed instead of classification accuracy.

Figure 1.

The frameworks of traditional ontological classifiers (leftmost), traditional decision trees (middle) and our approach (rightmost). Red dashed lines illustrate the classification paths. Bounding boxes with different colors indicate the features used for training node classifiers

978-1-5225-5204-8.ch031.f01

Complete Chapter List

Search this Book:
Reset