Article Preview
Top1. Introduction
Mobile devices with high resolution built-in camera are becoming ubiquitous. It is desirable to support multimedia information retrieval using the images taken from the camera. As examples, a student can use pictures of a plant to search for information of similar species in a remote digital library. Providing this capability calls for efficient techniques to facilitate Content-Based Image Retrieval (CBIR) in mobile environment. In a CBIR system, images are characterized by their low level features such as color, texture, and shape. Since it is difficult to “describe” a query through those features, RF is widely used and plays an important role (Binderberger & Mehrotra, 2004; Chen et al., 2009; Fu et al., 2008; Goh et al., 2004; Gevers & Smeulders, 2004; Hoi et al., 2006, Kim & Chung, 2003). It helps the system to understand the user's intention. In such a system, the user interacts with the system as follows: In each round, he helps by identifying the relevant images within the returned set; the system then utilizes this feedback to modify the query and thus to improve its retrieval results in the next round. This process can be repeated until the user is satisfied with the results.
Lots of information retrieval systems in mobile environment have been introduced in recent years (Bezerra et al., 2005; Jia et al., 2006; Kim et al., 2006; Lee & Jayant, 2006; Shyu et al., 2006; Sonobe et al., 2004; Sarvas et al., 2004; Tollmar et al., 2004). In Bezerra et al. (2005), the query is composed by keywords and visual sub-queries. The keyword part is submitted to an existing search engine and the resulting images are passed to a query refinement agent that processes the visual part of the query in the client side. In Jia et al. (2006) and Kim et al. (2006), visual features are used to search in a large database and the result is returned to the user for relevance feedback. In summary, in those systems, the user needs to interact with the server for result. The mobile device works as an interface. However, this computation model is not suitable for RF because the wireless bandwidth is limited for exchanging images in each round of user RF. The long delay in feedback would affect the usability of the mobile system. Moreover, communication is generally hundreds of times more demanding on mobile power than computation is (Pottie & Kaiser, 2000; Stemm & Katz, 1997). Constantly sending and receiving queries and results can quickly exhaust mobile device's battery power. In practice, power required by CPU is minimal compared to sending data over the wireless radio. For example, the energy cost of transmitting 1Kb message over a distance of 100 meters is approximately 3 joules. By contrast, a general-purpose processor with 100 MIPS/W power could efficiently execute 3 million instructions for the same amount of energy (Pottie & Kaiser, 2000). Also the storage power for mobile devices is always some hundred mW (Zheng et al., 2003), which is 20-30% of the communication power consumption. Therefore, saving the communication power will be more urgent in mobile RF system.