Improving Image Retrieval by Clustering

Dany Gebara; Reda Alhajj

doi:10.4018/978-1-60566-174-2.ch002

Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

Improving Image Retrieval by Clustering

Dany Gebara, Reda Alhajj

Source Title: Artificial Intelligence for Maximizing Content Based Image Retrieval

DOI: 10.4018/978-1-60566-174-2.ch002

OnDemand:

(Individual Chapters)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

This chapter presents a novel approach for content-fbased image retrieval and demonstrates its applicability on non-texture images. The process starts by extracting a feature vector for each image; wavelets are employed in the process. Then the images (each represented by its feature vector) are classified into groups by employing a density-based clustering approach, namely OPTICS. This highly improves the querying facility by limiting the search space to a single cluster instead of the whole database. The cluster to be searched is determined by applying on the query image the same clustering process OPTICS. This leads to the closest cluster to the query image, and hence, limits the search to the latter cluster without adding the query image to the cluster, except if such request is explicitly specified. The power of this system is demonstrated on non-texture images from the Corel dataset. The achieved results demonstrate that the classification of images is extremely fast and accurate.

Chapter Preview

Top

Introduction

Since the early 1990’s, there has been considerable research carried out into content-based image retrieval (CBIR) systems. A few systems have been installed commercially, including Query-By-Image-Content (QBIC) (Niblack, Barber, Equitz, Flickner, Glasman, Petkovic, Yanker, Faloutsos, and Taubin, 1993), the VIR Image Engine (Bach, Fuller, Gupta, Hampapur, Gorowitz, Humphrey, Jain, and Shu, 1996), the AltaVista Photofinder, Multimedia Analysis and Retrieval System (MARS) (Huang, Mehrotra, and Ramchandran, 1996), Photobook (Pentland, Picard, and Sclaroff, 1994), Netra (Ma and Manjunath, 1999), RetrievalWare (Dowe, 1993), etc. Actually, the problem of sorting through images to find a particular object of interest is not new. Whether it is paintings in old museum archives, or browsing through the family albums looking for a particular photograph, extracting information from graphic objects has presented many challenges. With the recent advent and growth of the internet, this problem has been taken to a whole new level. Further, as the hardware needed to capture and store images in digital format has become cheaper and more accessible, the number of people and businesses that have started collecting large numbers of images has grown. The first strategy for dealing with such large collections of images was to tag each image with one or more keywords, allowing existing text-based search systems to work with images. This was a great leap forward, but still had limitations; the biggest of which is that someone had to choose and enter keywords for every image. In addition to being a very tedious task, selection of keywords is a very subjective function. Another method was to sort images by type and place them in file folders much like photographs would be placed in albums. This also suffers from similar drawbacks.

In general, images could be classified into two classes, texture and non-texture. Texture images form an important class, where an object within the image is repeated periodically throughout the image. x Some medical images such as X-rays and some topographic images fall under this category. Non-texture images tend to have objects of interest clustered in one or more regions of an image. Figure 1 shows one image from each class.

Figure 1.

Example of Texture and Non-Texture

In order to be able to compare images by content, a feature vector (or representative signature) needs to be calculated for each image. This feature vector is the description of the image to the content-based image retrieval (CBIR) system, which will then conduct its search based on these calculated vectors. Generally, the algorithms used to calculate these feature vectors perform well on some class of images and poorly on others. It therefore follows that a CBIR system should classify an image first, and then use an appropriate algorithm based on the classification.

In terms of querying speed, a faster system is naturally preferred. Hence, if there is a way to avoid scanning the entire database every time a query is submitted, this should result in faster responses to the user. Clustering can be applied to the calculated feature vectors, where the signatures for similar images are grouped as one cluster. When querying, a CBIR system need only to look at a representative for each cluster to narrow the search.

Complete Chapter List

Search this Book:

Reset

MLA

APA

Chicago

Export Reference

Improving Image Retrieval by Clustering

Abstract

Introduction

Complete Chapter List