UPM Institutional Repository

Effective salience-based fusion model for image retrieval


Citation

Mansourian, Leila (2016) Effective salience-based fusion model for image retrieval. Doctoral thesis, Universiti Putra Malaysia.

Abstract

Recently Bag of Visual Words (BoVW) has shown promising results for image annotation and retrieval tasks. In the traditional BoVW model, all visual words are collected and treated the same, regardless of whether or not they are from an important part or the background of a picture. Traditional Scale Invariant Feature Transform (SIFT) features have no spatial information; therefore, the recognition of diffcult objects requires more attention. The first objective of this thesis was to develop a new BoVW model, the Salient Based Bag of Visual Word (SBBoVW) model, to recognize diffcult objects that previous methods were unable to accurately identify. This new model collects visual words based on their importance and combines several Pyramidal Histogram of visual Words (PHOW) feature vectors from the salient, rectangular part of a picture, as well as from the whole picture, to overcome the above-mentioned problem. After implementation, it was found that this method of feature extraction affects the accuracy of the results, which were more accurate than results obtained using seven other state-of-the-art models. However, the SBBoVW model focused only on gray-scale pictures.Previous research found that integrating color, significantly improved the overall performance of both feature detection and extraction because color is an important characteristic of human vision. Based on the literature, most of the image classification strategies have been developed for gray-based SIFT descriptors. Since color content is ignored, misclassification may occur. The Dominant Color Descriptor (DCD) is the best color descriptor for region color and the focus of improvements because it is a low-dimensional or less expensive descriptor representing colors in images. The DCD uses one to eight colors for each picture, and one to four colors for each region. However, some background colors are not used in the object of an image. Therefore, the second objective of this research was to establish a new Salient Dominant Color Descriptor (SDCD) to estimate the number of colors in a salient region using an easily implemented algorithm. Based on the results, it was found that if the maximum Euclidean color distance (dmax) was set to 20, as suggested by other researchers, more accurate results were obtained.The DCD is both low-dimensional and less expensive for representing image colors compared to the previous BoVW model that concentrated on the Color Scale Invariant Feature Transform (CSIFT), combinations of color SIFTs extracted from different color spaces, and opponent-color SIFTs extracted from opponent color spaces to add color information to a SIFT. Therefore, the final objective of this research was to develop a late fusion model, the SDCD BoVW and SBBoVW model. This model fuses the SDCD BoVW, and SBBoVW models using late fusion from histograms and is a comprehensive model for color object recognition. After implementation, the final proposed model provided more accurate results than the other three state-of-the-art models mentioned here and 19 additional color feature extraction methods.


Download File

[img]
Preview
Text
FSKTM 2016 14 IR.pdf

Download (1MB) | Preview

Additional Metadata

Item Type: Thesis (Doctoral)
Subject: Image processing - Digital techniques
Subject: Computer vision
Subject: Computer graphics
Call Number: FSKTM 2016 14
Chairman Supervisor: Muhamad Taufik Abdullah, PhD
Divisions: Faculty of Computer Science and Information Technology
Depositing User: Mas Norain Hashim
Date Deposited: 21 May 2019 07:45
Last Modified: 21 May 2019 07:45
URI: http://psasir.upm.edu.my/id/eprint/68606
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item