A Scene Recognition Algorithm Based On Multi-Instance Learning

In Bag of Words image presentation model, visual words are generated by unsupervised clustering, which leaves out the spatial relations between words and results in such shorting comings as limited semantic description and weak discrimination. To solve this problem, we propose to substitute visual words by visual phrases in this article. Visual phrases built according to spatial relations between words are semantic distrainable, and they can improve the accuracy of Bag of Words model. Considering the traditional classification method based on Bag of Words model is vulnerable to the background, block and scalar variance of an image, we propose in this article a multiple visual words learning method for image classification, which is based on the concept of visual phrases combined with Multiple Instance Learning. The final classification model is able to show the spatial features of image classes. Experiments performed on standard image testing sets, Caltech 101 and Scene 15, show the satisfying performance of this algorithm.

eISSN:: 1178-5608
Lingua:: Inglese

Frequenza di pubblicazione:: Volume Open
Argomenti della rivista:: Engineering, Introductions and Overviews, other

Feed RSS della rivista

A Scene Recognition Algorithm Based On Multi-Instance Learning

Pubblicato online: 01 dic 2014

Pagine: 1470 - 1492

Ricevuto: 17 mag 2014

Accettato: 12 ott 2014

DOI: https://doi.org/10.21307/ijssis-2017-716

Parole chiaveImage Classification, Multiple Kernel Learning, Bag of Visual Words, Spatial Pyramid Matching

© 2014 Tao Wang et al., published by Sciendo

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Parole chiave
Image Classification, Multiple Kernel Learning, Bag of Visual Words, Spatial Pyramid Matching