TY - JOUR
T1 - Leveraging features from background and salient regions for automatic image annotation
AU - Sarin, Supheakmungkol
AU - Fahrmair, Michael
AU - Wagner, Matthias
AU - Kameyama, Wataru
PY - 2012
Y1 - 2012
N2 - In this era of information explosion, automating the annotation process of digital images is a crucial step towards efficient and effective management of this increasingly high volume of content. However, this still is a highly challenging task for the research community. One of the main bottlenecks is the lack of integrity and diversity of features. We propose to solve this problem by utilizing 43 image features that cover the holistic content of the image from global to subject, background and scene. In our approach, salient regions and the background are separated without prior knowledge. Each of them together with the whole image are treated independently for feature extraction. Extensive experiments were designed to show the efficiency and the effectiveness of our approach. We chose two publicly available datasets manually annotated with diverse nature of images for our experiments, namely, the Corel5K and ESP Game datasets. We confirm the superior performance of our approach over the use of a single whole image using sign test with p-value < 0.05. Furthermore, our combined feature set gives satisfactory performance compared to recently proposed approaches especially in terms of generalization even with just a simple combination. We also obtain a better performance with the same feature set versus the grid-based approach. More importantly, when using our features with the state-of-the-art technique, our results show higher performance in a variety of standard metrics.
AB - In this era of information explosion, automating the annotation process of digital images is a crucial step towards efficient and effective management of this increasingly high volume of content. However, this still is a highly challenging task for the research community. One of the main bottlenecks is the lack of integrity and diversity of features. We propose to solve this problem by utilizing 43 image features that cover the holistic content of the image from global to subject, background and scene. In our approach, salient regions and the background are separated without prior knowledge. Each of them together with the whole image are treated independently for feature extraction. Extensive experiments were designed to show the efficiency and the effectiveness of our approach. We chose two publicly available datasets manually annotated with diverse nature of images for our experiments, namely, the Corel5K and ESP Game datasets. We confirm the superior performance of our approach over the use of a single whole image using sign test with p-value < 0.05. Furthermore, our combined feature set gives satisfactory performance compared to recently proposed approaches especially in terms of generalization even with just a simple combination. We also obtain a better performance with the same feature set versus the grid-based approach. More importantly, when using our features with the state-of-the-art technique, our results show higher performance in a variety of standard metrics.
KW - Automatic image annotation
KW - Background
KW - Holistic features extraction
KW - K nearest neighbours
KW - Salient regions
UR - http://www.scopus.com/inward/record.url?scp=84871207775&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84871207775&partnerID=8YFLogxK
U2 - 10.2197/ipsjjip.20.250
DO - 10.2197/ipsjjip.20.250
M3 - Article
AN - SCOPUS:84871207775
SN - 0387-5806
VL - 20
SP - 250
EP - 266
JO - Journal of information processing
JF - Journal of information processing
IS - 1
ER -