Abstract
In content-based image retrieval (CBIR), an image retrieval method combining deep learning semantic feature extraction and regularization Softmax is proposed for the “semantic gap” between the underlying visual features and high-level semantic features. First, the deep Boltzmann machine (DBM) and the convolutional neural network (CNN) in the deep learning method are combined to construct a convolution depth Boltzmann machine (C-DBM), which enables it to extract High-order semantic features of images, and robust to image scaling, affine and other transformations. Then, the Dropout regularized Softmax classifier is used to classify the image features. Finally, the image is retrieved according to the sort output. The experimental results show that the proposed method can extract semantic features effectively and has high retrieval accuracy. The classification precision rate in STL-10 image data set reaches 60.3%.
Similar content being viewed by others
References
Alzu Bi A, Amira A, Ramzan N (2015) Semantic content-based image retrieval: a comprehensive study[J]. Journal of Visual Communication & Image Representation 32:20–54
Budiman A, Fanany MI, Basaruddin C (2014) Stacked DenoisingAutoencoder for feature representation learning in pose-based action recognition[C]// Consum Electron
Chan TH, Jia K, Gao S et al (2014) PCANet: a simple deep learning baseline for image classification[J]. IEEE Trans Image Process 24(12):5017–5032
Feng F, Li R, Wang X (2015) Deep correspondence restricted Boltzmann machine for cross-modal retrieval[J]. Neurocomputing 154(1):50–60
Fu R, Li B, Gao Y, et al. (2017) Content-based image retrieval based on CNN and SVM[C]// IEEE International Conference on Computer & Communications 2017.
Guo JM, Prasetyo H (2015) Content-based image retrieval using features extracted from halftoning-based block truncation coding[J]. IEEE Transactions on Image Processing A Publication of the IEEE Signal Processing Society 24(3):1010–1024
Lavrenko V, Manmatha R, Jeon J (2003) A model for learning the semantics of pictures[C]//2003 Advances in Neural Information Processing Systems(NIPS 2003). Vancouver:NIPS foundation 1:2
Liang J, Liu R (2016) Stacked denoisingautoencoder and dropout together to prevent overfitting in deep neural network[C]// International Congress on Image & Signal Processing, IEEE 213–216
Liao B, Xu J, Lv J et al (2015) An image retrieval method for binary images based on DBN and Softmax classifier[J]. IETE Tech Rev 32(4):10
Liu Y, Zhang D, Lu G, Ma WY (2007) A survey of content-based image retrieval with high-level semantics[J]. Pattern Recogn 40(1):262–282
Ma X, Jie G, Wang H (2015) Hyperspectral image classification via contextual deep learning[J]. Eurasip Journal on Image & Video Processing 32(1):20–28
Niu J, Bu X, Li Z, et al. (2014) An improved bilinear deep belief network algorithm for image classification[C]// Tenth international conference on Computational Intelligence & Security
Salakhutdinov R (2012) Multimodal learning with deep Boltzmann machines[J]. J Mach Learn Res 15(8):1967–2006
Smeulders AWM, Worring M, Santini S, Gupta A, Jain R (2000) Content-based image retrieval at the end of the early[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence 22(12):1349–1380
Tang XS, Hao K, Hui W et al (2017) Using line segments to train multi-stream stacked autoencoders for image classification[J]. Pattern Recogn Lett 94:55–61
Vogel J, Schiele B (2007) Semantic modeling of natural scenes for content-based image retrieval[J]. Int J Comput Vis 72(2):133–157
Wei Y, Yang K, Yao H et al (2016) Exploiting the complementary strengths of multi-layer CNN features for image retrieval[J]. Neurocomputing 237:235–241
Xia Z, Wang X, Zhang L et al (2017) A privacy-preserving and copy-deterrence content-based image retrieval scheme in cloud computing[J]. IEEE Transactions on Information Forensics & Security 11(11):2594–2608
Yang J, Yang G (2018) Modified convolutional neural network based on dropout and the stochastic gradient descent optimizer[J]. Algorithms 11(3):28
Yu J, Di H, Wei Z (2017) Unsupervised image segmentation via stacked Denoising auto-encoder and hierarchical patch indexing[J]. Signal Process (143):346–353
Acknowledgments
This work was financially supported byproject of Jilin province science and technology developmentplan,20180623004TC.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Wu, Q. Image retrieval method based on deep learning semantic feature extraction and regularization softmax. Multimed Tools Appl 79, 9419–9433 (2020). https://doi.org/10.1007/s11042-019-7605-5
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-019-7605-5