[go: up one dir, main page]

Skip to main content
Log in

Image retrieval method based on deep learning semantic feature extraction and regularization softmax

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

In content-based image retrieval (CBIR), an image retrieval method combining deep learning semantic feature extraction and regularization Softmax is proposed for the “semantic gap” between the underlying visual features and high-level semantic features. First, the deep Boltzmann machine (DBM) and the convolutional neural network (CNN) in the deep learning method are combined to construct a convolution depth Boltzmann machine (C-DBM), which enables it to extract High-order semantic features of images, and robust to image scaling, affine and other transformations. Then, the Dropout regularized Softmax classifier is used to classify the image features. Finally, the image is retrieved according to the sort output. The experimental results show that the proposed method can extract semantic features effectively and has high retrieval accuracy. The classification precision rate in STL-10 image data set reaches 60.3%.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

References

  1. Alzu Bi A, Amira A, Ramzan N (2015) Semantic content-based image retrieval: a comprehensive study[J]. Journal of Visual Communication & Image Representation 32:20–54

    Article  Google Scholar 

  2. Budiman A, Fanany MI, Basaruddin C (2014) Stacked DenoisingAutoencoder for feature representation learning in pose-based action recognition[C]// Consum Electron

  3. Chan TH, Jia K, Gao S et al (2014) PCANet: a simple deep learning baseline for image classification[J]. IEEE Trans Image Process 24(12):5017–5032

    Article  MathSciNet  Google Scholar 

  4. Feng F, Li R, Wang X (2015) Deep correspondence restricted Boltzmann machine for cross-modal retrieval[J]. Neurocomputing 154(1):50–60

    Article  Google Scholar 

  5. Fu R, Li B, Gao Y, et al. (2017) Content-based image retrieval based on CNN and SVM[C]// IEEE International Conference on Computer & Communications 2017.

  6. Guo JM, Prasetyo H (2015) Content-based image retrieval using features extracted from halftoning-based block truncation coding[J]. IEEE Transactions on Image Processing A Publication of the IEEE Signal Processing Society 24(3):1010–1024

    Article  MathSciNet  Google Scholar 

  7. Lavrenko V, Manmatha R, Jeon J (2003) A model for learning the semantics of pictures[C]//2003 Advances in Neural Information Processing Systems(NIPS 2003). Vancouver:NIPS foundation 1:2

    Google Scholar 

  8. Liang J, Liu R (2016) Stacked denoisingautoencoder and dropout together to prevent overfitting in deep neural network[C]// International Congress on Image & Signal Processing, IEEE 213–216

  9. Liao B, Xu J, Lv J et al (2015) An image retrieval method for binary images based on DBN and Softmax classifier[J]. IETE Tech Rev 32(4):10

    Article  Google Scholar 

  10. Liu Y, Zhang D, Lu G, Ma WY (2007) A survey of content-based image retrieval with high-level semantics[J]. Pattern Recogn 40(1):262–282

    Article  Google Scholar 

  11. Ma X, Jie G, Wang H (2015) Hyperspectral image classification via contextual deep learning[J]. Eurasip Journal on Image & Video Processing 32(1):20–28

    Article  Google Scholar 

  12. Niu J, Bu X, Li Z, et al. (2014) An improved bilinear deep belief network algorithm for image classification[C]// Tenth international conference on Computational Intelligence & Security

  13. Salakhutdinov R (2012) Multimodal learning with deep Boltzmann machines[J]. J Mach Learn Res 15(8):1967–2006

    MathSciNet  MATH  Google Scholar 

  14. Smeulders AWM, Worring M, Santini S, Gupta A, Jain R (2000) Content-based image retrieval at the end of the early[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence 22(12):1349–1380

    Article  Google Scholar 

  15. Tang XS, Hao K, Hui W et al (2017) Using line segments to train multi-stream stacked autoencoders for image classification[J]. Pattern Recogn Lett 94:55–61

    Article  Google Scholar 

  16. Vogel J, Schiele B (2007) Semantic modeling of natural scenes for content-based image retrieval[J]. Int J Comput Vis 72(2):133–157

    Article  Google Scholar 

  17. Wei Y, Yang K, Yao H et al (2016) Exploiting the complementary strengths of multi-layer CNN features for image retrieval[J]. Neurocomputing 237:235–241

    Google Scholar 

  18. Xia Z, Wang X, Zhang L et al (2017) A privacy-preserving and copy-deterrence content-based image retrieval scheme in cloud computing[J]. IEEE Transactions on Information Forensics & Security 11(11):2594–2608

    Article  Google Scholar 

  19. Yang J, Yang G (2018) Modified convolutional neural network based on dropout and the stochastic gradient descent optimizer[J]. Algorithms 11(3):28

    Article  MathSciNet  Google Scholar 

  20. Yu J, Di H, Wei Z (2017) Unsupervised image segmentation via stacked Denoising auto-encoder and hierarchical patch indexing[J]. Signal Process (143):346–353

Download references

Acknowledgments

This work was financially supported byproject of Jilin province science and technology developmentplan,20180623004TC.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Qinghai Wu.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wu, Q. Image retrieval method based on deep learning semantic feature extraction and regularization softmax. Multimed Tools Appl 79, 9419–9433 (2020). https://doi.org/10.1007/s11042-019-7605-5

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-019-7605-5

Keywords

Navigation