Developing a feature decoder network with low-to-high hierarchies to improve edge detection

Tao Fang ORCID: orcid.org/0000-0001-8411-4387¹,
Mingqi Zhang¹,
Yingle Fan¹,
Wei Wu¹,
Haitao Gan¹ &
…
Qingshan She¹

340 Accesses
8 Citations
Explore all metrics

Abstract

Low-to-high hierarchical convolutional features can significantly improve edge detection. This paper proposes a feature decoder-based algorithm that employs a Feature Decoder Network (FDN) to extract more information within limited Convolutional Neural Network (CNN) features. Previous studies applied convolutional elements by weight fusion, but we measure a feature decoder as a pyramid by qualifying convolutional layers. The feature decoder fuses CNN features of adjacent layers to judge the edge and non-edge pixels, which can learn the relationship and distinction between low-level edge hierarchies and high-level semantic hierarchies. Furthermore, we use Gaussian blur labels to train the network to optimize network convergence and training. From the experimental results, our proposed algorithm performs better on the BSDS500 (average accuracy (AP) of 0.865) and NYUD (OIS F-measure of 0.775) datasets compared to the state-of-the-art algorithms, including RCF.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Feature enhancement: predict more detailed and crisper edges

Article 21 April 2021

An exclusive U-net for fine and crisp edge detection

Article 07 December 2023

Multi-scale Feature Decode and Fuse Model with CRF Layer for Boundary Detection

References

Arbelaez P, Maire M, Fowlkes C, Malik J (2011) Contour detection and hierarchical image segmentation. IEEE Trans Pattern Anal Mach Intell 33(5):898–916
Article Google Scholar
Arbelaez P, Pont-Tuset J, Barron J, Marques F, Malik J (2014) Multiscale combinatorial grouping. In: Proc IEEE Conf Comput Vis Pattern Recognit pp. 328–335
Bertasius G, Shi J, Torresani L (2015) DeepEdge: a multiscale bifurcated deep network for top-down contour detection. In: Proc IEEE Conf Comput Vis Pattern Recognit pp. 4380–4389
Bertasius G, Shi J, Torresani L (2015) High-for-low and lowfor-high: efficient boundary detection from deep object features and its applications to high-level vision. In: Proc IEEE Int Conf Comput Vis pp. 504–512
Canny J (1987) A computational approach to edge detection. In: Readings Comput Vis pp. 184–203
Chen L, Papandreou G, Kokkinos I, Murphy K, Yuille A (2017) Deeplab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. CoRR IEEE Trans Pattern Anal Mach Intell 40(4):834–848
Article Google Scholar
Cheng M, Liu Y, Hou Q, Bian J, Torr P, Hu S, Tu Z (2016) HFS: hierarchical feature selection for efficient image segmentation. In: Proc IEEE Conf Comput Vis Pattern Recognit pp. 867c882
Choi Y, Choi M, Kim M, Ha J, Kim S, Choo J (2018) Stargan: unified generative adversarial networks for multi-domain image-to-image translation. In: Proc IEEE Conf Comput Vis Pattern Recognit pp. 8789–8797
Comaniciu D, Meer P (2002) Mean shift: a robust approach toward feature space analysis. IEEE Trans Pattern Anal Mach Intell 24(5):603–619
Article Google Scholar
Dollar P, Zitnick C (2014) Fast edge detection using structured forests. IEEE Trans Pattern Anal Mach Intell 37(8):1558–1570
Article Google Scholar
Dollar P, Tu Z, Belongie S (2006) Supervised learning of edges and object boundaries. In: Proc IEEE Conf Comput Vis Pattern Recognitpp. 1964–1971
Duda R, Hart P (1974) Pattern classification and scene analysis. IEEE Trans Automat Contr 19(4):462–463
Article Google Scholar
Fang T, Fan Y, Wu W (2020) Salient contour detection on the basis of the mechanism of bilateral asymmetric receptive fields. Signal Image Video Process. https://doi.org/10.1007/s11760-020-01689-1
Felzenszwalb P, Huttenlocher D (2004) Efficient graphbased image segmentation. Int J Comput Vis 59(2):167–181
Article Google Scholar
Ferrari V, Fevrier L, Jurie F, Schmid C (2007) Groups of adjacent contour segments for object detection. IEEE Trans Pattern Anal Mach Intell 30(1):36–51
Article Google Scholar
Ganin Y, Lempitsky V (2014) N4-fields: neural network nearest neighbor fields for image transforms. In: Asian Conf Comput Vis pp. 536–551
Gupta S, Arbelaez P, Malik J (2013) Perceptual organization and recognition of indoor scenes from RGB-d images. In: Proc IEEE Conf Comput Vis Pattern Recognit pp. 564–571
Gupta S, Girshick R, Arbelaez P, Malik J (2014) Learning rich features from RGB-d images for object detection and segmentation. In: Euro Conf Comput Vis pp. 345–360
Hallman S, Fowlkes C (2015) Oriented edge forests for boundary detection. In: Proc IEEE Conf Comput Vis Pattern Recognit pp. 1732–1740
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proc IEEE Conf Comput Vis Pattern Recognit pp. 770–778
He J, Zhang S, Yang M, Shan Y, Huang T (2019) Bi-directional cascade network for perceptual edge detection. In: Proc IEEE Conf Comput Vis Pattern Recognit pp. 3828–3837
Jia Y, Shelhamer E, Donahue J, Karayev S, Long J, Girshick R, Guadarrama S, Darrell T (2014) Caffe: convolutional architecture for fast feature embedding. In: Proc ACM Int Conf Multi pp. 675–678
Lim J, Zitnick C, Dollar P (2013) Sketch tokens: a learned mid-level representation for contour and object detection. In: Proc IEEE Conf Comput Vis Pattern Recognit pp. 3158–3165
Liu Y, Cheng M, Hu X, Wang K, Bai X (2017) Richer convolutional features for edge detection. In: Proc IEEE Conf Comput Vis Pattern Recognit pp. 3000–3009
Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In Proc IEEE Conf Comput Vis Pattern Recognit pp. 3431–3440
Maninis K, Pont-Tuset J, Arbeláez P, Van-Gool L (2016) Convolutional oriented boundaries. In: Euro Conf Comput Vis pp. 580–596
Mottaghi R, Chen X, Liu X, Cho N, Lee S, Fidler S, Urtasun R, Yuille A (2014) The role of context for object detection and semantic segmentation in the wild. In: Proc IEEE Conf Comput Vis Pattern Recognit pp. 891–898
Poma X, Riba E, Sappa A (2020) Dense extreme inception network: towards a robust cnn model for edge detection. In: IEEE Winter Conf Appl Comput Vis pp. 1923–1932
Ren X, Bo L (2012) Discriminatively trained sparse code gradients for contour detection. Int Conf Neural Informa Process Syst pp. 584–592.
Ren Z, Shakhnarovich G (2013) Image segmentation by cascaded region agglomeration. In: Proc IEEE Conf Comput Vis Pattern Recognit pp. 2011–2018
Shen W, Wang X, Wang Y, Bai X, Zhang Z (2015) DeepContour: a deep convolutional feature learned by positivesharing loss for contour detection. In: Proc IEEE Conf Comput Vis Pattern Recognit pp. 3982–3991
Silberman N, Hoiem D, Kohli P, Fergus R (2012) Indoor segmentation and support inference from RGB-d images. In: Euro Conf Comput Vis pp. 746–760
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556
Sobel I (1972) Camera models and machine perception. Ph.d Thesis, Stanford University, Stanford
Tang P, Wang H, Kwong S (2017) G-MS2F: GoogLeNet based multi-stage feature fusion of deep CNN for scene recognition. Neurocomputing 225:188–197
Article Google Scholar
Wang X, Wu C, Xiang K, Xiang S, Chen W (2018) An experimental comparison of superpixels detection methods for contour detection. Mach Vis Appl 29(4):677–687
Article Google Scholar
Xie S, Tu Z (2015) Holistically-nested edge detection. In: Proc IEEE Int Conf Comput Vis pp. 1395–1403
Xu D, Ouyang W, Alameda-Pineda X, Ricci E, Wang X, Sebe N (2017) Learning deep structured multi-scale features using attention-gated crfs for contour prediction. Int Conf Neural Informa Process Syst pp. 3961–3970
Yang J, Yang M (2016) Top-down visual saliency via joint CRF and dictionary learning. IEEE Trans Pattern Anal Mach Intell 39(3):576–588
Article Google Scholar
Yang K, Li C, Li Y (2014) Multifeature-based surround inhibition improves contour detection in natural images. IEEE Trans Image Process 23(12):5020–5032
Article MathSciNet Google Scholar
Yang J, Price B, Cohen S, Lee H, Yang M (2016) Object contour detection with a fully convolutional encoder-decoder network. In: Proc IEEE Conf Comput Vis Pattern Recognit pp. 193–202

Download references

Acknowledgments

The work was supported in part by the National Natural Science Foundation of China (61501154). The authors would like to thank Yun Liu for his kind and help in the writing process.

Author information

Authors and Affiliations

Laboratory of Pattern Recognition and Image Processing Hangzhou Dianzi University, Hangzhou, 310018, China
Tao Fang, Mingqi Zhang, Yingle Fan, Wei Wu, Haitao Gan & Qingshan She

Authors

Tao Fang
View author publications
You can also search for this author in PubMed Google Scholar
Mingqi Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yingle Fan
View author publications
You can also search for this author in PubMed Google Scholar
Wei Wu
View author publications
You can also search for this author in PubMed Google Scholar
Haitao Gan
View author publications
You can also search for this author in PubMed Google Scholar
Qingshan She
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yingle Fan.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Fang, T., Zhang, M., Fan, Y. et al. Developing a feature decoder network with low-to-high hierarchies to improve edge detection. Multimed Tools Appl 80, 1611–1624 (2021). https://doi.org/10.1007/s11042-020-09800-x

Download citation

Received: 08 October 2019
Revised: 28 June 2020
Accepted: 02 September 2020
Published: 09 September 2020
Issue Date: January 2021
DOI: https://doi.org/10.1007/s11042-020-09800-x

Developing a feature decoder network with low-to-high hierarchies to improve edge detection

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Feature enhancement: predict more detailed and crisper edges

An exclusive U-net for fine and crisp edge detection

Multi-scale Feature Decode and Fuse Model with CRF Layer for Boundary Detection

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Developing a feature decoder network with low-to-high hierarchies to improve edge detection

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Feature enhancement: predict more detailed and crisper edges

An exclusive U-net for fine and crisp edge detection

Multi-scale Feature Decode and Fuse Model with CRF Layer for Boundary Detection

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation