Computer Science > Computer Vision and Pattern Recognition

arXiv:1804.00433 (cs)

[Submitted on 2 Apr 2018 (v1), last revised 16 May 2018 (this version, v2)]

Title: SINet: A Scale-insensitive Convolutional Neural Network for Fast Vehicle Detection

Authors: Xiaowei Hu, Xuemiao Xu, Yongjie Xiao, Hao Chen, Shengfeng He, Jing Qin, Pheng-Ann Heng

Abstract: Vision-based vehicle detection approaches have achieved incredible success in recent years with the development of deep convolutional neural networks (CNNs). However, existing CNN-based algorithms suffer from the problem that convolutional features are scale-sensitive in the object detection task, while traffic images and videos commonly contain vehicles with a large variance of scales. In this paper, we delve into the source of scale sensitivity and reveal two key issues: 1) existing RoI pooling destroys the structure of small-scale objects, and 2) the large intra-class distance caused by a large variance of scales exceeds the representation capability of a single network. Based on these findings, we present a scale-insensitive convolutional neural network (SINet) for fast detection of vehicles with a large variance of scales. First, we present a context-aware RoI pooling to maintain the contextual information and original structure of small-scale objects. Second, we present a multi-branch decision network to minimize the intra-class distance of features. These lightweight techniques bring zero extra time complexity but a prominent improvement in detection accuracy. The proposed techniques can be equipped with any deep network architecture while keeping it trainable end-to-end. Our SINet achieves state-of-the-art performance in terms of accuracy and speed (up to 37 FPS) on the KITTI benchmark and a new highway dataset, which contains a large variance of scales and extremely small objects.

Comments:	Accepted by IEEE Transactions on Intelligent Transportation Systems (T-ITS)
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1804.00433 [cs.CV]
	(or arXiv:1804.00433v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1804.00433
Journal reference:	IEEE Transactions on Intelligent Transportation Systems, vol. 20, no. 3, pp. 1010-1019, 2019
Related DOI:	https://doi.org/10.1109/TITS.2018.2838132

Submission history

From: Xiaowei Hu
[v1] Mon, 2 Apr 2018 09:27:09 UTC (5,390 KB)
[v2] Wed, 16 May 2018 09:05:29 UTC (5,389 KB)
