[go: up one dir, main page]

skip to main content
research-article

iVR: Integrated Vision and Radio Localization with Zero Human Effort

Published: 09 September 2019 Publication History

Abstract

Smartphone localization is essential to a wide range of applications in shopping malls, museums, office buildings, and other public places. Existing solutions relying on radio fingerprints and/or inertial sensors suffer from large location errors and considerable deployment efforts. We observe an opportunity in the recent trend of increasing numbers of security surveillance cameras installed in indoor spaces to overcome these limitations and revisit the problem of smartphone localization with a fresh perspective. However, fusing vision-based and radio-based systems is non-trivial due to the absence of absolute location, incorrespondence of identification and looseness of sensor fusion. This study proposes iVR, an integrated vision and radio localization system that achieves sub-meter accuracy with indoor semantic maps automatically generated from only two surveillance cameras, superior to precedent systems that require manual map construction or plentiful captured images. iVR employs a particle filter to fuse raw estimates from multiple systems, including vision, radio, and inertial sensor systems. By doing so, iVR outputs enhanced accuracy with zero start-up costs, while overcoming the respective drawbacks of each individual sub-system. We implement iVR on commodity smartphones and validate its performance in five different scenarios. The results show that iVR achieves a remarkable localization accuracy of 0.7m, outperforming the state-of-the-art systems by >70%.

References

[1]
H. Abdelnasser, R. Mohamed, A. Elgohary, M. F. Alzantot, H. Wang, S. Sen, R. R. Choudhury, and M. Youssef. 2016. SemanticSLAM: Using Environment Landmarks for Unsupervised Indoor Localization. IEEE Transactions on Mobile Computing 15, 7 (July 2016), 1770--1782.
[2]
Fadel Adib, Zach Kabelac, Dina Katabi, and Robert C Miller. 2014. 3D tracking via body radio reflections. In Proceedings of the USENIX NSDI.
[3]
Anton Andriyenko and Konrad Schindler. 2011. Multi-target tracking by continuous energy minimization. In Proceedings of the IEEE CVPR.
[4]
M. Azizyan, I. Constandache, and R. Roy Choudhury. 2009. Surroundsense: mobile phone localization via ambience fingerprinting. In Proceedings of the ACM MobiCom.
[5]
Luca Bertinetto, Jack Valmadre, Joao F Henriques, Andrea Vedaldi, and Philip HS Torr. 2016. Fully-convolutional siamese networks for object tracking. In ECCV. Springer.
[6]
Eric Betzig, George H Patterson, Rachid Sougrat, O Wolf Lindwasser, Scott Olenych, Juan S Bonifacino, Michael W Davidson, Jennifer Lippincott-Schwartz, and Harald F Hess. 2006. Imaging intracellular fluorescent proteins at nanometer resolution. Science 313, 5793 (2006), 1642--1645.
[7]
Siyuan Cao and He Wang. 2018. Enabling Public Cameras to Talk to the Public. In PACM on Interactive, Mobile, Wearable and Ubiquitous Technologies.
[8]
Steven D Cochran and Gérard Medioni. 1992. 3-D surface description from binocular stereo. IEEE Transactions on Pattern Analysis & Machine Intelligence 10 (1992), 981--994.
[9]
Piotr Dollár, Ron Appel, and Wolf Kienzle. 2012. Crosstalk cascades for frame-rate pedestrian detection. In Computer Vision-ECCV 2012. Springer, 645--659.
[10]
Erqun Dong, Jingao Xu, Chenshu Wu, Yunhao Liu, and Zheng Yang. 2019. Pair-Navi: Peer-to-Peer Indoor Navigation with Mobile Visual SLAM. In Proceedings of the IEEE INFOCOM.
[11]
Brian Ferris, Dieter Fox, and Neil Lawrence. 2007. WiFi-SLAM using Gaussian process latent variable models. In Proceedings of the IJCAI.
[12]
Ruipeng Gao, Mingmin Zhao, Tao Ye, Fan Ye, Yizhou Wang, Kaigui Bian, Tao Wang, and Xiaoming Li. 2014. Jigsaw: Indoor floor plan reconstruction via mobile crowdsensing. In Proceedings of ACM MobiCom.
[13]
Julius Goldhirsh and Wolfhard J Vogel. 1998. Handbook of propagation effects for vehicular and personal mobile satellite systems. NASA Reference Publication 1274 (1998), 40--67.
[14]
Helmut Grabner, Jiri Matas, Luc Van Gool, and Philippe Cattin. 2010. Tracking the invisible: Learning where the object might be. In Proceedings of the IEEE CVPR.
[15]
Mike Hazas and Andy Ward. 2002. A novel broadband ultrasonic location system. In Proceedings of the ACM Ubicomp.
[16]
Kaiming He, Georgia Gkioxari, Piotr Dollár, and Ross Girshick. 2017. Mask r-cnn. In Proceedings of the IEEE ICCV.
[17]
S. He and S. H. G. Chan. 2016. Wi-Fi Fingerprint-Based Indoor Positioning: Recent Advances and Comparisons. IEEE Communications Surveys Tutorials 18, 1 (Firstquarter 2016), 466--490.
[18]
João F Henriques, Rui Caseiro, Pedro Martins, and Jorge Batista. 2015. High-speed tracking with kernelized correlation filters. IEEE transactions on pattern analysis and machine intelligence 37, 3 (2015), 583--596.
[19]
Sebastian Hilsenbeck, Dmytro Bobkov, Georg Schroth, Robert Huitl, and Eckehard Steinbach. 2014. Graph-based Data Fusion of Pedometer and WiFi Measurements for Mobile Indoor Positioning. In Proceedings of the ACM UbiComp.
[20]
Robert V Hogg and Allen T Craig. 1995. Introduction to mathematical statistics.(5"" edition). Englewood Hills, New Jersey (1995).
[21]
J Huang, D Millman, M Quigley, and D Stavens. 2011. Efficient, generalized indoor WiFi GraphSLAM. In Proceedings of the IEEE International Conference on Robotics and Automation.
[22]
Ramesh Jain, Rangachar Kasturi, and Brian G Schunck. 1995. Machine vision. Vol. 5. McGraw-Hill New York.
[23]
Niall Jenkins. 2015. 245 million video surveillance cameras installed globally in 2014. IHS Technology (2015).
[24]
Wenchao Jiang and Zhaozheng Yin. 2017. Combining passive visual cameras and active IMU sensors for persistent pedestrian tracking. Journal of Visual Communication and Image Representation 48, 4 (2017), 419--431.
[25]
Junghyun Jun, Yu Gu, Long Cheng, Banghui Lu, Jun Sun, Ting Zhu, and Jianwei Niu. 2013. Social-Loc: Improving Indoor Localization with Social Sensing. In Proceedings of the ACM SenSys.
[26]
Michael Kass, Andrew Witkin, and Demetri Terzopoulos. 1988. Snakes: Active contour models. International journal of computer vision 1, 4 (1988), 321--331.
[27]
Jan J Koenderink and Andrea J Van Doorn. 1991. Affine structure from motion. JOSA A 8, 2 (1991), 377--385.
[28]
Manikanta Kotaru, Kiran Joshi, Dinesh Bharadia, and Sachin Katti. 2015. SpotFi:Decimeter Level Localization Using WiFi. In Proceedings of the ACM SIGCOMM.
[29]
Harold W Kuhn. 1955. The Hungarian method for the assignment problem. Naval research logistics quarterly 2, 1-2 (1955), 83--97.
[30]
Vincent Lepetit, Francesc Moreno-Noguer, and Pascal Fua. 2009. Epnp: An accurate o (n) solution to the pnp problem. International journal of computer vision 81, 2 (2009), 155.
[31]
Liqun Li, Pan Hu, Chunyi Peng, Guobin Shen, and Feng Zhao. 2014. Epsilon: A visible light based positioning system. In Proceedings of the USENIX NSDI.
[32]
Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, and C Lawrence Zitnick. 2014. Microsoft coco: Common objects in context. In European conference on computer vision. Springer.
[33]
Baiyang Liu, Junzhou Huang, Lin Yang, and Casimir Kulikowsk. 2011. Robust tracking using local sparse appearance model and k-selection. In Proceedings of the IEEE CVPR.
[34]
Hongbo Liu, Yu Gan, Jie Yang, Simon Sidhom, Yan Wang, Yingying Chen, and Fan Ye. 2012. Push the limit of WiFi based localization for smartphones. In Proceedings of the ACM MobiCom.
[35]
Xiaochen Liu, Yurong Jiang, Puneet Jain, and Kyu-Han Kim. 2018. TAR: Enabling Fine-Grained Targeted Advertising in Retail Stores. In Proceedings of the ACM Mobisys.
[36]
David G Lowe. 2004. Distinctive image features from scale-invariant keypoints. International journal of computer vision 60, 2 (2004), 91--110.
[37]
Justin Gregory Manweiler, Puneet Jain, and Romit Roy Choudhury. 2012. Satellites in our pockets: an object positioning system using smartphones. In Proceedings of the ACM Mobisys.
[38]
Jacques McCoun and Lucien Reeves. 2010. Binocular vision: development, depth perception and disorders. Nova Science Publishers, Inc.
[39]
Savvas Papaioannou, Hongkai Wen, Andrew Markham, and Niki Trigoni. 2014. Fusion of radio and camera sensor data for accurate indoor positioning. In Proceedings of the IEEE MASS.
[40]
Chunyi Peng, Guobin Shen, Yongguang Zhang, Yanlin Li, and Kun Tan. 2007. Beepbeep: a high accuracy acoustic ranging system using cots mobile devices. In Proceedings of the ACM MobiSys.
[41]
Hamed Pirsiavash, Deva Ramanan, and Charless C Fowlkes. 2011. Globally-optimal greedy algorithms for tracking a variable number of objects. In Proceedings of the IEEE CVPR.
[42]
Jain Puneet, Manweiler Justin, and Roy Choudhury. Romit. 2015. Overlay: Practical mobile augmented reality. In Proceedings of the 13th Annual International Conference on Mobile Systems.
[43]
Anshul Rai, Krishna Kant Chintalapudi, Venkata N. Padmanabhan, and Rijurekha Sen. 2012. Zee: Zero-effort Crowdsourcing for Indoor Localization. In Proceedings of the ACM MobiCom.
[44]
Torsten Sattler, Bastian Leibe, and Leif Kobbelt. 2011. Fast image-based localization using direct 2d-to-3d matching. In Proceedings of IEEE ICCV.
[45]
Longfei Shangguan, Zheng Yang, Alex X Liu, Zimu Zhou, and Yunhao Liu. 2016. STPP: Spatial-temporal phase profiling-based method for relative RFID tag localization. IEEE/ACM Transactions on Networking 25, 1 (2016), 596--609.
[46]
Y. Shu, Y. Huang, J. Zhang, P. Coué, P. Cheng, J. Chen, and K. G. Shin. 2016. Gradient-Based Fingerprinting for Indoor Localization and Tracking. IEEE Transactions on Industrial Electronics 63, 4 (April 2016), 2424--2433.
[47]
Adrian Smith. 2013. Sequential Monte Carlo methods in practice. Springer Science & Business Media.
[48]
Noah Snavely, Steven M Seitz, and Richard Szeliski. 2006. Photo tourism: exploring photo collections in 3D. In ACM Transactions on Graphics (TOG).
[49]
Jin Teng, Boying Zhang, Junda Zhu, Xinfeng Li, Dong Xuan, and Yuan F Zheng. 2014. EV-Loc: integrating electronic and visual signals for accurate localization. IEEE/ACM Transactions on Networking (TON) 22, 4 (2014), 1285--1296.
[50]
He Wang, Souvik Sen, Ahmed Elgohary, Moustafa Farid, Moustafa Youssef, and Romit Roy Choudhury. 2012. No need to war-drive: unsupervised indoor localization. In Proceedings of the ACM MobiSys.
[51]
Ju Wang, Hongbo Jiang, Jie Xiong, Kyle Jamieson, Xiaojiang Chen, Dingyi Fang, and Binbin Xie. 2016. LiFS: Low Human Effort, Device-Free Localization with Fine-Grained Subcarrier Information. In Proceedings of ACM MobiCom.
[52]
Changchang Wu. 2013. Towards linear-time incremental structure from motion. In Proceedings of IEEE 3D Vision.
[53]
Chenshu Wu, Jingao Xu, Zheng Yang, Nicholas D. Lane, and Zuwei Yin. 2017. Gain Without Pain: Accurate WiFi-based Localization with Fingerprint Spatial Gradient. In PACM on Interactive, Mobile, Wearable and Ubiquitous Technologies.
[54]
Chenshu Wu, Zheng Yang, and Yunhao Liu. 2015. Smartphones based Crowdsourcing for Indoor Localization. IEEE Transactions on Mobile Computing 14, 2 (Feb 2015), 444--457.
[55]
Chenshu Wu, Zheng Yang, and Chaowei Xiao. 2018. Automatic Radio Map Adaptation for Indoor Localization using Smartphones. IEEE Transactions on Mobile Computing 17, 3 (March 2018), 517--528.
[56]
Chenshu Wu, Zheng Yang, Chaowei Xiao, Chaofan Yang, Yunhao Liu, and Mingyan Liu. 2015. Static Power of Mobile Devices: Self-updating Radio Maps for Wireless Indoor Localization. In Proceedings of the IEEE INFOCOM.
[57]
Jie Xiong and Kyle Jamieson. 2013. ArrayTrack: a fine-grained indoor location system. In Proceedings of the USENIX NSDI.
[58]
Han Xu, Zheng Yang, Zimu Zhou, Longfei Shangguan, Ke Yi, and Yunhao Liu. 2015. Enhancing Wifi-based Localization with Visual Clues. In Proceedings of the ACM UbiComp.
[59]
Han Xu, Zheng Yang, Zimu Zhou, Longfei Shangguan, Ke Yi, and Yunhao Liu. 2016. Indoor Localization via Multi-Modal Sensing on Smartphones. In Proceedings of the ACM UbiComp.
[60]
Jingao Xu, Zheng Yang, Hengjie Chen, Yunhao Liu, Xianchun Zhou, Jinbo Li, and Nicholas Lane. 2018. Embracing Spatial Awareness for Reliable WiFi-Based Indoor Location Systems. In Proceedings of the IEEE MASS.
[61]
Zheng Yang, Chenshu Wu, and Yunhao Liu. 2012. Locating in Fingerprint Space: Wireless Indoor Localization with Little Human Intervention. In Proceedings of the ACM MobiCom.
[62]
Zheng Yang, Chenshu Wu, Zimu Zhou, Xinglin Zhang, Xu Wang, and Yunhao Liu. 2015. Mobility Increases Localizability: A Survey on Wireless Indoor Localization Using Inertial Sensors. Comput. Surveys 47, 3, Article 54 (April 2015), 34 pages.
[63]
Moustafa Youssef and Ashok Agrawala. 2008. The Horus Location Determination System. Wireless Networks 14, 3 (June 2008), 357--374.
[64]
Xinglin Zhang, Zheng Yang, Yunhao Liu, and Shaohua Tang. 2016. On reliable task assignment for spatial crowdsourcing. IEEE Transactions on Emerging Topics in Computing 7, 1 (2016), 174--186.
[65]
Pengfei Zhou, Mo Li, and Guobin Shen. 2014. Use it free: Instantly knowing your phone attitude. In Proceedings of ACM Mobicom.

Cited By

View all
  • (2024)Mission: mmWave Radar Person Identification with RGB CamerasProceedings of the 22nd ACM Conference on Embedded Networked Sensor Systems10.1145/3666025.3699340(309-321)Online publication date: 4-Nov-2024
  • (2024)WiVi-GR: Wireless-Visual Joint Representation-Based Accurate Gesture RecognitionIEEE Internet of Things Journal10.1109/JIOT.2023.329237611:2(2701-2711)Online publication date: 15-Jan-2024
  • (2024)Non-contact multimodal indoor human monitoring systems: A surveyInformation Fusion10.1016/j.inffus.2024.102457110(102457)Online publication date: Oct-2024
  • Show More Cited By

Index Terms

  1. iVR: Integrated Vision and Radio Localization with Zero Human Effort

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies
    Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies  Volume 3, Issue 3
    September 2019
    1415 pages
    EISSN:2474-9567
    DOI:10.1145/3361560
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 09 September 2019
    Published in IMWUT Volume 3, Issue 3

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Computer Vision
    2. Indoor Localization
    3. Pedestrian Tracking
    4. Wireless

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)55
    • Downloads (Last 6 weeks)4
    Reflects downloads up to 14 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Mission: mmWave Radar Person Identification with RGB CamerasProceedings of the 22nd ACM Conference on Embedded Networked Sensor Systems10.1145/3666025.3699340(309-321)Online publication date: 4-Nov-2024
    • (2024)WiVi-GR: Wireless-Visual Joint Representation-Based Accurate Gesture RecognitionIEEE Internet of Things Journal10.1109/JIOT.2023.329237611:2(2701-2711)Online publication date: 15-Jan-2024
    • (2024)Non-contact multimodal indoor human monitoring systems: A surveyInformation Fusion10.1016/j.inffus.2024.102457110(102457)Online publication date: Oct-2024
    • (2024)Environment Understanding with EdgeSLAMEdge Assisted Mobile Visual SLAM10.1007/978-981-97-3573-0_7(133-157)Online publication date: 6-May-2024
    • (2024)Robust Indoor LocalizationLocation, Localization, and Localizability10.1007/978-981-97-3176-3_8(131-162)Online publication date: 12-Jul-2024
    • (2023)From Coarse to Fine: Two-Stage Indoor Localization with Multisensor FusionTsinghua Science and Technology10.26599/TST.2022.901002928:3(552-565)Online publication date: Jun-2023
    • (2023)A Picture is Worth 1,000 Millimeters: Combining Vision and Wi-Fi to Improve Localization2023 IEEE 24th International Symposium on a World of Wireless, Mobile and Multimedia Networks (WoWMoM)10.1109/WoWMoM57956.2023.00020(56-66)Online publication date: Jun-2023
    • (2023)EdgeDuet: Tiling Small Object Detection for Edge Assisted Autonomous Mobile VisionIEEE/ACM Transactions on Networking10.1109/TNET.2022.322341231:4(1765-1778)Online publication date: Aug-2023
    • (2023)Locate, Tell, and Guide: Enabling Public Cameras to Navigate the PublicIEEE Transactions on Mobile Computing10.1109/TMC.2021.309272522:2(1010-1024)Online publication date: 1-Feb-2023
    • (2022)Cross Vision-RF Gait Re-identification with Low-cost RGB-D Cameras and mmWave RadarsProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/35503256:3(1-25)Online publication date: 7-Sep-2022
    • Show More Cited By

    View Options

    Get Access

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media