Robust and Fair Undersea Target Detection with Automated Underwater Vehicles for Biodiversity Data Collection
<p>Undersea/subsea exploration: (<b>a</b>) Aberdeen oil field in the North Sea; (<b>b</b>) fish in Scotland sea farms.</p> "> Figure 2
<p>Image conversion toward the challenges in underwater conditions: (<b>a</b>) the original image; (<b>b</b>) the converted image by DCGAN; (<b>c</b>) an object in the original image; (<b>d</b>) an object in the converted image.</p> "> Figure 3
<p>The proposed end-to-end DCGAN+SSD architecture.</p> "> Figure 4
<p>The difference in the original images and the images enhanced by DCGAN: (<b>a</b>) the original image; (<b>b</b>) the image converted by DCGAN.</p> "> Figure 4 Cont.
<p>The difference in the original images and the images enhanced by DCGAN: (<b>a</b>) the original image; (<b>b</b>) the image converted by DCGAN.</p> "> Figure 5
<p>The difference in detection success rates between SSD only and DCGAN+SSD. SSD only missed several objects, while DCGAN+SSD could achieve better detection in all cases: (<b>a</b>) object detection by SSD only; (<b>b</b>) object detection by DCGAN+SSD.</p> "> Figure 5 Cont.
<p>The difference in detection success rates between SSD only and DCGAN+SSD. SSD only missed several objects, while DCGAN+SSD could achieve better detection in all cases: (<b>a</b>) object detection by SSD only; (<b>b</b>) object detection by DCGAN+SSD.</p> "> Figure 6
<p>The difference in detection between DCGAN+SSD and PSO+DCGAN+SSD: (<b>a</b>) object detection by DCGAN+SSD; (<b>b</b>) object detection by PSO+DCGAN+SSD.</p> "> Figure 6 Cont.
<p>The difference in detection between DCGAN+SSD and PSO+DCGAN+SSD: (<b>a</b>) object detection by DCGAN+SSD; (<b>b</b>) object detection by PSO+DCGAN+SSD.</p> "> Figure 7
<p>The visual comparison of accuracy rates between the initial DCGAN+SSD model and the optimized PSO+DCGAN+SSD model.</p> "> Figure 8
<p>The comparison between the initial DCGAN+SSD model and the optimized PSO+DCGAN+SSD model according to the degrees of ratio bias, absolute bias, and standard bias.</p> ">
Abstract
:1. Introduction
2. Background and Related Work
2.1. Challenges in Underwater Object Detection: Robustness and Fairness
2.2. Image Conversion via DCGAN
2.3. Balancing Dataset
3. Our DCGAN+SSD Framework
4. Our PSO-Based Model Optimization
4.1. Particle Swarm Optimization
4.2. Hyperparameter Tuning
Algorithm 1. The Pseudo Code of PSO-Based Optimization. |
Initialize the swarm and the search parameters. While (K < the maximum number of iterations) For (i = 1 to number of particles N) { Evaluate particle i; If the fitness of is greater than the fitness of Then update = If the fitness of is greater than that of the global best which is Then update = For (each dimension, i.e., d1, d2, and d3 = dimensions for the learning rate, momentum, and weight decay, respectively), Then update velocity vector vit+1 using the defined PSO velocity updating equation; Update particle position using the defined PSO position updating equation; End (for dimensions) } End For Loop (for particles) End While Loop |
5. Experimental Results
5.1. Comparison of DCGAN+SSD with SSD Only
5.2. Our PSO+DCGAN+SSD Model
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Code Availability
References
- Moniruzzaman, M.; Islam, S.M.S.; Bennamoun, M.; Lavery, P. Deep Learning on Underwater Marine Object Detection: A Survey. In Advanced Concepts for Intelligent Vision Systems. ACIVS 2017; Blanc-Talon, J., Penne, R., Philips, W., Popescu, D., Scheunders, P., Eds.; Lecture Notes in Computer Science; Springer: Cham, Switzerland, 2017; Volume 10617. [Google Scholar]
- Costello, C.; Cao, L.; Gelcich, S.; Cisneros-Mata, M.Á.; Free, C.M.; Froehlich, H.E.; Golden, C.D.; Ishimura, G.; Maier, J.; Macadam-Somer, I.; et al. The future of food from the sea. Nature 2020, 588, 95–100. [Google Scholar] [CrossRef] [PubMed]
- Arrow, K.J. Global Climate Change: A Challenge to Policy. Econ. Voice 2007, 4. [Google Scholar] [CrossRef]
- Wunderling, N.; Willeit, M.; Donges, J.F.; Winkelmann, R. Global warming due to loss of large ice masses and Arctic summer sea ice. Nat. Commun. 2020, 11, 5177. [Google Scholar] [CrossRef]
- Kazour, M.; Terki, S.; Rabhi, K.; Jemaa, S.; Khalaf, G.; Amara, R. Sources of microplastics pollution in the marine environment: Importance of wastewater treatment plant and coastal landfill. Mar. Pollut. Bull. 2019, 146, 608–618. [Google Scholar] [CrossRef] [PubMed]
- Borowitzka, M.A. Intertidal algal species diversity and the effect of pollution. Mar. Freshw. Res. 1972, 23, 73–74. [Google Scholar] [CrossRef]
- Strachan, N.J.C. Recognition of fish species by colour and shape. Image Vis. Comput. 1993, 11, 2–10. [Google Scholar] [CrossRef]
- Sands, T. Development of Deterministic Artificial Intelligence for Unmanned Underwater Vehicles (UUV). J. Mar. Sci. Eng. 2020, 8, 578. [Google Scholar] [CrossRef]
- Shirakura, N.; Kiyokawa, T.; Kumamoto, H.; Takamatsu, J.; Ogasawara, T. Collection of Marine Debris by Jointly Using UAV-UUV with GUI for Simple Operation. IEEE Access 2021, 9, 67432–67443. [Google Scholar] [CrossRef]
- Yan, Z.; Zhang, J.; Tang, J. Modified whale optimization algorithm for underwater image matching in a UUV vision system. Multimed. Tools Appl. 2021, 80, 187–213. [Google Scholar] [CrossRef]
- Dinakaran, R.; Sexton, G.; Şeker, H.; Bouridane, A.; Jiang, R. Image resolution impact analysis on pedestrian detection in smart cities surveillance. In Proceedings of the 1st International Conference on Internet of Things and Machine Learning, Liverpool, UK, 17–18 October 2017; pp. 1–8. [Google Scholar]
- Dinakaran, R.; Easom, P.; Zhang, L.; Bouridane, A.; Jiang, R.; Edirisinghe, E. Distant Pedestrian Detection in the Wild using Single Shot Detector with Deep Convolutional Generative Adversarial Networks. In Proceedings of the 2019 International Joint Conference on Neural Networks (IJCNN), Budapest, Hungary, 14–19 July 2019. [Google Scholar]
- Radford, A.; Metz, L.; Chintala, S. Unsupervised representation learning with deep convolutional generative adversarial networks. In Proceedings of the International Conference on Learning Representations (ICLR), San Juan, Puerto Rico, 2–4 May 2016; pp. 1–16. [Google Scholar]
- Liu, W.; Anguelov, D.; Erhan, D.; Szegedy, C.; Reed, S.; Fu, C.-Y.; Berg, A.C. SSD: Single shot multibox detector. In Computer Vision—ECCV 2016; Lecture Notes in Computer Science; Springer: Cham, Switzerland, 2016; Volume 9905. [Google Scholar]
- Athira, M.V.; Khan, D.M. Recent Trends on Object Detection and image classification: A review. In Proceedings of the 2020 International Conference on Computational Performance Evaluation (ComPE), Shillong, India, 2–4 July 2020. [Google Scholar]
- Redmon, J.; Divvala, S.; Girshick, R.; Farhadi, A. You Only Look Once: Unified, Real-Time Object Detection. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016. [Google Scholar]
- Chiang, C.; Barnes, C.; Angelov, P.; Jiang, R. Deep Learning-based Automated Forest Health Diagnosis from Aerial Images. IEEE Access 2020, 8, 144064–144076. [Google Scholar] [CrossRef]
- Jiang, R.; Chazot, P.; Pavese, N.; Crookes, D.; Bouridane, A.; Celebi, M.E. Private Facial Prediagnosis as an Edge Service for Parkinson’s DBS Treatment Valuation. IEEE J. Biomed. Health Inform. 2022, 26, 2703–2713. [Google Scholar] [CrossRef]
- Storey, G.; Jiang, R.; Keogh, S.; Bouridane, A.; Li, C.-T. 3DPalsyNet: A Facial Palsy Grading and Motion Recognition Framework Using Fully 3D Convolutional Neural Networks. IEEE Access 2019, 7, 121655–121664. [Google Scholar] [CrossRef]
- Storey, G.; Jiang, R.; Bouridane, A. Role for 2D image generated 3D face models in the rehabilitation of facial palsy. IET Healthc. Technol. Lett. 2017, 4, 145–148. [Google Scholar] [CrossRef]
- Jiang, R.; Crookes, D. Shallow Unorganized Neural Networks Using Smart Neuron Model for Visual Perception. IEEE Access 2019, 7, 152701–152714. [Google Scholar] [CrossRef]
- Jiang, Z.; Chazot, P.L.; Celebi, M.E.; Crookes, D.; Jiang, R. Social Behavioral Phenotyping of Drosophila With a 2D–3D Hybrid CNN Framework. IEEE Access 2019, 7, 67972–67982. [Google Scholar] [CrossRef]
- Jiang, R.; Parry, M.L.; Legg, P.A.; Chung, D.H.S.; Griffiths, I.W. Automated 3-D Animation from Snooker Videos with Information-Theoretical Optimization. IEEE Trans. Comput. Intell. AI Games 2013, 5, 337–345. [Google Scholar] [CrossRef]
- Szegedy, C.; Liu, W.; Jia, Y.; Sermanet, P.; Reed, S.; Anguelov, D.; Erhan, D.; Vanhoucke, V.; Rabinovich, A. Going Deeper with Convolution. In Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA, 7–12 June 2015. [Google Scholar]
- Shuyang Du, Haoli Guo, Andrew Simpson, “Self-Driving Car Steering Angle Prediction Based on Image Recognition. arXiv 2019, arXiv:1912.05440.
- Ayachi, R.; Said, Y.; Ben Abdelaali, A. Pedestrian Detection Based on Light-Weighted Separable Convolution for Advanced Driver Assistance Systems. Neural Process. Lett. 2020, 52, 2655–2668. [Google Scholar] [CrossRef]
- Zhao, J.; Mathieu, M.; Goroshin, R.; Lecun, Y. Stacked what-where auto-encoders. arXiv 2016, arXiv:1506.02351. [Google Scholar]
- Wang, K.; Liu, M.Z. Object Recognition at Night Scene Based on DCGAN and Faster R-CNN. IEEE Access 2020, 8, 193168–193182. [Google Scholar] [CrossRef]
- Joseph, M.; Kearns, M.; Morgenstern, J.; Neel, S.; Roth, A. Rawlsian Fairness for Machine Learning. In Proceedings of the 3rd Workshop on Fairness, Accountability, and Transparency in Machine Learning, Special Interest Group on Knowledge Discovery and Data Mining (SIGKDD), FATML 2016, New York, NY, USA, 18 November 2016. [Google Scholar]
- Bian, Y.; Wang, J.; Jun, J.J.; Xie, X.-Q. Deep Convolutional Generative Adversarial Network (dcGAN) Models for Screening and Design of Small Molecules Targeting Cannabinoid Receptors. Mol. Pharm. 2019, 16, 4451–4460. [Google Scholar] [CrossRef] [PubMed]
- Zhang, J.; Chen, L.; Zhuo, L.; Liang, X.; Li, J. An Efficient Hyperspectral Image Retrieval Method: Deep Spectral-Spatial Feature Extraction with DCGAN and Dimensionality Reduction Using t-SNE-Based NM Hashing. Remote Sens. 2018, 10, 271. [Google Scholar] [CrossRef] [Green Version]
- Dewi, C.; Chen, R.-C.; Liu, Y.-T.; Tai, S.-K. Synthetic Data generation using DCGAN for improved traffic sign recognition. Neural Comput. Appl. 2021, 1–16. [Google Scholar] [CrossRef]
- Chen, G.; Liu, L.; Hu, W.; Pan, Z. Semi-Supervised Object Detection in Remote Sensing Images Using Generative Adversarial Networks. In Proceedings of the IGARSS 2018—2018 IEEE International Geoscience and Remote Sensing Symposium, Valencia, Spain, 22–27 July 2018; pp. 2503–2506. [Google Scholar]
- Wu, Q.; Chen, Y.; Meng, J. DCGAN-Based Data Augmentation for Tomato Leaf Disease Identification. IEEE Access 2020, 8, 98716–98728. [Google Scholar] [CrossRef]
- Kim, B.; Yuvaraj, N.; Preethaa, K.R.S.; Santhosh, R.; Sabari, A. Enhanced pedestrian detection using optimized deep convolution neural network for smart building surveillance. Soft Comput. 2020, 24, 17081–17092. [Google Scholar] [CrossRef]
- Li, Z.; Jin, Y.; Li, Y.; Lin, Z.; Wang, S. Imbalanced Adversarial Learning for Weather Image Generation and Classification. In Proceedings of the 2018 14th IEEE International Conference on Signal Processing (ICSP), Beijing, China, 2–16 August 2018; pp. 1093–1097. [Google Scholar]
- Eberhart, R.; Kennedy, J. A New Optimizer using particle swarm theory. In Proceedings of the Sixth International Symposium on Micro Machine and Human Science (MHS’95), Nagoya, Japan, 4–6 October 1995. [Google Scholar]
- Bergstra, J.; Bardenet, R.; Bengio, Y.; Kegl, B. Algorithms for Hyper-Parameter Optimization. In Proceedings of the Advances in Neural Information Processing Systems 24 (NIPS 2011), Granada, Spain, 12–17 December 2011. [Google Scholar]
- Fukushima, K. Neocognitron: A self-organizing Neural Network Model for Mechanism of Pattern Recognition Unaffected by shift in position. Biol. Cybern. 1980, 36, 193–202. [Google Scholar] [CrossRef] [PubMed]
- Teng, B.; Zhao, H. Underwater target recognition methods based on the framework of deep learning: A survey. Int. J. Adv. Robot. Syst. 2020, 17, 1–12. [Google Scholar] [CrossRef]
Objects | Object Instances | Correct Detection | False Detection | ||
---|---|---|---|---|---|
SSD | DCGAN-SSD | SSD | DCGAN-SSD | ||
Human | 201 | 5 | 164 | 3 | 12 |
Fish | 62 | 0 | 43 | 0 | 8 |
Plants | 34 | 0 | 14 | 0 | 2 |
Others | 13 | 0 | 3 | 0 | 4 |
Learning Rate | Momentum | Weight Decay | Fitness (in Loss) |
---|---|---|---|
0.000285 | 0.007951 | 0.44710 | 231.5625 |
0.006902 | 0.00908 | 0.82245 | 9.58052 |
0.000521 | 0.001392 | 0.2512 | 198.371 |
0.001138 | 0.000235 | 0.4467 | 132.3029 |
0.005217 | 0.001104 | 0.92989 | 16.11632 |
0.000445 | 0.003728 | 0.7033 | 156.3364 |
0.003516 | 0.003442 | 0.1698 | 77.77496 |
0.001108 | 0.009621 | 0.5448 | 122.4384 |
0.002458 | 0.007613 | 0.7595 | 31.23957 |
0.001261 | 0.002016 | 0.1152 | 163.3016 |
Categories | Object Instances | Detection | Accuracy | ||
---|---|---|---|---|---|
DCGAN-SSD | DCGAN-SSD+PSO | DCGAN-SSD | DCGAN-SSD+PSO | ||
Human | 201 | 164 | 186 | 81.0% | 92.5% |
Fish | 62 | 43 | 58 | 69.3% | 93.0% |
Plants | 34 | 14 | 24 | 41.0% | 70.0% |
Others | 13 | 3 | 8 | 23.0% | 61.5% |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Dinakaran, R.; Zhang, L.; Li, C.-T.; Bouridane, A.; Jiang, R. Robust and Fair Undersea Target Detection with Automated Underwater Vehicles for Biodiversity Data Collection. Remote Sens. 2022, 14, 3680. https://doi.org/10.3390/rs14153680
Dinakaran R, Zhang L, Li C-T, Bouridane A, Jiang R. Robust and Fair Undersea Target Detection with Automated Underwater Vehicles for Biodiversity Data Collection. Remote Sensing. 2022; 14(15):3680. https://doi.org/10.3390/rs14153680
Chicago/Turabian StyleDinakaran, Ranjith, Li Zhang, Chang-Tsun Li, Ahmed Bouridane, and Richard Jiang. 2022. "Robust and Fair Undersea Target Detection with Automated Underwater Vehicles for Biodiversity Data Collection" Remote Sensing 14, no. 15: 3680. https://doi.org/10.3390/rs14153680
APA StyleDinakaran, R., Zhang, L., Li, C.-T., Bouridane, A., & Jiang, R. (2022). Robust and Fair Undersea Target Detection with Automated Underwater Vehicles for Biodiversity Data Collection. Remote Sensing, 14(15), 3680. https://doi.org/10.3390/rs14153680