A Mean-Field Game Control for Large-Scale Swarm Formation Flight in Dense Environments
<p>Visualization of the structure and training process of our GAN-based neural network. Its training process is divided into two coupled alternating training parts—generator and discriminator.</p> "> Figure 2
<p>A large-scale desired formation consisting of twenty-five quadrotors traverses a 3-D environment from the bottom left side to the top right side.</p> "> Figure 3
<p>The visualization of the executed trajectories for the formation flight of large-scale UAVs. The <math display="inline"><semantics> <mrow> <mi>x</mi> <mi>o</mi> <mi>y</mi> </mrow> </semantics></math>-plane projections represent the outline of the shape.</p> "> Figure 4
<p>Illustration of MFG convergence and formation stability.</p> "> Figure 5
<p>Comparison of the executed trajectories for the formation flight of large-scale UAVs about volatility parameter <math display="inline"><semantics> <mrow> <mi>σ</mi> <mo>=</mo> <mn>0</mn> </mrow> </semantics></math>, <math display="inline"><semantics> <mrow> <msqrt> <mn>2</mn> </msqrt> <mo>×</mo> <msup> <mn>10</mn> <mrow> <mo>−</mo> <mn>1</mn> </mrow> </msup> </mrow> </semantics></math> and <math display="inline"><semantics> <mrow> <msqrt> <mn>10</mn> </msqrt> <mo>×</mo> <msup> <mn>10</mn> <mrow> <mo>−</mo> <mn>1</mn> </mrow> </msup> </mrow> </semantics></math>.</p> "> Figure 6
<p>The visualization of the executed trajectories for large-scale UAVs formation flying through an obstacle-rich area. (<b>a</b>) Full view of the trajectories. (<b>b</b>) <math display="inline"><semantics> <mrow> <mi>x</mi> <mi>o</mi> <mi>y</mi> </mrow> </semantics></math>-plane projection trajectories.</p> "> Figure 7
<p>Illustration of the performance of interaction terms for MFG control (<math display="inline"><semantics> <msup> <mi mathvariant="script">F</mi> <mn>1</mn> </msup> </semantics></math>, <math display="inline"><semantics> <msup> <mi mathvariant="script">F</mi> <mn>2</mn> </msup> </semantics></math>, <math display="inline"><semantics> <msup> <mi mathvariant="script">F</mi> <mn>3</mn> </msup> </semantics></math>). (<b>a</b>) Formation similarity error. (<b>b</b>) Minimum distance between UAVs and obstacles. (<b>c</b>) Minimum distance between UAVs.</p> ">
Abstract
:1. Introduction
- A differentiable graph-theory-based mean-field term that quantifies the similarity distance between large-scale three-dimensional formations; a differentiable ellipsoid-based mean-field term that inscribes the potential energy value of dense three-dimensional obstacles.
- A general control framework for complex scenarios of large-scale multiagent systems—mean-field game control, which jointly takes the amount of communication and computation, operating energy consumption, formation similarity, and obstacle avoidance into account.
- A series of simulations with a distributed large-scale aerial swarm system validates the efficiency and robustness of our method. The comparison with baseline methods shows the advanced nature of our method.
2. Two Types of Mean-Field Terms
2.1. Formation Mean-Field Term
2.2. Obstacle Mean-Field Term
3. A Mean-Field Game Control Framework for Complex Scenarios
3.1. Single-UAV Optimal Control Problem
3.2. Mean-Field Game Control Formulation
3.3. GAN-Based Algorithm for Complex-Scenario MFGs
Algorithm 1 GAN-based algorithm for complex-scenario MFGs |
Require: diffusion parameter, H Hamiltonian, g terminal cost, f interaction term. Require: Initialize neural networks and , batch size B. Require: Set and as in (29). while not converged do train : Sample batch where and Obtaining generated data . Update discriminator parameters to minimize train Sample batch where and Update generator parameters to minimize end while |
4. Simulation Results
4.1. Simulation Parameters
4.2. Formation Performance of Large-Scale UAVs
4.3. Effect of Volatility Term on Formation
4.4. Performance of Dense Obstacle Avoidance for Large-Scale UAVs Formation Flying
4.5. Comparison with Baselines
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
CPSs | Cyberphysical Systems |
MF | Mean Field |
MFG | Mean-Field Game |
MFGC | Mean-Field Game Control |
2-D | Two-Dimensional |
3-D | Three-Dimensional |
UAV | Unmanned Aerial Vehicle |
APF | Artificial Potential Field |
GANs | Generative Adversarial Neural Networks |
CA-Net | Coupled Alternating Neural Network |
HJB | Hamilton–Jacobi–Bellman (partial differential equation) |
References
- Lin, X.; Su, G.; Chen, B.; Wang, H.; Dai, M. Striking a Balance Between System Throughput and Energy Efficiency for UAV-IoT Systems. IEEE Internet Things J. 2019, 6, 10519–10533. [Google Scholar] [CrossRef]
- Zhang, J.; Xu, W.; Gao, H.; Pan, M.; Han, Z.; Zhang, P. Codebook-Based Beam Tracking for Conformal Array-Enabled UAV mmWave Networks. IEEE Internet Things J. 2021, 8, 244–261. [Google Scholar] [CrossRef]
- Zhang, J.; Yan, J.; Zhang, P.; Kong, X. Collision Avoidance in Fixed-Wing UAV Formation Flight Based on a Consensus Control Algorithm. IEEE Access 2018, 6, 43672–43682. [Google Scholar] [CrossRef]
- Zhang, J.; Yan, J.; Zhang, P. Fixed-Wing UAV Formation Control Design With Collision Avoidance Based on an Improved Artificial Potential Field. IEEE Access 2018, 6, 78342–78351. [Google Scholar] [CrossRef]
- Shao, S.K.; Peng, Y.; He, C.L.; Du, Y. Efficient Path Planning for UAV Formation via Comprehensively Improved Particle Swarm Optimization. ISA Trans. 2020, 97, 415–430. [Google Scholar] [CrossRef] [PubMed]
- Wu, E.; Sun, Y.; Huang, J.; Zhang, C.; Li, Z. Multi UAV Cluster Control Method Based on Virtual Core in Improved Artificial Potential Field. IEEE Access 2020, 8, 131647–131661. [Google Scholar] [CrossRef]
- Quan, L.; Yin, L.J.; Xu, C.; Gao, F. Distributed Swarm Trajectory Optimization for Formation Flight in Dense Environments. In Proceedings of the IEEE International Conference on Robotics and Automation, Philadelphia, PA, USA, 23–27 May 2022. [Google Scholar]
- Obstacle Avoidance of Resilient UAV Swarm Formation with Active Sensing System in the Dense Environment. Available online: https://arxiv.org/abs/2202.13381 (accessed on 27 February 2022).
- Kim, H.; Park, J.; Bennis, M.; Kim, S.L. Massive UAV-to-Ground Communication and its Stable Movement Control: A Mean-Field Approach. In Proceedings of the IEEE 19th International Workshop on Signal Processing Advances in Wireless Communications, Kalamata, Greece, 25–28 June 2018. [Google Scholar]
- Shiri, H.; Park, J.; Bennis, M. Communication-Efficient Massive UAV Online Path Control: Federated Learning Meets Mean-Field Game Theory. IEEE Trans. Commun. 2020, 68, 6840–6857. [Google Scholar] [CrossRef]
- Chen, D.; Qi, Q.; Zhuang, Z.; Wang, J.; Liao, J.; Han, Z. Mean Field Deep Reinforcement Learning for Fair and Efficient UAV Control. IEEE Internet Things J. 2021, 8, 813–828. [Google Scholar] [CrossRef]
- Xu, W.; Xiang, L.; Zhang, T.; Pan, M.; Han, Z. Cooperative Control of Physical Collision and Transmission Power for UAV Swarm: A Dual-Fields Enabled Approach. IEEE Internet Things J. 2022, 9, 2390–2403. [Google Scholar] [CrossRef]
- Gao, H.; Lee, W.J.; Kang, Y.H.; Li, W.C.; Han, Z.; Osher, S.; Poor, V. Energy-Efficient Velocity Control for Massive Numbers of UAVs: A Mean Field Game Approach. IEEE Trans. Veh. Technol. 2022, 71, 6266–6278. [Google Scholar] [CrossRef]
- Ruthotto, L.; Osher, S.J.; Li, W.; Nurbekyan, L.; Fung, S.W. A Machine Learning Framework for Solving High-Dimensional Mean Field Game and Mean Field Control Problems. Proc. Natl. Acad. Sci. USA 2020, 117, 9183–9193. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Lin, A.T.; Fung, S.W.; Li, W.C.; Nurbekyan, L.; Osher, S.J. Alternating the Population and Control Neural Networks to Solve High-Dimensional Stochastic Mean-Field Games. Proc. Natl. Acad. Sci. USA 2021, 118, e2024713118. [Google Scholar] [CrossRef] [PubMed]
- Goodfellow, I.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative Adversarial Nets. In Proceedings of the Advances in Neural Information Processing Systems, Montreal, QC, Canada, 8–13 December 2014; pp. 2672–2680. [Google Scholar]
- Wang, G.F.; Yao, W.; Zhang, X.; Niu, Z.J. Coupled Alternating Neural Networks for Solving Multi-Population High-Dimensional Mean-Field Games with Stochasticity. IEEE Trans. Cybern. 2021, submitted.
- Tantardini, M.; Ieva, F.; Tajoli, L.; Piccardi, C. Comparing Methods for Comparing Networks. Sci. Rep. 2019, 9, 17557. [Google Scholar] [CrossRef] [PubMed]
- Chang, K.; Xia, Y.; Huang, K. UAV Formation Control Design with Obstacle Avoidance in Dynamic Three-dimensional Environment. SpringerPlus 2016, 5, 1124. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Onken, D.; Nurbekyan, L.; Li, X.J.; Fung, S.W.; Osher, S.; Ruthotto, L. A Neural Network Approach for High-Dimensional Optimal Control. arXiv 2022, arXiv:2104.03270v1. [Google Scholar]
- García Carrillo, L.R.; Dzul López, A.E.; Lozano, R.; Pégard, C. Quad Rotorcraft Control; Springer: London, UK, 2013; pp. 23–34. [Google Scholar]
- Lasry, J.M.; Lions, P.L. Mean field games. Jpn. J. Math. 2007, 2, 229–260. [Google Scholar] [CrossRef] [Green Version]
- Schulte, J.M. Adjoint Methods for Hamilton-Jacobi-Bellman Equations. Ph.D. Thesis, University of Munster, Munster, Germany, November 2010. [Google Scholar]
- Arjovsky, M.; Chintala, S.; Bottou, L. Wasserstein Generative Adversarial Networks. In Proceedings of the Machine Learning Research, International Convention Centre, Sydney, Australia, 6–11 August 2017; Volume 70, pp. 214–223. [Google Scholar]
- Gao, H.; Lin, A.; Banez, R.A.; Li, W.C.; Han, Z.; Osher, S.; Poor, H.V. Belief and Opinion Evolution in Social Networks: A High-Dimensional Mean Field Game Approach, In Proceedings of IEEE International Conference on Communications, Montreal, QC, Canada, 14–23 June 2021.
Volatility Parameter | Formation Cost | Collision Avoidance Cost |
---|---|---|
0 | ||
Method | Scene | Scale of UAVs | Scene Complexity | Communication |
---|---|---|---|---|
[4] | Formation flight and obstacle avoidance | Small | 0.67 1 | 2 |
[9] | Cluster flight | Large | 0.5 | |
[13] | Cluster flight and obstacle avoidance | Large | 0.67 | |
[7] | Formation flight and dense obstacle avoidance | Small | 0.83 | |
Ours | Formation flight and dense obstacle avoidance | Large | 1 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Wang, G.; Yao, W.; Zhang, X.; Li, Z. A Mean-Field Game Control for Large-Scale Swarm Formation Flight in Dense Environments. Sensors 2022, 22, 5437. https://doi.org/10.3390/s22145437
Wang G, Yao W, Zhang X, Li Z. A Mean-Field Game Control for Large-Scale Swarm Formation Flight in Dense Environments. Sensors. 2022; 22(14):5437. https://doi.org/10.3390/s22145437
Chicago/Turabian StyleWang, Guofang, Wang Yao, Xiao Zhang, and Ziming Li. 2022. "A Mean-Field Game Control for Large-Scale Swarm Formation Flight in Dense Environments" Sensors 22, no. 14: 5437. https://doi.org/10.3390/s22145437