Wang et al., 2022 - Google Patents
Efficient reinforcement learning for autonomous ship collision avoidance under learning experience reuseWang et al., 2022
- Document ID
- 507680520706712009
- Author
- Wang C
- Zhang X
- Gao H
- Su H
- Zheng K
- Wang W
- Publication year
- Publication venue
- 2022 IEEE International Conference on Unmanned Systems (ICUS)
External Links
Snippet
In this paper, a learning experience reuse-reinforcement learning collision avoidance (LER- RLCA) method is proposed, which can synthesize near-optimal collision avoidance policy with efficient sampling and good seamanship, to solve the local safety sailing of autonomous …
- 230000002787 reinforcement 0 title abstract description 58
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US11970161B2 (en) | Apparatus, method and article to facilitate motion planning of an autonomous vehicle in an environment having dynamic objects | |
| Yang et al. | Intelligent path planning of underwater robot based on reinforcement learning | |
| CN111780777B (en) | Unmanned vehicle route planning method based on improved A-star algorithm and deep reinforcement learning | |
| Cao et al. | Target search control of AUV in underwater environment with deep reinforcement learning | |
| Lan et al. | Path planning for underwater gliders in time-varying ocean current using deep reinforcement learning | |
| CN116804879B (en) | A robot path planning framework method combining improved dung beetle algorithm with DWA algorithm | |
| Fang et al. | Autonomous underwater vehicle formation control and obstacle avoidance using multi-agent generative adversarial imitation learning | |
| CN109726866A (en) | Path planning method for unmanned ship based on Q-learning neural network | |
| CN113821035A (en) | Unmanned ship trajectory tracking control method and device | |
| Du et al. | An optimized path planning method for coastal ships based on improved DDPG and DP | |
| CN114077256A (en) | A path planning method for unmanned watercraft | |
| Wen et al. | A deep residual reinforcement learning algorithm based on Soft Actor-Critic for autonomous navigation | |
| Pan et al. | Semantic predictive control for explainable and efficient policy learning | |
| Qu et al. | USV path planning under marine environment simulation using DWA and safe reinforcement learning | |
| Huang et al. | The USV path planning of dueling DQN algorithm based on tree sampling mechanism | |
| Wang et al. | Efficient reinforcement learning for autonomous ship collision avoidance under learning experience reuse | |
| Li et al. | Efficient vehicle trajectory prediction with goal lane segments and dual-stream cross attention | |
| CN116448134B (en) | Vehicle path planning method and device based on risk field and uncertain analysis | |
| Sun et al. | Cooperative pursuit-evasion game for multi-AUVs in the ocean current and obstacle environment | |
| Wang et al. | Domain generalization for vision-based driving trajectory generation | |
| CN115686076B (en) | UAV path planning method based on incremental developmental deep reinforcement learning | |
| Qu et al. | Cooperative Pursuit of Unmanned Surface Vehicles Using Multi-Agent Reinforcement Learning | |
| Wu et al. | Hr-planner: A hierarchical highway tactical planner based on residual reinforcement learning | |
| Akbari et al. | Optimized Path Planning for USVs under Ocean Currents | |
| Jin et al. | Optimization method of DRL-based route planning under limited resources |