Ye et al., 2021 - Google Patents
Efficient robotic object search via hiem: Hierarchical policy learning with intrinsic-extrinsic modelingYe et al., 2021
View PDF- Document ID
- 868889123606827608
- Author
- Ye X
- Yang Y
- Publication year
- Publication venue
- IEEE robotics and automation letters
External Links
Snippet
Despite the significant success at enabling robots with autonomous behaviors makes deep reinforcement learning a promising approach for robotic object search task, the deep reinforcement learning approach severely suffers from the nature sparse reward setting of …
- 238000004805 robotic 0 title abstract description 23
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/004—Artificial life, i.e. computers simulating life
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
- G06N7/005—Probabilistic networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/18—Digital computers in general; Data processing equipment in general in which a programme is changed according to experience gained by the computer itself during a complete run; Learning machines
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Ramakrishnan et al. | An exploration of embodied visual exploration | |
Pong et al. | Skew-fit: State-covering self-supervised reinforcement learning | |
Shin et al. | Benchmarks and algorithms for offline preference-based reward learning | |
Kulhánek et al. | Visual navigation in real-world indoor environments using end-to-end deep reinforcement learning | |
Srinivas et al. | Universal planning networks: Learning generalizable representations for visuomotor control | |
Ye et al. | Efficient robotic object search via hiem: Hierarchical policy learning with intrinsic-extrinsic modeling | |
CN114460943B (en) | Self-adaptive target navigation method and system for service robot | |
Ma et al. | Contrastive variational reinforcement learning for complex observations | |
Devo et al. | Deep reinforcement learning for instruction following visual navigation in 3D maze-like environments | |
Wang et al. | Multirobot coordination with deep reinforcement learning in complex environments | |
Naveed et al. | Deep introspective SLAM: Deep reinforcement learning based approach to avoid tracking failure in visual SLAM | |
Liu et al. | Efficient preference-based reinforcement learning using learned dynamics models | |
Shin et al. | Offline preference-based apprenticeship learning | |
Bhar et al. | Era of artificial intelligence: Prospects for Indian agriculture | |
Ye et al. | From seeing to moving: A survey on learning for visual indoor navigation (vin) | |
Liang et al. | Knowledge induced deep q-network for a slide-to-wall object grasping | |
CN114880440A (en) | Visual language navigation method and device based on intelligent assistance and knowledge empowerment | |
CN118061186A (en) | Robot planning method and system based on multi-mode large model predictive control | |
Diekmann et al. | CoBeL-RL: A neuroscience-oriented simulation framework for complex behavior and learning | |
Agarwal et al. | Model learning for look-ahead exploration in continuous control | |
Lu | Sports-ACtrans Net: research on multimodal robotic sports action recognition driven via ST-GCN | |
Gym et al. | Deep reinforcement learning with python | |
Petrović et al. | Efficient machine learning of mobile robotic systems based on convolutional neural networks | |
Gavenski et al. | A Survey of Imitation Learning Methods, Environments and Metrics | |
Wulfmeier | Efficient supervision for robot learning via imitation, simulation, and adaptation |