Ye et al., 2021 - Google Patents

Efficient robotic object search via hiem: Hierarchical policy learning with intrinsic-extrinsic modeling

Ye et al., 2021

Document ID: 868889123606827608
Author: Ye X; Yang Y
Publication year: 2021
Publication venue: IEEE robotics and automation letters

External Links

Cited by

Snippet

Despite the significant success at enabling robots with autonomous behaviors makes deep reinforcement learning a promising approach for robotic object search task, the deep reinforcement learning approach severely suffers from the nature sparse reward setting of …

Continue reading at ieeexplore.ieee.org (PDF) (other versions)

238000004805 robotic 0 title abstract description 23

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/004—Artificial life, i.e. computers simulating life
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
- G06N7/005—Probabilistic networks
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/18—Digital computers in general; Data processing equipment in general in which a programme is changed according to experience gained by the computer itself during a complete run; Learning machines
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management

Similar Documents

Publication	Publication Date	Title
Ramakrishnan et al.	2021	An exploration of embodied visual exploration
Pong et al.	2019	Skew-fit: State-covering self-supervised reinforcement learning
Shin et al.	2023	Benchmarks and algorithms for offline preference-based reward learning
Kulhánek et al.	2021	Visual navigation in real-world indoor environments using end-to-end deep reinforcement learning
Srinivas et al.	2018	Universal planning networks: Learning generalizable representations for visuomotor control
Ye et al.	2021	Efficient robotic object search via hiem: Hierarchical policy learning with intrinsic-extrinsic modeling
CN114460943B (en)	2023-07-28	Self-adaptive target navigation method and system for service robot
Ma et al.	2021	Contrastive variational reinforcement learning for complex observations
Devo et al.	2020	Deep reinforcement learning for instruction following visual navigation in 3D maze-like environments
Wang et al.	2021	Multirobot coordination with deep reinforcement learning in complex environments
Naveed et al.	2022	Deep introspective SLAM: Deep reinforcement learning based approach to avoid tracking failure in visual SLAM
Liu et al.	2023	Efficient preference-based reinforcement learning using learned dynamics models
Shin et al.	2021	Offline preference-based apprenticeship learning
Bhar et al.	2019	Era of artificial intelligence: Prospects for Indian agriculture
Ye et al.	2020	From seeing to moving: A survey on learning for visual indoor navigation (vin)
Liang et al.	2019	Knowledge induced deep q-network for a slide-to-wall object grasping
CN114880440A (en)	2022-08-09	Visual language navigation method and device based on intelligent assistance and knowledge empowerment
CN118061186A (en)	2024-05-24	Robot planning method and system based on multi-mode large model predictive control
Diekmann et al.	2023	CoBeL-RL: A neuroscience-oriented simulation framework for complex behavior and learning
Agarwal et al.	2019	Model learning for look-ahead exploration in continuous control
Lu	2024	Sports-ACtrans Net: research on multimodal robotic sports action recognition driven via ST-GCN
Gym et al.	2021	Deep reinforcement learning with python
Petrović et al.	2023	Efficient machine learning of mobile robotic systems based on convolutional neural networks
Gavenski et al.	2024	A Survey of Imitation Learning Methods, Environments and Metrics
Wulfmeier	2019	Efficient supervision for robot learning via imitation, simulation, and adaptation