[go: up one dir, main page]

Ye et al., 2021 - Google Patents

Efficient robotic object search via hiem: Hierarchical policy learning with intrinsic-extrinsic modeling

Ye et al., 2021

View PDF
Document ID
868889123606827608
Author
Ye X
Yang Y
Publication year
Publication venue
IEEE robotics and automation letters

External Links

Snippet

Despite the significant success at enabling robots with autonomous behaviors makes deep reinforcement learning a promising approach for robotic object search task, the deep reinforcement learning approach severely suffers from the nature sparse reward setting of …
Continue reading at ieeexplore.ieee.org (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00Subject matter not provided for in other groups of this subclass
    • G06N99/005Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computer systems utilising knowledge based models
    • G06N5/04Inference methods or devices
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/02Computer systems based on biological models using neural network models
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computer systems utilising knowledge based models
    • G06N5/02Knowledge representation
    • G06N5/022Knowledge engineering, knowledge acquisition
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/12Computer systems based on biological models using genetic models
    • G06N3/126Genetic algorithms, i.e. information processing using digital simulations of the genetic system
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/004Artificial life, i.e. computers simulating life
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00Computer systems based on specific mathematical models
    • G06N7/005Probabilistic networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62Methods or arrangements for recognition using electronic means
    • G06K9/6217Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/18Digital computers in general; Data processing equipment in general in which a programme is changed according to experience gained by the computer itself during a complete run; Learning machines
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06QDATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management

Similar Documents

Publication Publication Date Title
Ramakrishnan et al. An exploration of embodied visual exploration
Pong et al. Skew-fit: State-covering self-supervised reinforcement learning
Shin et al. Benchmarks and algorithms for offline preference-based reward learning
Kulhánek et al. Visual navigation in real-world indoor environments using end-to-end deep reinforcement learning
Srinivas et al. Universal planning networks: Learning generalizable representations for visuomotor control
Ye et al. Efficient robotic object search via hiem: Hierarchical policy learning with intrinsic-extrinsic modeling
CN114460943B (en) Self-adaptive target navigation method and system for service robot
Ma et al. Contrastive variational reinforcement learning for complex observations
Devo et al. Deep reinforcement learning for instruction following visual navigation in 3D maze-like environments
Wang et al. Multirobot coordination with deep reinforcement learning in complex environments
Naveed et al. Deep introspective SLAM: Deep reinforcement learning based approach to avoid tracking failure in visual SLAM
Liu et al. Efficient preference-based reinforcement learning using learned dynamics models
Shin et al. Offline preference-based apprenticeship learning
Bhar et al. Era of artificial intelligence: Prospects for Indian agriculture
Ye et al. From seeing to moving: A survey on learning for visual indoor navigation (vin)
Liang et al. Knowledge induced deep q-network for a slide-to-wall object grasping
CN114880440A (en) Visual language navigation method and device based on intelligent assistance and knowledge empowerment
CN118061186A (en) Robot planning method and system based on multi-mode large model predictive control
Diekmann et al. CoBeL-RL: A neuroscience-oriented simulation framework for complex behavior and learning
Agarwal et al. Model learning for look-ahead exploration in continuous control
Lu Sports-ACtrans Net: research on multimodal robotic sports action recognition driven via ST-GCN
Gym et al. Deep reinforcement learning with python
Petrović et al. Efficient machine learning of mobile robotic systems based on convolutional neural networks
Gavenski et al. A Survey of Imitation Learning Methods, Environments and Metrics
Wulfmeier Efficient supervision for robot learning via imitation, simulation, and adaptation