Hindsight Goal Generation Based on Graph-Based Diversity and Proximity

Requirements

Ubuntu 16.04 or macOS Catalina 10.15.7 (newer versions also work well)
Python 3.5.2 (newer versions such as 3.6.8 should work as well; 3.8 or higher is not recommended)
MuJoCo == 2.00 (see the instructions at https://github.com/openai/mujoco-py)
Install the requirements with pip:

pip install -r requirements.txt
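Since the Python version constraint above is easy to overlook, a small pre-flight check can help. The following is only a sketch: the helper name `python_version_ok` is hypothetical and not part of the repository, and the accepted range (3.5 up to, but not including, 3.8) is an interpretation of the requirement list rather than a tested compatibility matrix.

```python
import sys


def python_version_ok(version_info):
    """Return True if (major, minor) lies in the range suggested above.

    Assumption: "3.5.2 or newer, 3.8 or higher not suggested" is read as
    3.5 <= version < 3.8. Patch versions are ignored.
    """
    major_minor = tuple(version_info[:2])
    return (3, 5) <= major_minor < (3, 8)


if __name__ == "__main__":
    if not python_version_ok(sys.version_info):
        print("Warning: this Python version is outside the suggested range.")
```

Running the file before `pip install -r requirements.txt` prints a warning when the interpreter falls outside that range and stays silent otherwise.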

Videos of the Kuka environments can be found here: https://videoviewsite.wixsite.com/gc-hgg
A parallel implementation of GC-HGG can be found in the concurrency branch.

New Kuka Environments

Training under different environments

The following commands train the agent in the new Kuka environments with HER, HGG, G-HGG, GC-HGG, and CHER.

Kuka Environments

## KukaReach

#HER
python train.py --tag 400 --learn normal --env KukaReach-v1
#GC-HGG
python train.py --tag 410 --learn hgg --env KukaReach-v1 --curriculum True --stop_hgg_threshold 0.3
#CHER
python train.py --tag 450 --learn normal --env KukaReach-v1 --curriculum True --batch_size 64 --buffer_size 500 --epoch 10

## KukaPickAndPlaceObstacle

#HGG
python train.py --tag 510 --learn hgg --env KukaPickAndPlaceObstacle-v1 --stop_hgg_threshold 0.3
#GC-HGG
python train.py --tag 520 --learn hgg --env KukaPickAndPlaceObstacle-v1 --graph True --n_x 11 --n_y 11 --n_z 7 --stop_hgg_threshold 0.9 --curriculum True
#G-HGG
python train.py --tag 530 --learn hgg --env KukaPickAndPlaceObstacle-v1 --graph True --n_x 11 --n_y 11 --n_z 7 --stop_hgg_threshold 0.9
#CHER
python train.py --tag 550 --learn normal --env KukaPickAndPlaceObstacle-v1 --curriculum True --batch_size 64 --buffer_size 500

## KukaPickNoObstacle

#HGG
python train.py --tag 610 --learn hgg --env KukaPickNoObstacle-v1 --stop_hgg_threshold 0.3
#GC-HGG
python train.py --tag 620 --learn hgg --env KukaPickNoObstacle-v1 --graph True --n_x 31 --n_y 31 --n_z 15 --stop_hgg_threshold 0.5 --curriculum True
#G-HGG
python train.py --tag 630 --learn hgg --env KukaPickNoObstacle-v1 --graph True --n_x 31 --n_y 31 --n_z 15 --stop_hgg_threshold 0.5
#CHER
python train.py --tag 650 --learn normal --env KukaPickNoObstacle-v1 --curriculum True --batch_size 64 --buffer_size 500

## KukaPushNew

#HER
python train.py --tag 1000 --learn normal --env KukaPushNew-v1 --epoch 10
#HGG
python train.py --tag 1010 --learn hgg --env KukaPushNew-v1 --stop_hgg_threshold 0.3 --epoch 10
#GC-HGG
python train.py --tag 1020 --learn hgg --env KukaPushNew-v1 --stop_hgg_threshold 0.3 --epoch 10 --graph True --n_x 5 --n_y 11 --n_z 7 --curriculum True
#G-HGG
python train.py --tag 1030 --learn hgg --env KukaPushNew-v1 --stop_hgg_threshold 0.3 --epoch 10 --graph True --n_x 5 --n_y 11 --n_z 7
#CHER
python train.py --tag 1050 --learn normal --env KukaPushNew-v1 --epoch 10 --curriculum True
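The training commands above all share the same shape, so they can be assembled programmatically when several runs need to be queued. The sketch below is a hypothetical helper (`build_train_command` is not part of the repository); it only concatenates flags that appear in the commands above and does not validate them against `train.py`.

```python
def build_train_command(tag, learn, env, extra=()):
    """Return the argv list for one train.py run.

    `extra` is a sequence of (flag_name, value) pairs, e.g. ("graph", True).
    A sequence is used instead of keyword arguments so the flag order is
    preserved on every Python version.
    """
    cmd = ["python", "train.py",
           "--tag", str(tag), "--learn", learn, "--env", env]
    for flag, value in extra:
        cmd += ["--" + flag, str(value)]
    return cmd


# Example: the G-HGG run for KukaPickNoObstacle listed above.
g_hgg_cmd = build_train_command(
    630, "hgg", "KukaPickNoObstacle-v1",
    extra=[("graph", True), ("n_x", 31), ("n_y", 31), ("n_z", 15),
           ("stop_hgg_threshold", 0.5)],
)
print(" ".join(g_hgg_cmd))
```

The resulting list can be passed to `subprocess.run(g_hgg_cmd, check=True)` from the repository root to start the run.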

Playing

To watch the agent solve a task with its learned policy, run one of the following commands:

Kuka Environments

# Scheme: python play.py --env env_id --goal custom --play_path log_dir --play_epoch <epoch number, latest or best>

# KukaReach
python play.py --env KukaReach-v1 --play_path log/400-ddpg-KukaReach-v1-normal --play_epoch best

# KukaPickAndPlaceObstacle
python play.py --env KukaPickAndPlaceObstacle-v1 --play_path log/520-ddpg-KukaPickAndPlaceObstacle-v1-hgg-graph-stop-curriculum --play_epoch best

# KukaPickNoObstacle
python play.py --env KukaPickNoObstacle-v1 --play_path log/620-ddpg-KukaPickNoObstacle-v1-hgg-graph-stop-curriculum --play_epoch best

# KukaPushNew
python play.py --env KukaPushNew-v1 --play_path log/1020-ddpg-KukaPushNew-v1-hgg-graph-stop-curriculum --play_epoch best
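The play commands follow the scheme shown above, with `--play_epoch` taking an epoch number, `latest`, or `best`. As a minimal sketch, the hypothetical helper below (not part of the repository) builds such a command and rejects other `--play_epoch` values early. It mirrors the example commands and therefore omits the `--goal` option from the scheme; that `play.py` itself enforces the same constraint is an assumption.

```python
def build_play_command(env, play_path, play_epoch="best"):
    """Return the argv list for one play.py invocation.

    play_epoch must be "latest", "best", or a non-negative epoch number,
    following the scheme line above.
    """
    epoch = str(play_epoch)
    if epoch not in ("latest", "best") and not epoch.isdigit():
        raise ValueError("play_epoch must be an epoch number, 'latest' or 'best'")
    return ["python", "play.py",
            "--env", env,
            "--play_path", play_path,
            "--play_epoch", epoch]


# Example: replay the best KukaReach policy from the HER run (tag 400).
reach_cmd = build_play_command("KukaReach-v1", "log/400-ddpg-KukaReach-v1-normal")
print(" ".join(reach_cmd))
```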
