Skip to content

environments for reinforcement learning based on panda-gym

Notifications You must be signed in to change notification settings

zeyang23/panda_robot_task

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

66 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

panda_robot_task

Some custom-designed environments for reinforcement learning based on panda-gym.

Pre-trained agents are provided, some of which are from original environments in panda-gym.

May serve as a tutorial for customizing your own environments.

requirements

panda-gym == 2.0.1

stable-baselines3 == 1.5.1a5

sb3-contrib == 1.5.1a5

trained agents

Files can be found under the "trained_agents" folder.

To enjoy the trained agents, run xxx_show.py.

To re-train an agent, run xxx_train.py.

My_PandaReachJointsDense

control-type: joints

gif

image

log

image

Two_PandaReachDense

The robot is supposed to pass two points successively.

gif

image

log

image

Three_PandaReachDense

The robot is supposed to pass three points successively.

gif

image

log

image

PandaPush

gif

image

log

image

Two_PandaPushDense

The robot is supposed to push the object to pass two points successively.

gif

image

log

image

Two_Obj_PandaPushDense

The robot is supposed to push two object to their targets respectively.

gif

image

log

image

My_PandaSlideDense

gif

image

log

image

PandaPickAndPlace

The robot learns to push the object when its target point is on the table and pick the object when its target point is in the air.

This is different from the robot trained by dense reward.

gif

image

log

image

PandaPickAndPlaceDense

gif

image

log

image

My_TwoPickAndPlaceDense

The robot is supposed to pick the object and carry it to pass two points successively.

gif

image

log

image

My_PandaReachPlateJointsDense

The robot is supposed to reach a point while balancing a ball on its plate.

control-type: joints

gif

image

log

image

My_TwoPandaReachPlateJointsDense

The robot is supposed to reach two points successively while balancing a ball on its plate.

control-type: joints

gif

image

log

image

About

environments for reinforcement learning based on panda-gym

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages