RLBaseLine (SKRL)#

1.  Demos#

1.1.  Gym tasks#

The arguments in example_GymTasks_SKRL.py:

gym_task_name = 'Pendulum-v1'
# Available tasks:
# Classic: ['Acrobot-v1', 'CartPole-v1', 'MountainCarContinuous-v0', 'MountainCar-v0', 'Pendulum-v1']
# Box2D: ['BipedalWalker-v3', 'CarRacing-v1', 'LunarLander-v2']  `pip install gymnasium[box2d]`
# MuJoCo: ['Ant-v2', 'HalfCheetah-v2', 'Hopper-v2', 'Humanoid-v2', 'InvertedDoublePendulum-v2',
#          'InvertedPendulum-v2', 'Reacher-v2', 'Swimmer-v2', 'Walker2d-v2']  `pip install -U mujoco-py`

parser.add_argument("--task", type=str, default="Gym_{}".format(gym_task_name))  # Start with 'Gym_'
parser.add_argument("--agent", type=str, default="ppo")  # Available agents: ppo, sac, td3, a2c
parser.add_argument("--render_mode", type=str, default=None)  # Available render_mode: None, "human", "rgb_array"
parser.add_argument("--headless", type=str, default="True")
parser.add_argument("--inference", action="store_true", help="turn to test mode while adding this argument")
parser.add_argument("--ckpt_path", type=str, default=None)

Train gym or gymnasium tasks with SKRL by the following command:

python examples/learning_rl/example_GymTasks_SKRL.py --task Gym_[gym_task_name]

1.2.  Ant#

The arguments in example_Ant_SKRL.py:

gpu_id = 0
parser.add_argument("--task", type=str, default="Ant")
parser.add_argument("--agent", type=str, default="td3")  # Available agents: ppo, sac, td3
parser.add_argument("--num_envs", type=int, default=4096)
parser.add_argument("--sim_device", type=str, default="cuda:{}".format(gpu_id))
parser.add_argument("--rl_device", type=str, default="cuda:{}".format(gpu_id))
parser.add_argument("--graphics_device_id", type=int, default=gpu_id)
parser.add_argument("--headless", type=str, default="True")
parser.add_argument("--inference", action="store_true", help="turn to test mode while adding this argument")
parser.add_argument("--ckpt_path", type=str, default=None)

Train the IsaacGym Ant task with SKRL by the following command:

python examples/learning_rl/example_Ant_SKRL.py

1.3.  CURICabinet#

The arguments in example_CURICabinet_SKRL.py:

gpu_id = 0
parser.add_argument("--task", type=str, default="CURICabinet")
parser.add_argument("--agent", type=str, default="ppo")
parser.add_argument("--num_envs", type=int, default=4096)
parser.add_argument("--sim_device", type=str, default="cuda:{}".format(gpu_id))
parser.add_argument("--rl_device", type=str, default="cuda:{}".format(gpu_id))
parser.add_argument("--graphics_device_id", type=int, default=gpu_id)
parser.add_argument("--headless", type=str, default="True")
parser.add_argument("--inference", action="store_true", help="turn to test mode while adding this argument")
parser.add_argument("--ckpt_path", type=str, default=None)

Train the IsaacGym CURICabinet task with SKRL by the following command:

python examples/learning_rl/example_CURICabinet_SKRL.py

1.4.  FrankaCabinet#

The arguments in example_FrankaCabinet_SKRL.py:

gpu_id = 0
parser.add_argument("--task", type=str, default="FrankaCabinet")
parser.add_argument("--agent", type=str, default="ppo")
parser.add_argument("--num_envs", type=int, default=4096)
parser.add_argument("--sim_device", type=str, default="cuda:{}".format(gpu_id))
parser.add_argument("--rl_device", type=str, default="cuda:{}".format(gpu_id))
parser.add_argument("--graphics_device_id", type=int, default=gpu_id)
parser.add_argument("--headless", type=str, default="True")
parser.add_argument("--inference", action="store_true", help="turn to inference mode while adding this argument")
parser.add_argument("--ckpt_path", type=str, default=None)

Train the IsaacGym FrankaCabinet task with SKRL by the following command:

python examples/learning_rl/example_FrankaCabinet_SKRL.py

1.5.  Humanoid#

The arguments in example_Humanoid_SKRL.py:

parser.add_argument("--task", type=str, default="Humanoid")
parser.add_argument("--agent", type=str, default="PPO")
parser.add_argument("--num_envs", type=int, default=4096)
parser.add_argument("--sim_device", type=str, default="cuda:{}".format(gpu_id))
parser.add_argument("--rl_device", type=str, default="cuda:{}".format(gpu_id))
parser.add_argument("--graphics_device_id", type=int, default=gpu_id)
parser.add_argument("--headless", type=str, default="True")
parser.add_argument("--inference", action="store_true", help="turn to test mode while adding this argument")
parser.add_argument("--ckpt_path", type=str, default=None)

Train the IsaacGym Humanoid task with SKRL by the following command:

python examples/learning_rl/example_Humanoid_SKRL.py

1.6.  HumanoidAMP#

The arguments in example_HumanoidAMP_SKRL.py:

gpu_id = 1
parser.add_argument("--task", type=str, default="HumanoidAMP")
parser.add_argument("--agent", type=str, default="AMP")
parser.add_argument("--num_envs", type=int, default=4096)
parser.add_argument("--sim_device", type=str, default="cuda:{}".format(gpu_id))
parser.add_argument("--rl_device", type=str, default="cuda:{}".format(gpu_id))
parser.add_argument("--graphics_device_id", type=int, default=gpu_id)
parser.add_argument("--headless", type=str, default="True")
parser.add_argument("--inference", action="store_true", help="turn to test mode while adding this argument")
parser.add_argument("--ckpt_path", type=str, default=None)

Train the IsaacGym HumanoidAMP task with SKRL by the following command:

python examples/learning_rl/example_HumanoidAMP_SKRL.py