rofunc.learning.RofuncRL.tasks.isaacgymenv.base.vec_task#
1. Module Contents#
1.1. Classes#
1.2. Functions#
Retrieves param value for the given prop and attr from the sample. |
1.3. Data#
1.4. API#
- rofunc.learning.RofuncRL.tasks.isaacgymenv.base.vec_task.EXISTING_SIM = None#
- rofunc.learning.RofuncRL.tasks.isaacgymenv.base.vec_task.SCREEN_CAPTURE_RESOLUTION = (1027, 768)#
- class rofunc.learning.RofuncRL.tasks.isaacgymenv.base.vec_task.Env(config: Dict[str, Any], rl_device: str, sim_device: str, graphics_device_id: int, headless: bool)[source]#
Bases:
abc.ABC- abstract allocate_buffers()[source]#
Create torch buffers for observations, rewards, actions dones and any additional data.
- abstract step(actions: torch.Tensor) Tuple[Dict[str, torch.Tensor], torch.Tensor, torch.Tensor, Dict[str, Any]][source]#
Step the physics of the environment.
- Args:
actions: actions to apply
- Returns:
Observations, rewards, resets, info Observations are dict of observations (currently only one member called ‘obs’)
- abstract reset() Dict[str, torch.Tensor][source]#
Reset the environment. Returns:
Observation dictionary
- abstract reset_idx(env_ids: torch.Tensor)[source]#
Reset environments having the provided indices. Args:
env_ids: environments to reset
- property observation_space: gym.Space#
Get the environment’s observation space.
- property action_space: gym.Space#
Get the environment’s action space.
- property num_envs: int#
Get the number of environments.
- property num_acts: int#
Get the number of actions in the environment.
- property num_obs: int#
Get the number of observations in the environment.
- class rofunc.learning.RofuncRL.tasks.isaacgymenv.base.vec_task.VecTask(config, rl_device, sim_device, graphics_device_id, headless, virtual_screen_capture: bool = False, force_render: bool = False)[source]#
Bases:
rofunc.learning.RofuncRL.tasks.isaacgymenv.base.vec_task.Env- metadata = None#
- allocate_buffers()[source]#
Allocate the observation, states, etc. buffers.
These are what is used to set observations and states in the environment classes which inherit from this one, and are read in step and other related functions.
- set_sim_params_up_axis(sim_params: isaacgym.gymapi.SimParams, axis: str) int[source]#
Set gravity based on up axis and return axis index.
- Args:
sim_params: sim params to modify the axis for. axis: axis to set sim params for.
- Returns:
axis index for up axis.
- create_sim(compute_device: int, graphics_device: int, physics_engine, sim_params: isaacgym.gymapi.SimParams)[source]#
Create an Isaac Gym sim object.
- Args:
compute_device: ID of compute device to use. graphics_device: ID of graphics device to use. physics_engine: physics engine to use (gymapi.SIM_PHYSX or gymapi.SIM_FLEX) sim_params: sim params to use.
- Returns:
the Isaac Gym sim object.
- get_state()[source]#
Returns the state buffer of the environment (the privileged observations for asymmetric training).
- abstract pre_physics_step(actions: torch.Tensor)[source]#
Apply the actions to the environment (eg by setting torques, position targets).
- Args:
actions: the actions to apply
- abstract post_physics_step()[source]#
Compute reward and observations, reset any environments that require it.
- step(actions: torch.Tensor) Tuple[Dict[str, torch.Tensor], torch.Tensor, torch.Tensor, Dict[str, Any]][source]#
Step the physics of the environment.
- Args:
actions: actions to apply
- Returns:
Observations, rewards, resets, info Observations are dict of observations (currently only one member called ‘obs’)
- zero_actions() torch.Tensor[source]#
Returns a buffer with zero actions.
- Returns:
A buffer of zero torch actions
- reset_idx(env_idx)[source]#
Reset environment with indces in env_idx. Should be implemented in an environment class inherited from VecTask.
- reset()[source]#
Is called only once when environment starts to provide the first observations. Doesn’t calculate observations. Actual reset and observation calculation need to be implemented by user. Returns:
Observation dictionary
- reset_done()[source]#
Reset the environment. Returns:
Observation dictionary, indices of environments being reset