rofunc.learning.RofuncRL.trainers.base_trainer#
1. Module Contents#
1.1. Classes#
1.2. API#
- class rofunc.learning.RofuncRL.trainers.base_trainer.BaseTrainer(cfg: omegaconf.DictConfig, env: Union[gym.Env, gymnasium.Env], device: Optional[Union[str, torch.device]] = None, env_name: Optional[str] = None, inference: bool = False)[source]#
Initialization
- inference_flag = None#
Experiment log directory
- start_time = None#
Evaluation and inference configurations
- eval_rew_mean = 0#
Environment
- train()[source]#
Main training loop.
Reset the environment
- For each step:
Pre-interaction
Obtain action from agent
Interact with environment
Store transition
Reset the environment
Post-interaction
Close the environment