LogReward¶
- class torchrl.trainers.LogReward(logname='r_training', log_pbar: bool = False, reward_key: Optional[Union[str, tuple]] = None)[來源]¶
獎勵記錄器掛鉤。
- 參數:
logname (str, optional) – 要記錄的獎勵名稱。預設值為
"r_training"
。log_pbar (bool, optional) – 如果
True
,獎勵值將記錄在進度列上。預設值為False
。reward_key (str 或 tuple, optional) – 在輸入批次中尋找獎勵的鍵。預設值為
("next", "reward")
範例
>>> log_reward = LogReward(("next", "reward")) >>> trainer.register_op("pre_steps_log", log_reward)