-
Notifications
You must be signed in to change notification settings - Fork 57
Error running PPO baseline in docker #151
Comments
Can you add Make sure you remove this command after you have solved the error. |
@ykarmesh Thank you. Do you have any idea of why this is happening?
|
The policy is predicting NaN actions. This usually happens when:
One of the most common reason to encounter this issue is that the reward received from the Env is NaN. The default reward for both ObjectNav and Instance ImageNav is dependent on the distance to goal measure. When the distance to goal goes to NaN, the reward also goes NaN. The distance to goal is calculated using the Navmesh generated based on the robot configuration. If there is a mismatch between the robot configuration used during episode generation and training, it is possible to encounter this issue. There was a bug related to this issue in the configs present in this repository which I have solved in this commit. Can you try the new config and see if the error goes away. Note: The configs in the Habitat-Lab repository already have the correct parameters, so if you were using config from that repository, my fix is not relevant. |
I've been trying to run the PPO baseline inside the 2023 docker image and I keep getting this error and I don't know how to solve it. Does anyone have a suggestion on what to try next? Please let me know if you need any more information.
The text was updated successfully, but these errors were encountered: