
Learning rate value causes Infinite/nan loss value #26

Open
seo-95 opened this issue Jan 20, 2021 · 1 comment


@seo-95

seo-95 commented Jan 20, 2021

Hi,
first of all, thank you for sharing the code to reproduce your results. I was trying to train the model from scratch on VG using your R-50-updn.yaml configuration and found a potential problem: with the learning rate of 0.02 set in your configs (both the base grids and the base bottom-up ones), the training script produces infinite/NaN loss values, causing training to diverge.
The training works after decreasing the learning rate to 0.002.
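For reference, a minimal sketch of the change that made training stable on my side, assuming the solver section of R-50-updn.yaml uses detectron2's standard config keys (only BASE_LR is the actual change; the rest of the file is not reproduced here):

```yaml
# R-50-updn.yaml (solver excerpt, illustrative): lower the base learning rate
SOLVER:
  BASE_LR: 0.002   # was 0.02; 0.02 gives infinite/NaN losses in my runs
```

If the training script accepts detectron2-style command-line overrides, passing SOLVER.BASE_LR 0.002 through the opts arguments should be an equivalent way to test this without editing the config file.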
I was wondering if there is something I am missing.
Thank you!

@endernewton
Contributor

Have you retried after it failed? Sometimes the randomness can cause that.
