You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have noticed that the repository contains three training scripts (small.sh, medium.sh, and large.sh) meant for different model sizes. However, upon reviewing the scripts, it appears that none of the hyperparameters are varied across these scripts.
Given that these scripts are intended for different model sizes, it is expected that the hyperparameters such as num_channels, batch_size, max_L and others should be adjusted to better suit the specific needs of the small, medium, and large models.
Thank you for addressing this issue.
The text was updated successfully, but these errors were encountered:
I have noticed that the repository contains three training scripts (
small.sh
,medium.sh
, andlarge.sh
) meant for different model sizes. However, upon reviewing the scripts, it appears that none of the hyperparameters are varied across these scripts.Given that these scripts are intended for different model sizes, it is expected that the hyperparameters such as
num_channels
,batch_size
,max_L
and others should be adjusted to better suit the specific needs of the small, medium, and large models.Thank you for addressing this issue.
The text was updated successfully, but these errors were encountered: