fix(hybrid optim): fp32_grad not scaled when use offload_cpu #1261
Job | Run time |
---|---|
1m 31s | |
1m 42s | |
1m 52s | |
2m 1s | |
1m 41s | |
1m 41s | |
1m 44s | |
1m 45s | |
1m 28s | |
3m 18s | |
1m 41s | |
20m 24s |
Job | Run time |
---|---|
1m 31s | |
1m 42s | |
1m 52s | |
2m 1s | |
1m 41s | |
1m 41s | |
1m 44s | |
1m 45s | |
1m 28s | |
3m 18s | |
1m 41s | |
20m 24s |