We found this benchmark here:
https://github.com/diux-dev/cluster/tree/master/pytorch_distributed_benchmark
It will be interesting to check whether we observe similar speed, and the code is probably useful too.
Note that their benchmark measures only raw all-reduce communication, with no learning, so it is mainly relevant when training is communication-bound. We will likely hit that scenario soon when training linear models; a rough sketch of such a benchmark is below.
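For reference, here is a minimal sketch of a raw all-reduce benchmark of the kind described, not taken from the linked repo: it assumes launch via `torchrun`, uses the Gloo backend (swap in NCCL on GPU clusters), and the tensor size and iteration count are illustrative only.

```python
# Minimal raw all-reduce benchmark sketch (communication only, no learning).
# Assumptions: launched with `torchrun --nproc_per_node=N allreduce_bench.py`,
# Gloo backend on CPU; sizes and iteration counts are placeholders.
import time

import torch
import torch.distributed as dist


def main():
    dist.init_process_group(backend="gloo")  # use "nccl" for GPU clusters
    rank = dist.get_rank()

    # ~100 MB of float32, roughly the gradient size of a mid-sized model.
    tensor = torch.ones(25 * 1000 * 1000)

    # Warm-up iterations so connection setup does not skew the timing.
    for _ in range(5):
        dist.all_reduce(tensor)

    iters = 20
    start = time.time()
    for _ in range(iters):
        dist.all_reduce(tensor)
    elapsed = time.time() - start

    if rank == 0:
        size_gb = tensor.numel() * tensor.element_size() / 1e9
        print(f"avg all-reduce time: {elapsed / iters:.4f} s "
              f"({size_gb * iters / elapsed:.2f} GB/s per rank)")

    dist.destroy_process_group()


if __name__ == "__main__":
    main()
```

Run with, e.g., `torchrun --nproc_per_node=2 allreduce_bench.py`; since there is no backward pass or optimizer step, the measured time reflects only the communication cost we would be bound by in the linear-model case.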