Reduce memory by using all_gather_into_tensor
#524
Job | Run time |
---|---|
3m 9s | |
3m 0s | |
6m 9s |
all_gather_into_tensor
#524
Job | Run time |
---|---|
3m 9s | |
3m 0s | |
6m 9s |