Reduce memory by using all_gather_into_tensor
#552
Job | Run time |
---|---|
3m 12s | |
3m 9s | |
6m 21s |
all_gather_into_tensor
#552
Job | Run time |
---|---|
3m 12s | |
3m 9s | |
6m 21s |