
Not converge when larger batch size is used in Stage2 #56

Open

NickChang97 opened this issue Aug 23, 2023 · 2 comments
Comments

@NickChang97

Hi, the default batch size is 1. Did you try a larger batch size? In my experiments, when the batch size is >= 4, the loss does not converge to a satisfactory result despite tuning various hyper-parameters.

@Doubiiu
Owner

Doubiiu commented Aug 23, 2023

Hi, I didn't try that in our experiments. I think this VQ stuff is unstable to train, so a 'proper' combination of these hyper-parameters is a must to make it work.
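For context on the hyper-parameter sensitivity being discussed, here is a minimal sketch (in numpy, assuming the standard VQ-VAE formulation; the repo's actual loss and names may differ, and `beta` / `vq_losses` are hypothetical here) of the two coupled loss terms whose balance makes VQ training finicky:

```python
import numpy as np

def vq_losses(z_e, codebook, beta=0.25):
    """Sketch of the standard VQ-VAE quantization losses.

    z_e:      (N, D) encoder outputs
    codebook: (K, D) codebook entries
    beta:     commitment weight, one of the hyper-parameters that must be tuned
    """
    # Squared distance from each encoder vector to every codebook entry,
    # then pick the nearest entry (a straight-through estimator is used
    # for gradients in real training; omitted in this numpy sketch).
    d = ((z_e[:, None, :] - codebook[None, :, :]) ** 2).sum(-1)
    e = codebook[d.argmin(1)]
    # Codebook loss pulls entries toward encoder outputs: ||sg[z_e] - e||^2.
    codebook_loss = ((e - z_e) ** 2).mean()
    # Commitment loss pulls encoder outputs toward entries: beta * ||z_e - sg[e]||^2.
    commit_loss = beta * ((z_e - e) ** 2).mean()
    return codebook_loss, commit_loss
```

If `beta` (and the learning rate, codebook size, etc.) are mismatched, the two terms can fight each other, which is consistent with the instability reported above.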

@yangyifan18

I met the same problem! I trained the VAE with batch size 1, but the results seem to be wrong when testing with batch size 8. This does not make sense, because batch size should not affect test performance. I think there might be something wrong with the view() operation in VectorQuantizer.
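A possible mechanism for the suspected `view()` bug (a numpy sketch of a common PyTorch pitfall; whether the repo's VectorQuantizer actually does this is an assumption): flattening a `(B, C, H, W)` tensor to `(-1, C)` without first moving the channel axis last produces rows that are not per-position channel vectors, and the resulting garbage pattern changes with the batch size.

```python
import numpy as np

# Toy (B, C, H, W) activation tensor.
B, C, H, W = 2, 3, 2, 2
x = np.arange(B * C * H * W).reshape(B, C, H, W)

# Suspect pattern: rows are consecutive memory chunks, NOT channel vectors.
wrong = x.reshape(-1, C)

# Correct pattern: move channels last, then flatten, so each row is one
# spatial position's C-dimensional vector (what a codebook lookup expects).
right = x.transpose(0, 2, 3, 1).reshape(-1, C)

# The first spatial position of the first sample has channel values
# x[0, :, 0, 0] = [0, 4, 8]; only the correct flattening recovers it.
print(right[0])  # [0 4 8]
print(wrong[0])  # [0 1 2] -- three values from the same channel map
```

In PyTorch terms this is the difference between `z.view(-1, C)` and `z.permute(0, 2, 3, 1).contiguous().view(-1, C)`; if the former is used, nearest-neighbor codebook lookups would be computed on scrambled vectors.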
