The batch size setting #8

jinyyy666 · 2017-12-01T20:36:38Z

Hi guys,

Really appreciate your elegent code!

Right now I playing with the batch size setting. For Mnist dataset, when I set the batch size = 1, I will get nan for the weight. I guess that is because the learning rate is too large for the batch = 1. Any ideas why this is happening?

Thanks,

Jimmy

zhxfl · 2017-12-02T04:32:44Z

It's the character of stochastic gradient descent algorithm.

jinyyy666 · 2017-12-03T04:42:39Z

Thanks for the reply! I am googling this issue and finding out that it is quite common when the update is very small. By setting the batch = 1, the update at each time step is very small. But that might lead to numerical instablity as pointed out in this link: https://datascience.stackexchange.com/questions/15962/why-is-learning-rate-causing-my-neural-networks-weights-to-skyrocket

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

The batch size setting #8

The batch size setting #8

jinyyy666 commented Dec 1, 2017

zhxfl commented Dec 2, 2017

jinyyy666 commented Dec 3, 2017

The batch size setting #8

The batch size setting #8

Comments

jinyyy666 commented Dec 1, 2017

zhxfl commented Dec 2, 2017

jinyyy666 commented Dec 3, 2017