
Is there any error about the softmax operation? #10

Open
donnyyou opened this issue Jan 2, 2018 · 5 comments

donnyyou commented Jan 2, 2018

The paper says "the coupling coefficients between capsule i and all the capsules in the layer above sum to 1", so the softmax should be computed along the output-capsule dimension, but here it is computed along the route-node dimension.
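To make the dimension question concrete, here is a minimal sketch of which axis each choice normalizes over. The logit shape (num_capsules=10, batch=1, num_route_nodes=1152, 1, 1) is my assumption about how the routing logits are laid out, not taken verbatim from the repo:

```python
import torch
import torch.nn.functional as F

# Assumed shape of the routing logits: (num_capsules=10, batch=1, num_route_nodes=1152, 1, 1)
logits = torch.zeros(10, 1, 1152, 1, 1)

# Softmax along the route-node axis ("along the channel of route nodes"):
# the 1152 coefficients feeding ONE output capsule sum to 1.
c_route = F.softmax(logits, dim=2)
print(c_route[0, 0, :, 0, 0].sum())  # tensor(1.)

# Softmax along the output-capsule axis (what the quoted sentence describes):
# the 10 coefficients attached to ONE lower-level capsule i sum to 1.
c_caps = F.softmax(logits, dim=0)
print(c_caps[:, 0, 0, 0, 0].sum())   # tensor(1.)
```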

@BoPang1996

I think there is a problem here too. In my view, the "dim" parameter of the softmax should be 0.


janericlenssen commented Apr 3, 2018

I second that. I think it should be dim=0. However, it does not train successfully if I change it.


zzzz94 commented May 7, 2018

I agree with @mrjel. But when I set dim=0, changed line 55 to
self.route_weights = nn.Parameter(0.01 * torch.randn(num_capsules, num_route_nodes, in_channels, out_channels))
and removed line 108 (maybe not necessary), I got 99.27% accuracy on the test set (epoch 5). See the sketch below.
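For reference, a sketch of the two changes as I understand them. The line numbers refer to this repo; the sizes (10 digit capsules, 1152 primary capsules, 8-D inputs, 16-D outputs) and the logit shape are the usual MNIST CapsNet values and are assumptions on my side:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Assumed MNIST CapsNet sizes: 10 digit capsules, 6*6*32 = 1152 primary
# capsules, 8-D inputs, 16-D outputs.
num_capsules, num_route_nodes, in_channels, out_channels = 10, 1152, 8, 16

# Change 1 (line 55): initialize the routing weights with a 0.01 scale.
route_weights = nn.Parameter(
    0.01 * torch.randn(num_capsules, num_route_nodes, in_channels, out_channels))

# Change 2: in the routing loop, normalize the logits over the
# output-capsule axis (dim=0) instead of the route-node axis.
logits = torch.zeros(num_capsules, 1, num_route_nodes, 1, 1)
probs = F.softmax(logits, dim=0)  # each lower capsule's 10 coefficients sum to 1
```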

@tengteng95

@zzzz94 Hi, thanks for your nice suggestions. I wonder why we need to set route_weights to relatively lower values by multiplying by 0.01?
When I set dim=0 and remove line 108, the net behaves like a random guess and the accuracy is ~10%. However, it works well with the lower initial values for route_weights.

Looking forward to your response!

@CoderHHX

@zzzz94 Thanks for your solution, first of all. @h982639009 Notice that before the change the softmax normalizes c_ij over a dimension of size 1152 (the route nodes), whereas with dim = 0 it normalizes over a dimension of size 10 (the capsules), so each coupling coefficient becomes much larger. The 0.01 factor is a small trick that keeps the routed inputs at a similar magnitude. I think this problem could also be solved by enlarging the learning rate at the start.
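A quick way to see the magnitude argument (sizes assumed as above): the logits start at zero, so the softmax is uniform, and each coefficient jumps from about 1/1152 to 1/10 when the normalization axis changes, roughly a 100x increase unless the weights (or learning rate) compensate:

```python
import torch
import torch.nn.functional as F

# The initial routing logits are all zeros, so the softmax is uniform.
print(F.softmax(torch.zeros(1152), dim=0)[0].item())  # ~0.00087 (normalizing over 1152 route nodes)
print(F.softmax(torch.zeros(10), dim=0)[0].item())    # 0.1      (normalizing over the 10 capsules)
```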
