-
Notifications
You must be signed in to change notification settings - Fork 144
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
-- convergence: nb of training epochs, ... #10
Comments
Hi jc-Rosichini, I also faced difficulty in reproducing the results. Thanks. |
Hi Felix, Here is the config: As for the error CUDNN_STATUS_BAD_PARAM, I went through it too, and it was fixed by using the latest Nvidia driver For the data loader I'm merely using the camvid dataset and have the rgb in the range [0,1]. I tend to think it's necessary to have a pre-training (using an auto-encoder approach?) in order to hope getting the accuracy Simon refers to. I'll put some more thinking into it when I have some time. |
Hi jc-Rosichini, Thanks for your quick reply. Also, I tried balancing the class as well. It works better for mIoU but slightly worse for accuracy. The numbers I reported were in fact the one that balancing the class. Though pre-training sounds reasonable to me, I remember that the authors emphasized that there is neither further post-processing nor pre-training in the paper. |
Hi Felix, |
Hi, thanks a lot for your message, it's a very important feedback as the method needs different to be validated on different datasets. Unfortunately I don't have a lot of time to work on FC-DenseNets these days but I'll try to answer. We used float32, and easily get convergence on the CamVid dataset. We made a first attempt on PascalVOC and it did not converge until we use Adam optimizer but it ended with poor results. The 'optimizer_including=fusion' won't change anything on the results, it's juste a way to add an optimization during the compilation of the theano graph (the fast run mode doesn't work because of the important number of skip connections which was not took into account in theano so far). I don't remember the number of epochs needed for convergence sorry. Pretraining would be great sure ! Please consider to do more experiments on the batch norm layer (maybe try batch renormalization because out batch size is small ?). As you saw we don't use it in a standard way, and several authors in segmentation don't use moving averages neither. |
Thanks Simon, |
@SimJeg what adam parameters did you use, and though you mentioned they are poor do you have the result numbers? |
Hi,
Just tried to run your model but unfortunately I'm far from getting the same accuracy.
Could you please provide some additional infos on:
Thanks for your feedback.
The text was updated successfully, but these errors were encountered: