Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix bandit.py #63

Open
wants to merge 16 commits into
base: master
Choose a base branch
from

Conversation

RafaelSterzinger
Copy link

Changed the default observation of bandits to 1. By default, almost all biases are set to 0 during initialization. Combining this with inputting always 0 will cause problems during training/testing, i.e. gradients will be all 0 except for the biases of the last layer. By changing the default value to 1 will allow the network to also update the weights of the hidden layers and not only the biases of the last layer. After applying this change, bandits worked as expected.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants