You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thank you for maintaining this repository and the effort you've put into it. While working with the model, I encountered an issue related to the softmax function in the _coordinate_selection function. Specifically, the softmax output often becomes extremely saturated, where only one element in the position_probs tensor is 1, and all others are 0. This behavior is unexpected and may be causing problems with selecting edit positions.
After applying softmax(dim=-1) to the position_probs tensor, the output shows only one element with a value of 1, while all others are 0.
As a result, the element with a value of 1 is always selected, and the other edit positions are randomly chosen, which is likely not the desired outcome.
If my is_corrupted tensor is targeting a specific region, such as the first half of the tokenized_seq, I noticed that my sequence is still changing in the second half.
Exp:
Please feel free to reach out if further clarification is needed.
Best regards.
The text was updated successfully, but these errors were encountered:
Thanks for raising the issue, I just pushed a change that normalizes the attributions before softmaxing which should alleviate this issue. Note that you can also avoid this behavior if it persists in the most recent version by increasing the feature_attr_temp value.
Would you mind confirming the fix resolves your issue?
Hello, I believe the previous issue has been resolved, but I have another question regarding the constrain_fn used for optimization. Would you mind open-sourcing the part related to the edit budget in constrain_fn? Thank you very much!
Hello,
Thank you for maintaining this repository and the effort you've put into it. While working with the model, I encountered an issue related to the softmax function in the
_coordinate_selection
function. Specifically, the softmax output often becomes extremely saturated, where only one element in theposition_probs
tensor is 1, and all others are 0. This behavior is unexpected and may be causing problems with selecting edit positions.Issue Details:
_coordinate_selection
function.softmax(dim=-1)
to theposition_probs
tensor, the output shows only one element with a value of 1, while all others are 0.is_corrupted
tensor is targeting a specific region, such as the first half of the tokenized_seq, I noticed that my sequence is still changing in the second half.Exp:
Please feel free to reach out if further clarification is needed.
Best regards.
The text was updated successfully, but these errors were encountered: