Issue with Softmax in _coordinate_selection Leading to Saturated Outputs #15

dezhi0730 · 2024-09-26T07:14:07Z

Hello,

Thank you for maintaining this repository and the effort you've put into it. While working with the model, I encountered an issue related to the softmax function in the _coordinate_selection function. Specifically, the softmax output often becomes extremely saturated, where only one element in the position_probs tensor is 1, and all others are 0. This behavior is unexpected and may be causing problems with selecting edit positions.

Issue Details:

The issue occurs in the _coordinate_selection function.
After applying softmax(dim=-1) to the position_probs tensor, the output shows only one element with a value of 1, while all others are 0.
As a result, the element with a value of 1 is always selected, and the other edit positions are randomly chosen, which is likely not the desired outcome.
If my is_corrupted tensor is targeting a specific region, such as the first half of the tokenized_seq, I noticed that my sequence is still changing in the second half.

Exp:

Please feel free to reach out if further clarification is needed.

Best regards.

The text was updated successfully, but these errors were encountered:

samuelstanton · 2024-10-07T18:13:51Z

Thanks for raising the issue, I just pushed a change that normalizes the attributions before softmaxing which should alleviate this issue. Note that you can also avoid this behavior if it persists in the most recent version by increasing the feature_attr_temp value.

Would you mind confirming the fix resolves your issue?

https://github.com/prescient-design/cortex/blob/main/cortex/optim/generative/_lambo.py#L264

dezhi0730 · 2024-10-17T15:33:07Z

Hello, I believe the previous issue has been resolved, but I have another question regarding the constrain_fn used for optimization. Would you mind open-sourcing the part related to the edit budget in constrain_fn? Thank you very much!

samuelstanton self-assigned this Oct 7, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issue with Softmax in _coordinate_selection Leading to Saturated Outputs #15

Issue with Softmax in _coordinate_selection Leading to Saturated Outputs #15

dezhi0730 commented Sep 26, 2024

samuelstanton commented Oct 7, 2024 •

edited

Loading

dezhi0730 commented Oct 17, 2024

Issue with Softmax in _coordinate_selection Leading to Saturated Outputs #15

Issue with Softmax in _coordinate_selection Leading to Saturated Outputs #15

Comments

dezhi0730 commented Sep 26, 2024

Issue Details:

Exp:

samuelstanton commented Oct 7, 2024 • edited Loading

dezhi0730 commented Oct 17, 2024

samuelstanton commented Oct 7, 2024 •

edited

Loading