feature: add aligned samples for completion prompt strategy #687
By default, data is chopped into samples, with the last sample right-padded to fill up the context. This often produces a final sample for each text that contains only a short stretch of text followed by a long run of pads. While this is fine from a training perspective, real-life completion tasks far more commonly give you only a short amount of starting text. This PR flips the padding side, when possible, so that the very first sample begins with padding tokens followed by the starting text, and the remaining samples are then perfectly aligned with the sequence length so that no further padding is required.
In other words, we end up with filled-up context sizes for all samples except the first one in each input, which now looks like
[PAD] [PAD] [PAD] ... [PAD] Once upon a time, there was a
whereas we would normally end up with filled-up context sizes for all samples except the last one in each input, which looks like
happily ever after.[PAD] [PAD] [PAD] ...[PAD] [PAD] [PAD]
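To make the alignment concrete, here is a minimal sketch of the idea (not the actual code in this PR): the token stream is padded on the left so its length becomes a multiple of the sequence length, and then every subsequent chunk lines up exactly with the context size. The names `chop_aligned`, `chop_right_padded`, and `PAD_ID` are illustrative assumptions, not identifiers from this repository.

```python
PAD_ID = 0  # assumed pad token id, for illustration only


def chop_aligned(token_ids, seq_len):
    """Split token_ids into seq_len-sized samples, left-padding the first one."""
    remainder = len(token_ids) % seq_len
    if remainder:
        # Prepend pads so the total length is a multiple of seq_len;
        # only the first sample carries padding, all later samples are full.
        token_ids = [PAD_ID] * (seq_len - remainder) + list(token_ids)
    return [token_ids[i : i + seq_len] for i in range(0, len(token_ids), seq_len)]


def chop_right_padded(token_ids, seq_len):
    """The default behaviour: right-pad the last sample instead."""
    remainder = len(token_ids) % seq_len
    if remainder:
        token_ids = list(token_ids) + [PAD_ID] * (seq_len - remainder)
    return [token_ids[i : i + seq_len] for i in range(0, len(token_ids), seq_len)]


if __name__ == "__main__":
    tokens = list(range(1, 11))  # 10 fake token ids
    print(chop_aligned(tokens, 4))       # [[0, 0, 1, 2], [3, 4, 5, 6], [7, 8, 9, 10]]
    print(chop_right_padded(tokens, 4))  # [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 0, 0]]
```

Under the aligned scheme only the first chunk contains pads, which mirrors the "short starting text" shape seen at inference time for completion tasks.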
I should note that this approach would most likely benefit instruction-format prompt strategies as well, with the caveat that each sample would need to reach at least the Response part.