You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Ref line assumes the new batch is from a new sentence. Which is fine when trying to predict a list of short text sentences.
However, if we pass a single very long text, the dataloader will split the text into batches.
And since the input is only one sentence, only the predictions of the first batch will be returned. In my case, only 13309 out of 16949 tokens.
Fixing this issue should be done with care as this function is called also to predict a list of sentences.
The text was updated successfully, but these errors were encountered:
ahmadabousetta
changed the title
long text clipped when disambiguated by BERT
Long text clipped when disambiguated by BERT
Jun 6, 2024
camel_tools/camel_tools/disambig/bert/unfactored.py
Line 177 in b496501
Ref line assumes the new batch is from a new sentence. Which is fine when trying to predict a list of short text sentences.
However, if we pass a single very long text, the dataloader will split the text into batches.
And since the input is only one sentence, only the predictions of the first batch will be returned. In my case, only 13309 out of 16949 tokens.
Fixing this issue should be done with care as this function is called also to predict a list of sentences.
The text was updated successfully, but these errors were encountered: