You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This last one also illustrates why filtering out is not that easy.
It would be handy if is was a sequential process, but unfortunately not.
At the moment TICCL-rank process it's input and output in chunks, but we have to change that and store all results so we can filter the above cases out afterwards.
A major change! More memory consuming, and less easy to handle multi threaded.
Some more investigation is needed.
The text was updated successfully, but these errors were encountered:
Might it be possible to shift solving this to the next module, TICCL-chain?
I still consider this quite a major problem. I also consider this closely related to what currently goes wrong with the ngram filtering (I'll probably come back to this in relation to another issue, but not issues 33 and 34, which actually I do not seem to be related at all.
@martinreynaert provided the following examples:
The last entry is undesirable.
The last entry is undesirable.
Idem
Here the second is undesirable.
This last one also illustrates why filtering out is not that easy.
It would be handy if is was a sequential process, but unfortunately not.
At the moment TICCL-rank process it's input and output in chunks, but we have to change that and store all results so we can filter the above cases out afterwards.
A major change! More memory consuming, and less easy to handle multi threaded.
Some more investigation is needed.
The text was updated successfully, but these errors were encountered: