Format: include :l if identical to :correct? #80

nschneid · 2023-04-01T03:16:23Z

Lemmas are specified if distinct from the word form, but what if there is a correction and the lemma is not distinct from the correction? It seems the trees are not consistent on this point. align_tokens.py is adding lemmas from UD despite identical corrections.

Consider insertions and deletions. Is it odd to specify an empty-lemma for a deleted word?

Perhaps the most intuitive policy is that if there is a :correct field, that value takes precedence when deciding whether the lemma needs to be explicit (and any deletion, i.e. :correct "", is assumed to have no lemma).

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Format: include :l if identical to :correct? #80

Format: include :l if identical to :correct? #80

nschneid commented Apr 1, 2023 •

edited

Loading

Format: include :l if identical to :correct? #80

Format: include :l if identical to :correct? #80

Comments

nschneid commented Apr 1, 2023 • edited Loading

nschneid commented Apr 1, 2023 •

edited

Loading