Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add UDT tag #21

Merged
merged 8 commits into from
Apr 2, 2019
Merged

Add UDT tag #21

merged 8 commits into from
Apr 2, 2019

Conversation

lynnporu
Copy link
Member

@lynnporu lynnporu commented Mar 31, 2019

Since MTE XPOS tags are poorly supported in ukrainian UD file, I was advised to use FEATS field in order to hash words by their morphology properties. So, I add this PR which includes UDT (Universal Dependencies Tag) parser class.

This PR relates to this issue: mova-institute/zoloto/issues/277.

UDT format

  • 1st letter: universal POS
  • (2, 3)n-th letters: property
  • (4. 5, 6)n-th letters: value

Example

POS namanicsnomgnneunmsin can be interpreted as:

n
am ani
cs nom
gn neu
nm sim

which is actually means:

{
    "upos": "NOUN",
    "Animacy": "Anim",
    "Case": "Nom",
    "Gender": "Neut",
    "Number": "Sing"
}

@lynnporu lynnporu added new abstract New library code or class do not merge Do not merge this brach even if there's no conflicts labels Mar 31, 2019
@lynnporu lynnporu self-assigned this Mar 31, 2019
@lynnporu lynnporu removed the do not merge Do not merge this brach even if there's no conflicts label Apr 1, 2019
@lynnporu lynnporu requested a review from iasx April 1, 2019 07:55
@lynnporu
Copy link
Member Author

lynnporu commented Apr 1, 2019

Copy link
Member

@iasx iasx left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Oh, great!

@lynnporu lynnporu merged commit 68d0ecc into master Apr 2, 2019
@lynnporu lynnporu deleted the udt branch April 2, 2019 17:19
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
new abstract New library code or class
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants