Flesh out grammatical categories #25

Open
macchiati opened this issue Mar 19, 2024 · 2 comments
Labels
discuss Discussion item

Comments

@macchiati
Member

macchiati commented Mar 19, 2024

We have a set of grammatical categories/features in CLDR that are also used in ICU. It would be very useful to flesh these out so that we have a consistent set of identifiers for grammatical categories, along with lists of which categories apply to which languages and in which scopes.

Currently the data for this is limited:

  1. Nouns & noun clauses: gender, case, definiteness, plurals (cardinals), ordinals, plural ranges.
  2. Two scopes: general and units
  3. Limited locales
    1. gender, case, definiteness: (50) Amharic, Arabic, Armenian, Azerbaijani, Bangla, ... Turkish, Ukrainian, Urdu, Uzbek
    2. plurals: (300+) Afrikaans, Akan, Albanian, Amharic, Anii, Arabic, Aragonese, Armenian, Assamese, Asturian, Asu, Azerbaijani, ... Xhosa, Yakut, Yiddish, Yoruba, Zulu

https://www.unicode.org/cldr/charts/45/grammar/index.html

https://www.unicode.org/cldr/charts/45/supplemental/language_plural_rules.html
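
As a rough illustration of the "which categories apply to which languages, and for which scopes" idea, here is a minimal Python sketch. The names `GrammaticalFeatures`, `FEATURES`, and `applicable_categories` are hypothetical, and the two sample entries are illustrative values only, not copied from CLDR data or any ICU API.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class GrammaticalFeatures:
    """Category values for one (language, scope) pair (hypothetical shape)."""
    gender: tuple[str, ...] = ()
    case: tuple[str, ...] = ()
    definiteness: tuple[str, ...] = ()


# Hypothetical lookup table keyed by (language, scope).
# Sample entries are illustrative only; real data would come from CLDR.
FEATURES: dict[tuple[str, str], GrammaticalFeatures] = {
    ("de", "general"): GrammaticalFeatures(
        gender=("feminine", "masculine", "neuter"),
        case=("accusative", "dative", "genitive", "nominative"),
    ),
    ("fr", "general"): GrammaticalFeatures(
        gender=("feminine", "masculine"),
    ),
}


def applicable_categories(language: str, scope: str = "general") -> list[str]:
    """Return the names of categories that have at least one value."""
    features = FEATURES.get((language, scope))
    if features is None:
        # No data recorded for this language/scope combination.
        return []
    return [name for name, values in vars(features).items() if values]


if __name__ == "__main__":
    print(applicable_categories("de"))
    print(applicable_categories("fr"))
    print(applicable_categories("de", "units"))
```

Keying on the (language, scope) pair keeps the two scopes mentioned above (general and units) separate, so a language can list different categories per scope without changing the identifiers themselves.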

@nciric
Contributor

nciric commented Mar 19, 2024

I think this would benefit CLDR/ICU by improving data quality and maybe reducing the data size. It could also help our effort to define which categories we want to tackle.

nciric moved this to In Progress in Inflection scope dashboard Mar 19, 2024
@grhoten
Member

grhoten commented Dec 10, 2024

Projects
Status: In Progress

3 participants