Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Edit language list, remove dialect specificity #848

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

lcjohnso
Copy link
Member

@lcjohnso lcjohnso commented Jun 6, 2024

Towards #845

Current Status: first pass is complete on de-dialect work. Still need to sync with PFE list and remediate any differences. Also: before merge and deploy, cleanup of current translation database entries (i.e., re-assign with-dialect language codes to non-dialect language codes).

@lcjohnso
Copy link
Member Author

Explanation of choices regarding keeping or eliminating dialect specificity: removed most, except in case of Chinese (we already support two dialects as separate locales) and cases where the options differentiate between latin and non-latin characters (e.g., 'iu-latn-ca' and 'iu-cans-ca' for Inuktitut; also Bosnian).

@lcjohnso lcjohnso requested a review from shaunanoordin June 26, 2024 16:49
@lcjohnso lcjohnso marked this pull request as ready for review June 26, 2024 17:00
@shaunanoordin shaunanoordin self-assigned this Jun 26, 2024
Copy link
Member

@shaunanoordin shaunanoordin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

PR Review

This PR updates the the list of languages supported by the Zooniverse Translations website.

  • Languages are now standardised so that, uh, (trying to see what's the most accurate way to phrase this) only the "main parent class" (e.g. English) is used instead of a specific "dialect child class" (e.g. British English), where possible.
  • Exceptions continue to exist where it makes sense, e.g. there's no "main" Chinese language (zh), but there are the Chinese simplified (zh-cn) and Chinese traditional (zh-tw) dialects

Code changes look good, so dev-wise this PR looks fine. Context-wise, I don't see any glaring issues, given my limited understanding of global languages.

As mentioned, follow up is required in 1. PFE (list of languages need to match) and 2. the database (existing languages need to be set to the new standard, e.g. any existing "ne-np" translations need to be "ne" translations.)

Comment on lines 121 to 122
{ value: 'ts', label: 'Xitsonga'},
{ value: 'ts', label: 'Tsonga'},
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Fascinating, there are two possible labels for the Tsonga language. Both are valid, though I'm noting that PFE (as of PR 7134) only uses the 'Xitsonga' label

https://en.wikipedia.org/wiki/Tsonga_language

No changes necessary on either this PR or the PF PR, but worth a follow up discussion.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Duplicate entries allow for more flexible language name search. As implemented in #666, these multiple entries allow users to match via endonym and exonym language names.

For additional related discussion, see: zooniverse/Panoptes-Front-End#6729 (comment), zooniverse/Panoptes-Front-End#6732 (comment)

Note: this multiple label flexibility is a Pandora-only feature, so a single value/label combination must be chosen for the PFE language list. To choose single labels, we defer to the option used or preferred by local speakers.

Comment on lines 109 to 110
{ value: 've', label: 'Tshivenḓa' },
{ value: 've', label: 'Venḓa' },
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Similar to my comments on the Tsonga/Xitsonga labels, there are two possible labels for the Venda language. PFE as of the matching PR 7134, only uses the "Xitsonga" label.

https://en.wikipedia.org/wiki/Venda_language

Again, no changes for this PR or the PFE one, but worth a follow up discussion.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same logic described here applies in this case.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants