CLDR-17582 Cleanup English annotations #3751

Merged

Conversation

macchiati
Member

@macchiati macchiati commented May 24, 2024

CLDR-17582

  • This cleans up the English annotations according to the directions given to vetters.
  • The multiword search keys are split up.
  • Duplicates (or near duplicates), synonyms, and low-value terms are removed,
    • or at least reduced, with a focus on items that had a very large number of annotations.
  • The maximum number of search terms per item is reduced.
  • Other misc. cleanup.

When reviewing, note that breaking up keywords and then alphabetizing means that phrases are distributed. Example:

  • cp="😄"
  • old: awesome | face | grin | grinning face with big eyes | happy | mouth | open | smile | smiling | smiling face with open mouth | teeth | yay
  • new: awesome | big | eyes | face | grin | grinning | happy | mouth | open | smile | smiling | teeth | yay
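
The split-and-alphabetize step can be sketched in a few lines of Python. This is only an illustration, not the tooling used for this PR: the function name `split_keywords` and the choice to drop "with" as a stop word are assumptions made so that the sketch reproduces the example above.

```python
def split_keywords(old_value, stop_words=frozenset({"with"})):
    """Split multiword search keys into single words, dedupe, and alphabetize.

    `stop_words` is a hypothetical parameter; dropping "with" is an assumption
    that happens to match the old/new example in the PR description.
    """
    words = set()
    for phrase in old_value.split("|"):
        for word in phrase.split():
            if word not in stop_words:
                words.add(word)  # the set removes duplicates such as "face"
    return " | ".join(sorted(words))


old = ("awesome | face | grin | grinning face with big eyes | happy | mouth | open"
       " | smile | smiling | smiling face with open mouth | teeth | yay")
print(split_keywords(old))
```

Run on the old value, this yields the new line shown above, which is why "big" and "eyes" end up far apart from "grinning".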

We can't expect a line-by-line review, so please just spot-check for anything that is terrible. We can tweak the English later on before release; the goal here is to adhere more closely to the guidelines, to lessen the chance that vetters will be misled (though we caution them that they need to look at the associations in their own languages, not English!).

  • This PR completes the ticket.

ALLOW_MANY_COMMITS=true

cgaarden
cgaarden previously approved these changes May 24, 2024
Contributor

@cgaarden cgaarden left a comment

Looks good to me

@macchiati
Member Author

macchiati commented May 24, 2024 via email

@stenshamn
Contributor

The term [eyes] doesn't distinguish much among smilies! But I don't think that is a showstopper.

What might be useful is a report that mapped search keyword to emoji?

On Fri, May 24, 2024, 14:49 Fredrik wrote:
> commented on this pull request.
>
> In common/annotations/en.xml:
>
> melting face
> face | flirt | heartbreaker | sexy | slide | tease | wink | winking | winks
> winking face
> - blush | eye | face | glad | satisfied | smile | smiling | smiling face with smiling eyes
> + blush | eye | eyes | face | glad | satisfied | smile | smiling
>
> Not sure if this is the emoji I would expect if searching for "eye" or "eyes"?
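
A keyword-to-emoji report of the kind suggested above could be sketched as an inverted index over the annotation file. The following is a minimal sketch, not an existing CLDR tool: `keyword_report` and `SAMPLE` are hypothetical names, the sample is a tiny hand-made excerpt in the shape of common/annotations/en.xml, and the 😊 code point is assumed from the old keyword "smiling face with smiling eyes".

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical excerpt; the real en.xml has more structure around these elements.
SAMPLE = """<annotations>
<annotation cp="😄">awesome | big | eyes | face | grin | grinning | happy | mouth | open | smile | smiling | teeth | yay</annotation>
<annotation cp="😊">blush | eye | eyes | face | glad | satisfied | smile | smiling</annotation>
<annotation cp="👪">child | family</annotation>
<annotation cp="👪" type="tts">family</annotation>
</annotations>"""


def keyword_report(xml_text):
    """Map each search keyword to the list of emoji that carry it."""
    index = defaultdict(list)
    for node in ET.fromstring(xml_text).iter("annotation"):
        if node.get("type") == "tts":
            continue  # tts entries are names, not search keywords
        for keyword in (node.text or "").split("|"):
            keyword = keyword.strip()
            if keyword:
                index[keyword].append(node.get("cp"))
    return dict(index)


report = keyword_report(SAMPLE)
for keyword in sorted(report):
    print(keyword, "->", " ".join(report[keyword]))
```

In such a report, a keyword like "eyes" that maps to many smileys stands out immediately as one that does little to distinguish them.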

I think before we send this out to translation into some 70-80 languages, we should remove superfluous terms though, no?

Contributor

@stenshamn stenshamn left a comment

I think there are too many superfluous individual terms with no relation to the emoji that will not be beneficial for the function. Before sending this out for translation in all the CLDR languages, we should clean that up to avoid a lot of confusion and wasted efforts, IMHO.

@macchiati
Member Author

> I think there are too many superfluous individual terms with no relation to the emoji that will not be beneficial for the function. Before sending this out for translation in all the CLDR languages, we should clean that up to avoid a lot of confusion and wasted efforts, IMHO.

See what you think now; there is a fair amount of cleanup, so I think it is overall better than what we are showing to translators right now.

@AEApple
Contributor

AEApple commented May 29, 2024

Spec update LGTM

@macchiati
Member Author

Don't worry about the jira-ticket commit; will squash once it is approved.

Contributor

@AEApple AEApple left a comment

LGTM, but I only spot-checked the most recent commit; it would be good if someone else checks as well.

@macchiati macchiati force-pushed the CLDR-17582-Cleanup-English-annotations branch from 912e794 to e5fc491 Compare May 30, 2024 16:59
@jira-pull-request-webhook

Hooray! The files in the branch are the same across the force-push. 😃

~ Your Friendly Jira-GitHub PR Checker Bot

<annotation cp="🫂" type="tts">people hugging</annotation>
<annotation cp="👪">child | family</annotation>
<annotation cp="👪" type="tts">family</annotation>
- <annotation cp="🧑‍🧑‍🧒">family: adult, adult, child</annotation> <!-- 1F9D1 200D 1F9D1 200D 1F9D2 -->
+ <annotation cp="🧑‍🧑‍🧒">adult | child | family:</annotation> <!-- 1F9D1 200D 1F9D1 200D 1F9D2 -->
Contributor

There is a left-over colon after "family" here.

Member Author

Ah, missed that one. I'd like to fix that in a follow-on PR, because I've squashed this already. That PR would also restore Dracula and remove Spock and Vulcan.
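
A left-over colon like the one on "family:" is the sort of thing a small lint pass could flag before merge. The sketch below is an assumption about how such a check might look, not part of the CLDR test suite: `find_stray_punctuation`, `STRAY`, and `SAMPLE` are hypothetical names, and the set of characters treated as stray is a guess.

```python
import xml.etree.ElementTree as ET

# Hypothetical set of characters that should not start or end a search keyword.
STRAY = ":;,"

# Code points taken from the comment in the diff: 1F9D1 200D 1F9D1 200D 1F9D2
FAMILY = "\U0001F9D1\u200D\U0001F9D1\u200D\U0001F9D2"

SAMPLE = f"""<annotations>
<annotation cp="👪">child | family</annotation>
<annotation cp="👪" type="tts">family</annotation>
<annotation cp="{FAMILY}">adult | child | family:</annotation>
</annotations>"""


def find_stray_punctuation(xml_text):
    """Return (cp, keyword) pairs whose keyword starts or ends with stray punctuation."""
    problems = []
    for node in ET.fromstring(xml_text).iter("annotation"):
        if node.get("type") == "tts":
            continue  # tts entries are names, not keyword lists
        for keyword in (node.text or "").split("|"):
            keyword = keyword.strip()
            if keyword and (keyword[0] in STRAY or keyword[-1] in STRAY):
                problems.append((node.get("cp"), keyword))
    return problems


for cp, keyword in find_stray_punctuation(SAMPLE):
    print(f"{cp}: suspicious keyword {keyword!r}")
```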

@macchiati macchiati merged commit 195243c into unicode-org:main May 30, 2024
10 checks passed
@macchiati macchiati deleted the CLDR-17582-Cleanup-English-annotations branch May 30, 2024 17:28