Skip to content

Commit

Permalink
CLDR-17852 Cleanup after TC meeting.
Browse files Browse the repository at this point in the history
  • Loading branch information
macchiati committed May 29, 2024
1 parent 5eb5805 commit 52c117d
Show file tree
Hide file tree
Showing 2 changed files with 69 additions and 29 deletions.
44 changes: 22 additions & 22 deletions common/annotations/en.xml
Original file line number Diff line number Diff line change
Expand Up @@ -762,15 +762,15 @@ annotations.
<annotation cp="😊" type="tts">smiling face with smiling eyes</annotation>
<annotation cp="😇">angel | angelic | angels | blessed | face | fairy | fairytale | fantasy | halo | happy | innocent | peaceful | smile | smiling | spirit | tale</annotation>
<annotation cp="😇" type="tts">smiling face with halo</annotation>
<annotation cp="🥰">3 | adore | crush | face | heart | hearts | I | ily | love | romance | smile | smiling | you</annotation>
<annotation cp="🥰">3 | adore | crush | face | heart | hearts | ily | love | romance | smile | smiling | you</annotation>
<annotation cp="🥰" type="tts">smiling face with hearts</annotation>
<annotation cp="😍">143 | bae | eye | face | feels | heart-eyes | hearts | ily | kisses | love | romance | romantic | smile | xoxo</annotation>
<annotation cp="😍" type="tts">smiling face with heart-eyes</annotation>
<annotation cp="🤩">excited | eyes | face | grinning | smile | star | star-struck | starry-eyed | wow</annotation>
<annotation cp="🤩" type="tts">star-struck</annotation>
<annotation cp="😘">adorbs | bae | blowing | face | flirt | heart | I | ily | kiss | love | lover | miss | morning | muah | romantic | smooch | xoxo | you</annotation>
<annotation cp="😘">adorbs | bae | blowing | face | flirt | heart | ily | kiss | love | lover | miss | muah | romantic | smooch | xoxo | you</annotation>
<annotation cp="😘" type="tts">face blowing a kiss</annotation>
<annotation cp="😗">143 | date | dating | face | flirt | ily | kiss | kisses | kissing | love | smooch | smooches | xoxo | you</annotation>
<annotation cp="😗">143 | date | dating | face | flirt | ily | kiss | love | smooch | smooches | xoxo | you</annotation>
<annotation cp="😗" type="tts">kissing face</annotation>
<annotation cp="">face | happy | outlined | relaxed | smile | smiling</annotation>
<annotation cp="" type="tts">smiling face</annotation>
Expand Down Expand Up @@ -810,11 +810,11 @@ annotations.
<annotation cp="🤐" type="tts">zipper-mouth face</annotation>
<annotation cp="🤨">disapproval | disbelief | distrust | emoji | eyebrow | face | hmm | mild | raised | scepticism | skeptic | skeptical | surprise | what</annotation>
<annotation cp="🤨" type="tts">face with raised eyebrow</annotation>
<annotation cp="😐">awkward | blank | deadpan | expressionless | face | fine | jealous | meh | neutral | oh | shade | straight | uh | unamused | unhappy | unimpressed | whatever</annotation>
<annotation cp="😐">awkward | blank | deadpan | expressionless | face | fine | jealous | meh | neutral | oh | shade | straight | unamused | unhappy | unimpressed | whatever</annotation>
<annotation cp="😐" type="tts">neutral face</annotation>
<annotation cp="😑">awkward | bad | dead | expressionless | face | fine | impressed | inexpressive | jealous | meh | not | oh | omg | straight | uh | unhappy | unimpressed | whatever</annotation>
<annotation cp="😑">awkward | dead | expressionless | face | fine | inexpressive | jealous | meh | not | oh | omg | straight | uh | unhappy | unimpressed | whatever</annotation>
<annotation cp="😑" type="tts">expressionless face</annotation>
<annotation cp="😶">awkward | blank | cant | even | expressionless | face | mouth | mouthless | mute | quiet | secret | silence | silent | speechlessout</annotation>
<annotation cp="😶">awkward | blank | can't | even | expressionless | face | mouth | mouthless | mute | quiet | secret | silence | silent | speechlessout</annotation>
<annotation cp="😶" type="tts">face without mouth</annotation>
<annotation cp="🫥">depressed | disappear | dotted | face | hidden | hide | introvert | invisible | line | meh | whatever | wtv</annotation> <!-- 1FAE5 -->
<annotation cp="🫥" type="tts">dotted line face</annotation>
Expand Down Expand Up @@ -896,7 +896,7 @@ annotations.
<annotation cp="🙁" type="tts">slightly frowning face</annotation>
<annotation cp="">face | frown | frowning | sad</annotation>
<annotation cp="" type="tts">frowning face</annotation>
<annotation cp="😮">believe | don’t | face | forgot | I | mouth | omg | open | shocked | surprised | sympathy | unbelievable | unreal | whoa | woah | wow | you</annotation>
<annotation cp="😮">believe | don’t | face | forgot | mouth | omg | open | shocked | surprised | sympathy | unbelievable | unreal | whoa | wow | you</annotation>
<annotation cp="😮" type="tts">face with open mouth</annotation>
<annotation cp="😯">epic | face | hushed | omg | stunned | surprised | whoa | woah</annotation>
<annotation cp="😯" type="tts">hushed face</annotation>
Expand Down Expand Up @@ -988,11 +988,11 @@ annotations.
<annotation cp="😿" type="tts">crying cat</annotation>
<annotation cp="😾">animal | cat | face | pouting</annotation>
<annotation cp="😾" type="tts">pouting cat</annotation>
<annotation cp="🙈">cant | embarrassed | evil | face | forbidden | forgot | gesture | hide | monkey | no | omg | prohibited | scared | secret | see-no-evil | smh | watch</annotation>
<annotation cp="🙈">can't | embarrassed | evil | face | forbidden | forgot | gesture | hide | monkey | no | omg | prohibited | scared | secret | smh | watch</annotation>
<annotation cp="🙈" type="tts">see-no-evil monkey</annotation>
<annotation cp="🙉">animal | ears | evil | face | forbidden | gesture | hear | hear-no-evil | listen | monkey | no | not | prohibited | secret | shh | tmi</annotation>
<annotation cp="🙉">animal | ears | evil | face | forbidden | gesture | hear | listen | monkey | no | not | prohibited | secret | shh | tmi</annotation>
<annotation cp="🙉" type="tts">hear-no-evil monkey</annotation>
<annotation cp="🙊">animal | evil | face | forbidden | gesture | monkey | no | not | oops | prohibited | quiet | secret | speak | speak-no-evil | stealth</annotation>
<annotation cp="🙊">animal | evil | face | forbidden | gesture | monkey | no | not | oops | prohibited | quiet | secret | speak | stealth</annotation>
<annotation cp="🙊" type="tts">speak-no-evil monkey</annotation>
<annotation cp="💌">heart | letter | love | mail | romance | valentine</annotation>
<annotation cp="💌" type="tts">love letter</annotation>
Expand Down Expand Up @@ -1072,7 +1072,7 @@ annotations.
<annotation cp="💭" type="tts">thought balloon</annotation>
<annotation cp="💤">comic | good | goodnight | night | sleep | sleeping | sleepy | tired | zzz</annotation>
<annotation cp="💤" type="tts">ZZZ</annotation>
<annotation cp="👋">bye | cya | g2g | greetings | gtg | hand | hello | hey | hi | later | outtie | there? | ttfn | ttyl | wave | yo | you</annotation>
<annotation cp="👋">bye | cya | g2g | greetings | gtg | hand | hello | hey | hi | later | outtie | ttfn | ttyl | wave | yo | you</annotation>
<annotation cp="👋" type="tts">waving hand</annotation>
<annotation cp="🤚">back | backhand | hand | raised</annotation>
<annotation cp="🤚" type="tts">raised back of hand</annotation>
Expand Down Expand Up @@ -1106,7 +1106,7 @@ annotations.
<annotation cp="🤞" type="tts">crossed fingers</annotation>
<annotation cp="🫰">&lt;3 | crossed | expensive | finger | hand | heart | index | love | money | snap | thumb</annotation> <!-- 1FAF0 -->
<annotation cp="🫰" type="tts">hand with index finger and thumb crossed</annotation>
<annotation cp="🤟">fingers | gesture | hand | I | ILY | love | love-you | three | you</annotation>
<annotation cp="🤟">fingers | gesture | hand | ILY | love | love-you | three | you</annotation>
<annotation cp="🤟" type="tts">love-you gesture</annotation>
<annotation cp="🤘">finger | hand | horns | rock-on | sign</annotation>
<annotation cp="🤘" type="tts">sign of the horns</annotation>
Expand All @@ -1124,9 +1124,9 @@ annotations.
<annotation cp="👇" type="tts">backhand index pointing down</annotation>
<annotation cp="">finger | hand | index | point | pointing | this | up</annotation>
<annotation cp="" type="tts">index pointing up</annotation>
<annotation cp="🫵">at | finger | hand | index | pointing | poke | the | viewer | you</annotation> <!-- 1FAF5 -->
<annotation cp="🫵">at | finger | hand | index | pointing | poke | viewer | you</annotation> <!-- 1FAF5 -->
<annotation cp="🫵" type="tts">index pointing at the viewer</annotation>
<annotation cp="👍">+1</annotation>
<annotation cp="👍">+1 | good | hand | like | thumb | up | yes</annotation>
<annotation cp="👍" type="tts">thumbs up</annotation>
<annotation cp="👎">-1 | bad | dislike | down | good | hand | no | nope | thumb | thumbs</annotation>
<annotation cp="👎" type="tts">thumbs down</annotation>
Expand All @@ -1138,7 +1138,7 @@ annotations.
<annotation cp="🤛" type="tts">left-facing fist</annotation>
<annotation cp="🤜">fist | right-facing | rightwards</annotation>
<annotation cp="🤜" type="tts">right-facing fist</annotation>
<annotation cp="👏">applause | approval | awesome | clap | congrats | congratulations | done | excited | good | great | hand | homie | job | nice | prayed | well | yay</annotation>
<annotation cp="👏">applause | approval | awesome | clap | congrats | congratulations | excited | good | great | hand | homie | job | nice | prayed | well | yay</annotation>
<annotation cp="👏" type="tts">clapping hands</annotation>
<annotation cp="🙌">celebration | gesture | hand | hands | hooray | praise | raised | raising</annotation>
<annotation cp="🙌" type="tts">raising hands</annotation>
Expand Down Expand Up @@ -1256,11 +1256,11 @@ annotations.
<annotation cp="💁‍♂" type="tts">man tipping hand</annotation>
<annotation cp="💁‍♀">fetch | flick | flip | gossip | hand | sarcasm | sarcastic | sassy | seriously | tipping | whatever | woman</annotation>
<annotation cp="💁‍♀" type="tts">woman tipping hand</annotation>
<annotation cp="🙋">gesture | hand | here | I | know | me | person | pick | question | raise | raising</annotation>
<annotation cp="🙋">gesture | hand | here | know | me | person | pick | question | raise | raising</annotation>
<annotation cp="🙋" type="tts">person raising hand</annotation>
<annotation cp="🙋‍♂">gesture | hand | here | I | know | man | me | pick | question | raise | raising</annotation>
<annotation cp="🙋‍♂">gesture | hand | here | know | man | me | pick | question | raise | raising</annotation>
<annotation cp="🙋‍♂" type="tts">man raising hand</annotation>
<annotation cp="🙋‍♀">gesture | hand | here | I | know | me | pick | question | raise | raising | woman</annotation>
<annotation cp="🙋‍♀">gesture | hand | here | know | me | pick | question | raise | raising | woman</annotation>
<annotation cp="🙋‍♀" type="tts">woman raising hand</annotation>
<annotation cp="🧏">accessibility | deaf | ear | gesture | hear | person</annotation>
<annotation cp="🧏" type="tts">deaf person</annotation>
Expand All @@ -1280,11 +1280,11 @@ annotations.
<annotation cp="🤦‍♂" type="tts">man facepalming</annotation>
<annotation cp="🤦‍♀">again | bewilder | disbelief | exasperation | facepalm | no | not | oh | omg | shock | smh | woman</annotation>
<annotation cp="🤦‍♀" type="tts">woman facepalming</annotation>
<annotation cp="🤷">doubt | dunno | guess | I | idk | ignorance | indifference | knows | maybe | person | shrug | shrugging | whatever | who</annotation>
<annotation cp="🤷">doubt | dunno | guess | idk | ignorance | indifference | knows | maybe | person | shrug | shrugging | whatever | who</annotation>
<annotation cp="🤷" type="tts">person shrugging</annotation>
<annotation cp="🤷‍♂">doubt | dunno | guess | I | idk | ignorance | indifference | knows | man | maybe | shrug | shrugging | whatever | who</annotation>
<annotation cp="🤷‍♂">doubt | dunno | guess | idk | ignorance | indifference | knows | man | maybe | shrug | shrugging | whatever | who</annotation>
<annotation cp="🤷‍♂" type="tts">man shrugging</annotation>
<annotation cp="🤷‍♀">doubt | dunno | guess | I | idk | ignorance | indifference | knows | maybe | shrug | shrugging | whatever | who | woman</annotation>
<annotation cp="🤷‍♀">doubt | dunno | guess | idk | ignorance | indifference | knows | maybe | shrug | shrugging | whatever | who | woman</annotation>
<annotation cp="🤷‍♀" type="tts">woman shrugging</annotation>
<annotation cp="🧑‍⚕">doctor | health | healthcare | nurse | therapist | worker</annotation>
<annotation cp="🧑‍⚕" type="tts">health worker</annotation>
Expand Down Expand Up @@ -3641,7 +3641,7 @@ annotations.
<annotation cp="" type="tts">transgender symbol</annotation>
<annotation cp="">× | cancel | multiplication | multiply | sign | x</annotation>
<annotation cp="" type="tts">multiply</annotation>
<annotation cp="">+</annotation>
<annotation cp="">+ | plus</annotation>
<annotation cp="" type="tts">plus</annotation>
<annotation cp="">- | − | heavy | math | minus | sign</annotation>
<annotation cp="" type="tts">minus</annotation>
Expand Down
54 changes: 47 additions & 7 deletions docs/ldml/tr35-general.md
Original file line number Diff line number Diff line change
Expand Up @@ -2620,28 +2620,68 @@ For more information, see version 5.0 or [UTR #51, Unicode Emoji](https://www.un
<!ATTLIST annotation type (tts) #IMPLIED >
```

There are two kinds of annotations: **short names**, and **keywords**.
There are two kinds of annotations: **short names**, and **search keywords**.

With an attribute `type="tts"`, the value is a **short name**, such as one that can be used for text-to-speech. It should be treated as one of the element values for other purposes.
With an attribute `type="tts"`, the value is a **short name**, such as one that can be used for text-to-speech.
It should be treated as one of the element values for other purposes.

When there is no `type` attribute, the value is a set of **keywords**, delimited by |. Spaces around each element are to be trimmed. The **keywords** are words associated with the character(s) that might be used in searching for the character, or in predictive typing on keyboards. The short name itself can be used as a keyword.
When there is no `type` attribute, the value is a set of **keywords**, delimited by |.
Spaces around each element are to be trimmed.
The **keywords** are words associated with the character(s) that might be used in searching for the character,
or in predictive typing on keyboards. The short name itself can be used as a keyword.

Here is an example from German:

```xml
<annotation cp="👎">schlecht | Hand | Daumen | nach unten</annotation>
<annotation cp="👎">schlecht | Hand | Daumen | nach | unten</annotation>
<annotation cp="👎" type="tts">Daumen runter</annotation>
```

The `cp` attribute value has two formats: either a single string, or if contained within \[\] a UnicodeSet. The latter format can contain multiple code points or strings. A code point pr string can occur in multiple annotation element **cp** values, such as the following, which also contains the "thumbs down" character.
These are intended as search keywords, and not for "triggering" (aka suggesting).

- For triggering, the user is typing out a message and concurrently seeing a few emoji
displayed adjacent to the virtual keyboard. Selecting the emoji adds it to the message.
For example, you mention your birthday while writing, and an emoji cake pops up.
That is typically done with an LLM or similar advanced technology.
- For searching, the user is looking for an emoji in a search box,
and typing in in words that narrow down a displayed set of emoji.
For example, you type 'heart', but that has too many hits, so you add 'blue' and get the set of blue hearts.

### Usage Model

The usage model for the search keywords is:

- The user types one or more words in an emoji search field.
- Each word successively narrows a number of emoji in a results box.
- heart → 🥰 😘 😻 💌 💘 💝 💖 💗 💓 💞 💕 💟 ❣️ 💔 ❤️‍🔥 ❤️‍🩹 ❤️ 🩷 🧡 💛 💚 💙 🩵 💜 🤎 🖤 🩶 🤍 💋 🫰 🫶 🫀 💏 💑 🏠 🏡 ♥️ 🩺
- blue → 🥶 😰 💙 🩵 🫐 👕 👖 📘 🧿 🔵 🟦 🔷 🔹 🏳️‍⚧️
- heart blue → 💙 🩵
- A word with no hits is ignored
- [heart | blue | confabulation] is equivalent to [heart | blue]
- As the user types a word, each character added to the word narrows the results.
- Whenever the list is short enough to scan, the user will mouse-click on the right emoji — so it doesn’t have to be narrowed too far.
- In the following, the user would just click on 🎉 if that works for them.
- celebrate → 🥳 🥂 🎈 🎉 🎊 🪅
- The order of words doesn’t matter.

Multiword search keywords are typically broken up into separate parts,
because that works better with the usage model. So [hand | mouth | omg | open | over] covers the phrase "hand over mouth".

### cp attribute

The `cp` attribute value has two formats: either a single string, or if contained within \[\] a UnicodeSet.
The latter format can contain multiple code points or strings. A code point pr string can occur in multiple annotation element **cp** values, such as the following, which also contains the "thumbs down" character.

```xml
<annotation cp='[☝✊-✍👆-👐👫-👭💁🖐🖕🖖🙅🙆🙋🙌🙏🤘]'>hand</annotation>
```

Both for short names and keywords, values do not have to match between different languages. They should be the most common values that people using _that_ language would associate with those characters. For example, a "black heart" might have the association of "wicked" in English, but not in some other languages.
Both for short names and keywords, values do not have to match between different languages.
They should be the most common values that people using _that_ language would associate with those characters.
For example, a "black heart" might have the association of "wicked" in English, but not in some other languages.

The cp value may contain sequences, but does not contain any Emoji or Text Variant (VS15 & VS16) characters. All such characters should be removed before looking up any short names and keywords.
The cp value may contain sequences, but does not contain any Emoji or Text Variant (VS15 & VS16) characters.
All such characters should be removed before looking up any short names and keywords.

### <a name="SynthesizingNames" href="#SynthesizingNames">Synthesizing Sequence Names</a>

Expand Down

0 comments on commit 52c117d

Please sign in to comment.