Skip to content

Commit

Permalink
UCA 16.0 delta 20
Browse files Browse the repository at this point in the history
From Ken:

unidata-16.0.0d20.txt

poofed the two SHRIIs

allkeys-16.0.0d20.txt

decomps-16.0.0d20.txt

Regenerated. I included a regeneration of decomps.txt, because the
SHRIIs had been weighted as sequences, and so showed up in the list of
artificial decompositions.
  • Loading branch information
markusicu committed May 2, 2024
1 parent 5b73822 commit 8d61588
Show file tree
Hide file tree
Showing 3 changed files with 5 additions and 19 deletions.
16 changes: 3 additions & 13 deletions c/uca/sifter/unidata.txt
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
# unidata-16.0.0.txt
# Date: 2024-04-24, 00:00:00 GMT [KW]
# Date: 2024-04-25, 00:00:00 GMT [KW]
# © 2024 Unicode®, Inc.
# For terms of use, see https://www.unicode.org/terms_of_use.html
#
Expand All @@ -9,8 +9,8 @@
# Default Unicode Collation Element Table (DUCET) for
# the Unicode Collation Algorithm.
#
# Version 16.0.0 draft 19 (Unicode Version: 16.0.0)
# based on Unicode data file UnicodeData-16.0.0d15.txt
# Version 16.0.0 draft 20 (Unicode Version: 16.0.0)
# based on Unicode data file UnicodeData-16.0.0d16.txt
# Ordering for Unicode 16.0
#
# Fields:
Expand Down Expand Up @@ -22067,11 +22067,6 @@ DEFAULT
0C55;TELUGU LENGTH MARK;Mn;;;;;;
0C56;TELUGU AI LENGTH MARK;Mn;;;;;;

# Telugu archaic SHRII.
# Collate as a spelled-out word.

0C5C;TELUGU ARCHAIC SHRII;Lo;<sort> 0C36 0C4D 0C30 0C40;;;;;

# Kannada script begins here

0C85;KANNADA LETTER A;Lo;;;;;;
Expand Down Expand Up @@ -22178,11 +22173,6 @@ DEFAULT
0CD5;KANNADA LENGTH MARK;Mc;;;;;;
0CD6;KANNADA AI LENGTH MARK;Mc;;;;;;

# Kannada archaic SHRII.
# Collate as a spelled-out word.

0CDC;KANNADA ARCHAIC SHRII;Lo;<sort> 0CB6 0CCD 0CB0 0CC0;;;;;

# Malayalam script begins here

# Primary order updated slightly 2009-09-4, based on input
Expand Down
4 changes: 1 addition & 3 deletions unicodetools/data/uca/dev/allkeys.txt
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
# allkeys-16.0.0.txt
# Date: 2024-04-24, 16:25:39 GMT [KW]
# Date: 2024-04-25, 11:04:23 GMT [KW]
# Copyright 2024 Unicode, Inc.
# For terms of use, see https://www.unicode.org/terms_of_use.html
#
Expand Down Expand Up @@ -18637,7 +18637,6 @@ A8FF ; [.2E62.0020.0002] # DEVANAGARI VOWEL SIGN AY
0C32 ; [.2FCB.0020.0002] # TELUGU LETTER LA
0C35 ; [.2FCC.0020.0002] # TELUGU LETTER VA
0C36 ; [.2FCD.0020.0002] # TELUGU LETTER SHA
0C5C ; [.2FCD.0020.0004][.2FE4.0020.0004][.2FC9.0020.0004][.2FD7.0020.0004] # TELUGU ARCHAIC SHRII
0C37 ; [.2FCE.0020.0002] # TELUGU LETTER SSA
0C38 ; [.2FCF.0020.0002] # TELUGU LETTER SA
0C39 ; [.2FD0.0020.0002] # TELUGU LETTER HA
Expand Down Expand Up @@ -18712,7 +18711,6 @@ A8FF ; [.2E62.0020.0002] # DEVANAGARI VOWEL SIGN AY
0CB2 ; [.3013.0020.0002] # KANNADA LETTER LA
0CB5 ; [.3014.0020.0002] # KANNADA LETTER VA
0CB6 ; [.3015.0020.0002] # KANNADA LETTER SHA
0CDC ; [.3015.0020.0004][.302E.0020.0004][.3011.0020.0004][.3021.0020.0004] # KANNADA ARCHAIC SHRII
0CB7 ; [.3016.0020.0002] # KANNADA LETTER SSA
0CB8 ; [.3017.0020.0002] # KANNADA LETTER SA
0CB9 ; [.3018.0020.0002] # KANNADA LETTER HA
Expand Down
4 changes: 1 addition & 3 deletions unicodetools/data/uca/dev/decomps.txt
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
# decomps-16.0.0.txt
# Date: 2024-03-29, 11:14:48 GMT [KW]
# Date: 2024-04-26, 10:58:19 GMT [KW]
# Copyright 2024 Unicode, Inc.
# For terms of use, see https://www.unicode.org/terms_of_use.html
#
Expand Down Expand Up @@ -712,7 +712,6 @@
0C04;;0902 # TELUGU SIGN COMBINING ANUSVARA ABOVE => DEVANAGARI SIGN ANUSVARA
0C3C;;093C # TELUGU SIGN NUKTA => DEVANAGARI SIGN NUKTA
0C48;;0C46 0C56 # TELUGU VOWEL SIGN AI => TELUGU VOWEL SIGN E + TELUGU AI LENGTH MARK
0C5C;<sort>;0C36 0C4D 0C30 0C40 # TELUGU ARCHAIC SHRII => TELUGU LETTER SHA + TELUGU SIGN VIRAMA + TELUGU LETTER RA + TELUGU VOWEL SIGN II
0C5D;<sort>;0C28 0C4D # TELUGU LETTER NAKAARA POLLU => TELUGU LETTER NA + TELUGU SIGN VIRAMA
0C81;;0901 # KANNADA SIGN CANDRABINDU => DEVANAGARI SIGN CANDRABINDU
0C82;;0902 # KANNADA SIGN ANUSVARA => DEVANAGARI SIGN ANUSVARA
Expand All @@ -724,7 +723,6 @@
0CCA;;0CC6 0CC2 # KANNADA VOWEL SIGN O => KANNADA VOWEL SIGN E + KANNADA VOWEL SIGN UU
0CCB;;0CCA 0CD5 # KANNADA VOWEL SIGN OO => KANNADA VOWEL SIGN O + KANNADA LENGTH MARK
0CCB;;0CC6 0CC2 0CD5 # KANNADA VOWEL SIGN OO => KANNADA VOWEL SIGN E + KANNADA VOWEL SIGN UU + KANNADA LENGTH MARK
0CDC;<sort>;0CB6 0CCD 0CB0 0CC0 # KANNADA ARCHAIC SHRII => KANNADA LETTER SHA + KANNADA SIGN VIRAMA + KANNADA LETTER RA + KANNADA VOWEL SIGN II
0CDD;<sort>;0CA8 0CCD # KANNADA LETTER NAKAARA POLLU => KANNADA LETTER NA + KANNADA SIGN VIRAMA
0CF3;;0902 # KANNADA SIGN COMBINING ANUSVARA ABOVE RIGHT => DEVANAGARI SIGN ANUSVARA
0D00;;0902 # MALAYALAM SIGN COMBINING ANUSVARA ABOVE => DEVANAGARI SIGN ANUSVARA
Expand Down

0 comments on commit 8d61588

Please sign in to comment.