Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bengali sign combining anusvara above #886

Draft
wants to merge 10 commits into
base: main
Choose a base branch
from
14 changes: 12 additions & 2 deletions unicodetools/data/ucd/dev/DerivedAge.txt
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
# DerivedAge-16.0.0.txt
# Date: 2024-04-30, 21:48:12 GMT
# DerivedAge-17.0.0.txt

Check warning on line 1 in unicodetools/data/ucd/dev/DerivedAge.txt

View workflow job for this annotation

GitHub Actions / Draft unless approved

Not in the 17.0 pipeline

While the Unicode Technical Committee has provisionally assigned these characters, they have not been accepted for Unicode 17.0, nor for any specific version of Unicode. The Age property values for new characters are likely incorrect right now. They will be recomputed after the UTC accepts their encoding and this pull request is updated for the target version.
# Date: 2024-10-15, 01:00:38 GMT
# © 2024 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
Expand Down Expand Up @@ -2059,4 +2059,14 @@

# Total code points: 5185

# ================================================

# Age=V17_0

# Newly assigned in Unicode 17.0.0 (September, 2025)

0984 ; 17.0 # BENGALI SIGN COMBINING ANUSVARA ABOVE

# Total code points: 1

# EOF
22 changes: 14 additions & 8 deletions unicodetools/data/ucd/dev/DerivedCoreProperties.txt
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
# DerivedCoreProperties-16.0.0.txt
# Date: 2024-05-31, 18:09:32 GMT
# DerivedCoreProperties-17.0.0.txt
# Date: 2024-10-15, 01:01:11 GMT
# © 2024 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
Expand Down Expand Up @@ -368,6 +368,7 @@ FFE9..FFEC ; Math # Sm [4] HALFWIDTH LEFTWARDS ARROW..HALFWIDTH DOWNWARDS A
0972..0980 ; Alphabetic # Lo [15] DEVANAGARI LETTER CANDRA A..BENGALI ANJI
0981 ; Alphabetic # Mn BENGALI SIGN CANDRABINDU
0982..0983 ; Alphabetic # Mc [2] BENGALI SIGN ANUSVARA..BENGALI SIGN VISARGA
0984 ; Alphabetic # Mn BENGALI SIGN COMBINING ANUSVARA ABOVE
0985..098C ; Alphabetic # Lo [8] BENGALI LETTER A..BENGALI LETTER VOCALIC L
098F..0990 ; Alphabetic # Lo [2] BENGALI LETTER E..BENGALI LETTER AI
0993..09A8 ; Alphabetic # Lo [22] BENGALI LETTER O..BENGALI LETTER NA
Expand Down Expand Up @@ -1441,7 +1442,7 @@ FFDA..FFDC ; Alphabetic # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANG
30000..3134A ; Alphabetic # Lo [4939] CJK UNIFIED IDEOGRAPH-30000..CJK UNIFIED IDEOGRAPH-3134A
31350..323AF ; Alphabetic # Lo [4192] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-323AF

# Total code points: 142759
# Total code points: 142760

# ================================================

Expand Down Expand Up @@ -3078,6 +3079,7 @@ FF41..FF5A ; Cased # L& [26] FULLWIDTH LATIN SMALL LETTER A..FULLWIDTH LATIN
0962..0963 ; Case_Ignorable # Mn [2] DEVANAGARI VOWEL SIGN VOCALIC L..DEVANAGARI VOWEL SIGN VOCALIC LL
0971 ; Case_Ignorable # Lm DEVANAGARI SIGN HIGH SPACING DOT
0981 ; Case_Ignorable # Mn BENGALI SIGN CANDRABINDU
0984 ; Case_Ignorable # Mn BENGALI SIGN COMBINING ANUSVARA ABOVE
09BC ; Case_Ignorable # Mn BENGALI SIGN NUKTA
09C1..09C4 ; Case_Ignorable # Mn [4] BENGALI VOWEL SIGN U..BENGALI VOWEL SIGN VOCALIC RR
09CD ; Case_Ignorable # Mn BENGALI SIGN VIRAMA
Expand Down Expand Up @@ -3505,7 +3507,7 @@ E0001 ; Case_Ignorable # Cf LANGUAGE TAG
E0020..E007F ; Case_Ignorable # Cf [96] TAG SPACE..CANCEL TAG
E0100..E01EF ; Case_Ignorable # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256

# Total code points: 2749
# Total code points: 2750

# ================================================

Expand Down Expand Up @@ -7094,6 +7096,7 @@ FFDA..FFDC ; ID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
0972..0980 ; ID_Continue # Lo [15] DEVANAGARI LETTER CANDRA A..BENGALI ANJI
0981 ; ID_Continue # Mn BENGALI SIGN CANDRABINDU
0982..0983 ; ID_Continue # Mc [2] BENGALI SIGN ANUSVARA..BENGALI SIGN VISARGA
0984 ; ID_Continue # Mn BENGALI SIGN COMBINING ANUSVARA ABOVE
0985..098C ; ID_Continue # Lo [8] BENGALI LETTER A..BENGALI LETTER VOCALIC L
098F..0990 ; ID_Continue # Lo [2] BENGALI LETTER E..BENGALI LETTER AI
0993..09A8 ; ID_Continue # Lo [22] BENGALI LETTER O..BENGALI LETTER NA
Expand Down Expand Up @@ -8370,7 +8373,7 @@ FFDA..FFDC ; ID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HAN
31350..323AF ; ID_Continue # Lo [4192] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-323AF
E0100..E01EF ; ID_Continue # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256

# Total code points: 144541
# Total code points: 144542

# ================================================

Expand Down Expand Up @@ -9276,6 +9279,7 @@ FFDA..FFDC ; XID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGU
0972..0980 ; XID_Continue # Lo [15] DEVANAGARI LETTER CANDRA A..BENGALI ANJI
0981 ; XID_Continue # Mn BENGALI SIGN CANDRABINDU
0982..0983 ; XID_Continue # Mc [2] BENGALI SIGN ANUSVARA..BENGALI SIGN VISARGA
0984 ; XID_Continue # Mn BENGALI SIGN COMBINING ANUSVARA ABOVE
0985..098C ; XID_Continue # Lo [8] BENGALI LETTER A..BENGALI LETTER VOCALIC L
098F..0990 ; XID_Continue # Lo [2] BENGALI LETTER E..BENGALI LETTER AI
0993..09A8 ; XID_Continue # Lo [22] BENGALI LETTER O..BENGALI LETTER NA
Expand Down Expand Up @@ -10557,7 +10561,7 @@ FFDA..FFDC ; XID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HA
31350..323AF ; XID_Continue # Lo [4192] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-323AF
E0100..E01EF ; XID_Continue # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256

# Total code points: 144522
# Total code points: 144523

# ================================================

Expand Down Expand Up @@ -10652,6 +10656,7 @@ E01F0..E0FFF ; Default_Ignorable_Code_Point # Cn [3600] <reserved-E01F0>..<rese
0951..0957 ; Grapheme_Extend # Mn [7] DEVANAGARI STRESS SIGN UDATTA..DEVANAGARI VOWEL SIGN UUE
0962..0963 ; Grapheme_Extend # Mn [2] DEVANAGARI VOWEL SIGN VOCALIC L..DEVANAGARI VOWEL SIGN VOCALIC LL
0981 ; Grapheme_Extend # Mn BENGALI SIGN CANDRABINDU
0984 ; Grapheme_Extend # Mn BENGALI SIGN COMBINING ANUSVARA ABOVE
09BC ; Grapheme_Extend # Mn BENGALI SIGN NUKTA
09BE ; Grapheme_Extend # Mc BENGALI VOWEL SIGN AA
09C1..09C4 ; Grapheme_Extend # Mn [4] BENGALI VOWEL SIGN U..BENGALI VOWEL SIGN VOCALIC RR
Expand Down Expand Up @@ -11029,7 +11034,7 @@ FF9E..FF9F ; Grapheme_Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK.
E0020..E007F ; Grapheme_Extend # Cf [96] TAG SPACE..CANCEL TAG
E0100..E01EF ; Grapheme_Extend # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256

# Total code points: 2193
# Total code points: 2194

# ================================================

Expand Down Expand Up @@ -12983,6 +12988,7 @@ ABED ; Grapheme_Link # Mn MEETEI MAYEK APUN IYEK
0951..0957 ; InCB; Extend # Mn [7] DEVANAGARI STRESS SIGN UDATTA..DEVANAGARI VOWEL SIGN UUE
0962..0963 ; InCB; Extend # Mn [2] DEVANAGARI VOWEL SIGN VOCALIC L..DEVANAGARI VOWEL SIGN VOCALIC LL
0981 ; InCB; Extend # Mn BENGALI SIGN CANDRABINDU
0984 ; InCB; Extend # Mn BENGALI SIGN COMBINING ANUSVARA ABOVE
09BC ; InCB; Extend # Mn BENGALI SIGN NUKTA
09BE ; InCB; Extend # Mc BENGALI VOWEL SIGN AA
09C1..09C4 ; InCB; Extend # Mn [4] BENGALI VOWEL SIGN U..BENGALI VOWEL SIGN VOCALIC RR
Expand Down Expand Up @@ -13357,6 +13363,6 @@ FF9E..FF9F ; InCB; Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HA
E0020..E007F ; InCB; Extend # Cf [96] TAG SPACE..CANCEL TAG
E0100..E01EF ; InCB; Extend # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256

# Total code points: 2192
# Total code points: 2193

# EOF
3 changes: 2 additions & 1 deletion unicodetools/data/ucd/dev/EastAsianWidth.txt
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
# EastAsianWidth-16.0.0.txt
# Date: 2024-04-30, 21:48:20 GMT
# Date: 2024-07-25, 11:42:51 GMT
# © 2024 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
Expand Down Expand Up @@ -364,6 +364,7 @@
0980 ; N # Lo BENGALI ANJI
0981 ; N # Mn BENGALI SIGN CANDRABINDU
0982..0983 ; N # Mc [2] BENGALI SIGN ANUSVARA..BENGALI SIGN VISARGA
0984 ; N # Mn BENGALI SIGN COMBINING ANUSVARA ABOVE
0985..098C ; N # Lo [8] BENGALI LETTER A..BENGALI LETTER VOCALIC L
098F..0990 ; N # Lo [2] BENGALI LETTER E..BENGALI LETTER AI
0993..09A8 ; N # Lo [22] BENGALI LETTER O..BENGALI LETTER NA
Expand Down
3 changes: 2 additions & 1 deletion unicodetools/data/ucd/dev/IndicPositionalCategory.txt
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
# IndicPositionalCategory-16.0.0.txt
# Date: 2024-04-30, 21:48:21 GMT
# Date: 2024-07-25, 11:42:52 GMT
# © 2024 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
Expand Down Expand Up @@ -412,6 +412,7 @@ AABB..AABC ; Visual_Order_Left # Lo [2] TAI VIET VOWEL AUE..TAI VIET VOWEL
0951 ; Top # Mn DEVANAGARI STRESS SIGN UDATTA
0955 ; Top # Mn DEVANAGARI VOWEL SIGN CANDRA LONG E
0981 ; Top # Mn BENGALI SIGN CANDRABINDU
0984 ; Top # Mn BENGALI SIGN COMBINING ANUSVARA ABOVE
09FE ; Top # Mn BENGALI SANDHI MARK
0A01..0A02 ; Top # Mn [2] GURMUKHI SIGN ADAK BINDI..GURMUKHI SIGN BINDI
0A47..0A48 ; Top # Mn [2] GURMUKHI VOWEL SIGN EE..GURMUKHI VOWEL SIGN AI
Expand Down
3 changes: 2 additions & 1 deletion unicodetools/data/ucd/dev/IndicSyllabicCategory.txt
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
# IndicSyllabicCategory-16.0.0.txt
# Date: 2024-04-30, 21:48:21 GMT
# Date: 2024-07-25, 11:42:53 GMT
# © 2024 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
Expand Down Expand Up @@ -72,6 +72,7 @@
0900..0902 ; Bindu # Mn [3] DEVANAGARI SIGN INVERTED CANDRABINDU..DEVANAGARI SIGN ANUSVARA
0981 ; Bindu # Mn BENGALI SIGN CANDRABINDU
0982 ; Bindu # Mc BENGALI SIGN ANUSVARA
0984 ; Bindu # Mn BENGALI SIGN COMBINING ANUSVARA ABOVE
09FC ; Bindu # Lo BENGALI LETTER VEDIC ANUSVARA
0A01..0A02 ; Bindu # Mn [2] GURMUKHI SIGN ADAK BINDI..GURMUKHI SIGN BINDI
0A70 ; Bindu # Mn GURMUKHI TIPPI
Expand Down
3 changes: 2 additions & 1 deletion unicodetools/data/ucd/dev/LineBreak.txt
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
# LineBreak-16.0.0.txt
# Date: 2024-07-29, 16:26:55 GMT
# Date: 2024-08-15, 11:06:22 GMT
# © 2024 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
Expand Down Expand Up @@ -310,6 +310,7 @@
0980 ; AL # Lo BENGALI ANJI
0981 ; CM # Mn BENGALI SIGN CANDRABINDU
0982..0983 ; CM # Mc [2] BENGALI SIGN ANUSVARA..BENGALI SIGN VISARGA
0984 ; CM # Mn BENGALI SIGN COMBINING ANUSVARA ABOVE
0985..098C ; AL # Lo [8] BENGALI LETTER A..BENGALI LETTER VOCALIC L
098F..0990 ; AL # Lo [2] BENGALI LETTER E..BENGALI LETTER AI
0993..09A8 ; AL # Lo [22] BENGALI LETTER O..BENGALI LETTER NA
Expand Down
5 changes: 3 additions & 2 deletions unicodetools/data/ucd/dev/PropList.txt
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
# PropList-16.0.0.txt
# Date: 2024-05-31, 18:09:48 GMT
# Date: 2024-08-15, 11:06:49 GMT
# © 2024 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
Expand Down Expand Up @@ -475,6 +475,7 @@ FF41..FF46 ; Hex_Digit # L& [6] FULLWIDTH LATIN SMALL LETTER A..FULLWIDTH L
0962..0963 ; Other_Alphabetic # Mn [2] DEVANAGARI VOWEL SIGN VOCALIC L..DEVANAGARI VOWEL SIGN VOCALIC LL
0981 ; Other_Alphabetic # Mn BENGALI SIGN CANDRABINDU
0982..0983 ; Other_Alphabetic # Mc [2] BENGALI SIGN ANUSVARA..BENGALI SIGN VISARGA
0984 ; Other_Alphabetic # Mn BENGALI SIGN COMBINING ANUSVARA ABOVE
09BE..09C0 ; Other_Alphabetic # Mc [3] BENGALI VOWEL SIGN AA..BENGALI VOWEL SIGN II
09C1..09C4 ; Other_Alphabetic # Mn [4] BENGALI VOWEL SIGN U..BENGALI VOWEL SIGN VOCALIC RR
09C7..09C8 ; Other_Alphabetic # Mc [2] BENGALI VOWEL SIGN E..BENGALI VOWEL SIGN AI
Expand Down Expand Up @@ -858,7 +859,7 @@ FB1E ; Other_Alphabetic # Mn HEBREW POINT JUDEO-SPANISH VARIKA
1F150..1F169 ; Other_Alphabetic # So [26] NEGATIVE CIRCLED LATIN CAPITAL LETTER A..NEGATIVE CIRCLED LATIN CAPITAL LETTER Z
1F170..1F189 ; Other_Alphabetic # So [26] NEGATIVE SQUARED LATIN CAPITAL LETTER A..NEGATIVE SQUARED LATIN CAPITAL LETTER Z

# Total code points: 1495
# Total code points: 1496

# ================================================

Expand Down
5 changes: 3 additions & 2 deletions unicodetools/data/ucd/dev/Scripts.txt
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
# Scripts-16.0.0.txt
# Date: 2024-04-30, 21:48:40 GMT
# Date: 2024-07-25, 11:43:36 GMT
# © 2024 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
Expand Down Expand Up @@ -987,6 +987,7 @@ A8FF ; Devanagari # Mn DEVANAGARI VOWEL SIGN AY
0980 ; Bengali # Lo BENGALI ANJI
0981 ; Bengali # Mn BENGALI SIGN CANDRABINDU
0982..0983 ; Bengali # Mc [2] BENGALI SIGN ANUSVARA..BENGALI SIGN VISARGA
0984 ; Bengali # Mn BENGALI SIGN COMBINING ANUSVARA ABOVE
0985..098C ; Bengali # Lo [8] BENGALI LETTER A..BENGALI LETTER VOCALIC L
098F..0990 ; Bengali # Lo [2] BENGALI LETTER E..BENGALI LETTER AI
0993..09A8 ; Bengali # Lo [22] BENGALI LETTER O..BENGALI LETTER NA
Expand Down Expand Up @@ -1015,7 +1016,7 @@ A8FF ; Devanagari # Mn DEVANAGARI VOWEL SIGN AY
09FD ; Bengali # Po BENGALI ABBREVIATION SIGN
09FE ; Bengali # Mn BENGALI SANDHI MARK

# Total code points: 96
# Total code points: 97

# ================================================

Expand Down
1 change: 1 addition & 0 deletions unicodetools/data/ucd/dev/UnicodeData.txt
Original file line number Diff line number Diff line change
Expand Up @@ -2360,6 +2360,7 @@
0981;BENGALI SIGN CANDRABINDU;Mn;0;NSM;;;;;N;;;;;
0982;BENGALI SIGN ANUSVARA;Mc;0;L;;;;;N;;;;;
0983;BENGALI SIGN VISARGA;Mc;0;L;;;;;N;;;;;
0984;BENGALI SIGN COMBINING ANUSVARA ABOVE;Mn;0;NSM;;;;;N;;;;;
0985;BENGALI LETTER A;Lo;0;L;;;;;N;;;;;
0986;BENGALI LETTER AA;Lo;0;L;;;;;N;;;;;
0987;BENGALI LETTER I;Lo;0;L;;;;;N;;;;;
Expand Down
3 changes: 2 additions & 1 deletion unicodetools/data/ucd/dev/VerticalOrientation.txt
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
# VerticalOrientation-16.0.0.txt
# Date: 2024-04-30, 21:48:42 GMT
# Date: 2024-07-25, 11:43:40 GMT
# © 2024 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
Expand Down Expand Up @@ -298,6 +298,7 @@
0980 ; R # Lo BENGALI ANJI
0981 ; R # Mn BENGALI SIGN CANDRABINDU
0982..0983 ; R # Mc [2] BENGALI SIGN ANUSVARA..BENGALI SIGN VISARGA
0984 ; R # Mn BENGALI SIGN COMBINING ANUSVARA ABOVE
0985..098C ; R # Lo [8] BENGALI LETTER A..BENGALI LETTER VOCALIC L
098F..0990 ; R # Lo [2] BENGALI LETTER E..BENGALI LETTER AI
0993..09A8 ; R # Lo [22] BENGALI LETTER O..BENGALI LETTER NA
Expand Down
7 changes: 4 additions & 3 deletions unicodetools/data/ucd/dev/auxiliary/GraphemeBreakProperty.txt
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
# GraphemeBreakProperty-16.0.0.txt
# Date: 2024-05-31, 18:09:38 GMT
# GraphemeBreakProperty-17.0.0.txt
# Date: 2024-10-15, 01:01:20 GMT
# © 2024 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
Expand Down Expand Up @@ -117,6 +117,7 @@ E01F0..E0FFF ; Control # Cn [3600] <reserved-E01F0>..<reserved-E0FFF>
0951..0957 ; Extend # Mn [7] DEVANAGARI STRESS SIGN UDATTA..DEVANAGARI VOWEL SIGN UUE
0962..0963 ; Extend # Mn [2] DEVANAGARI VOWEL SIGN VOCALIC L..DEVANAGARI VOWEL SIGN VOCALIC LL
0981 ; Extend # Mn BENGALI SIGN CANDRABINDU
0984 ; Extend # Mn BENGALI SIGN COMBINING ANUSVARA ABOVE
09BC ; Extend # Mn BENGALI SIGN NUKTA
09BE ; Extend # Mc BENGALI VOWEL SIGN AA
09C1..09C4 ; Extend # Mn [4] BENGALI VOWEL SIGN U..BENGALI VOWEL SIGN VOCALIC RR
Expand Down Expand Up @@ -495,7 +496,7 @@ FF9E..FF9F ; Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWIDT
E0020..E007F ; Extend # Cf [96] TAG SPACE..CANCEL TAG
E0100..E01EF ; Extend # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256

# Total code points: 2198
# Total code points: 2199

# ================================================

Expand Down
7 changes: 4 additions & 3 deletions unicodetools/data/ucd/dev/auxiliary/SentenceBreakProperty.txt
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
# SentenceBreakProperty-16.0.0.txt
# Date: 2024-07-29, 16:27:32 GMT
# SentenceBreakProperty-17.0.0.txt
# Date: 2024-10-15, 01:02:04 GMT
# © 2024 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
Expand Down Expand Up @@ -71,6 +71,7 @@
0962..0963 ; Extend # Mn [2] DEVANAGARI VOWEL SIGN VOCALIC L..DEVANAGARI VOWEL SIGN VOCALIC LL
0981 ; Extend # Mn BENGALI SIGN CANDRABINDU
0982..0983 ; Extend # Mc [2] BENGALI SIGN ANUSVARA..BENGALI SIGN VISARGA
0984 ; Extend # Mn BENGALI SIGN COMBINING ANUSVARA ABOVE
09BC ; Extend # Mn BENGALI SIGN NUKTA
09BE..09C0 ; Extend # Mc [3] BENGALI VOWEL SIGN AA..BENGALI VOWEL SIGN II
09C1..09C4 ; Extend # Mn [4] BENGALI VOWEL SIGN U..BENGALI VOWEL SIGN VOCALIC RR
Expand Down Expand Up @@ -586,7 +587,7 @@ FF9E..FF9F ; Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWIDT
E0020..E007F ; Extend # Cf [96] TAG SPACE..CANCEL TAG
E0100..E01EF ; Extend # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256

# Total code points: 2601
# Total code points: 2602

# ================================================

Expand Down
7 changes: 4 additions & 3 deletions unicodetools/data/ucd/dev/auxiliary/WordBreakProperty.txt
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
# WordBreakProperty-16.0.0.txt
# Date: 2024-07-29, 16:27:36 GMT
# WordBreakProperty-17.0.0.txt
# Date: 2024-10-15, 01:02:07 GMT
# © 2024 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
Expand Down Expand Up @@ -107,6 +107,7 @@ FB46..FB4F ; Hebrew_Letter # Lo [10] HEBREW LETTER TSADI WITH DAGESH..HEBREW
0962..0963 ; Extend # Mn [2] DEVANAGARI VOWEL SIGN VOCALIC L..DEVANAGARI VOWEL SIGN VOCALIC LL
0981 ; Extend # Mn BENGALI SIGN CANDRABINDU
0982..0983 ; Extend # Mc [2] BENGALI SIGN ANUSVARA..BENGALI SIGN VISARGA
0984 ; Extend # Mn BENGALI SIGN COMBINING ANUSVARA ABOVE
09BC ; Extend # Mn BENGALI SIGN NUKTA
09BE..09C0 ; Extend # Mc [3] BENGALI VOWEL SIGN AA..BENGALI VOWEL SIGN II
09C1..09C4 ; Extend # Mn [4] BENGALI VOWEL SIGN U..BENGALI VOWEL SIGN VOCALIC RR
Expand Down Expand Up @@ -623,7 +624,7 @@ FF9E..FF9F ; Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWIDT
E0020..E007F ; Extend # Cf [96] TAG SPACE..CANCEL TAG
E0100..E01EF ; Extend # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256

# Total code points: 2605
# Total code points: 2606

# ================================================

Expand Down
11 changes: 6 additions & 5 deletions unicodetools/data/ucd/dev/extracted/DerivedBidiClass.txt
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
# DerivedBidiClass-16.0.0.txt
# Date: 2024-04-30, 21:48:13 GMT
# DerivedBidiClass-17.0.0.txt
# Date: 2024-10-15, 01:01:08 GMT
# © 2024 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
Expand Down Expand Up @@ -1214,8 +1214,8 @@ FFDA..FFDC ; L # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL LETTER
F0000..FFFFD ; L # Co [65534] <private-use-F0000>..<private-use-FFFFD>
100000..10FFFD; L # Co [65534] <private-use-100000>..<private-use-10FFFD>

# The above property value applies to 815351 code points not listed here.
# Total code points: 1095513
# The above property value applies to 815350 code points not listed here.
# Total code points: 1095512

# ================================================

Expand Down Expand Up @@ -2082,6 +2082,7 @@ FFFFE..FFFFF ; BN # Cn [2] <noncharacter-FFFFE>..<noncharacter-FFFFF>
0951..0957 ; NSM # Mn [7] DEVANAGARI STRESS SIGN UDATTA..DEVANAGARI VOWEL SIGN UUE
0962..0963 ; NSM # Mn [2] DEVANAGARI VOWEL SIGN VOCALIC L..DEVANAGARI VOWEL SIGN VOCALIC LL
0981 ; NSM # Mn BENGALI SIGN CANDRABINDU
0984 ; NSM # Mn BENGALI SIGN COMBINING ANUSVARA ABOVE
09BC ; NSM # Mn BENGALI SIGN NUKTA
09C1..09C4 ; NSM # Mn [4] BENGALI VOWEL SIGN U..BENGALI VOWEL SIGN VOCALIC RR
09CD ; NSM # Mn BENGALI SIGN VIRAMA
Expand Down Expand Up @@ -2408,7 +2409,7 @@ FE20..FE2F ; NSM # Mn [16] COMBINING LIGATURE LEFT HALF..COMBINING CYRILLIC
1E944..1E94A ; NSM # Mn [7] ADLAM ALIF LENGTHENER..ADLAM NUKTA
E0100..E01EF ; NSM # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256

# Total code points: 2028
# Total code points: 2029

# ================================================

Expand Down
Loading
Loading