diff --git a/unicodetools/data/ucd/dev/auxiliary/LineBreakTest.html b/unicodetools/data/ucd/dev/auxiliary/LineBreakTest.html
index 4c03f89ee..17c644a16 100644
--- a/unicodetools/data/ucd/dev/auxiliary/LineBreakTest.html
+++ b/unicodetools/data/ucd/dev/auxiliary/LineBreakTest.html
@@ -7,7 +7,7 @@
Line_Break Chart
Unicode Version: 16.0.0
-Date: 2024-05-02, 18:47:59 GMT
+Date: 2024-05-03, 13:11:08 GMT
This page illustrates the application of the Line_Break specification. The material here is informative, not normative.
The first chart shows where breaks would appear between different sample characters or strings. The sample characters are chosen mechanically to represent the different properties used by the specification.
Each cell shows the break-status for the position between the character(s) in its row header and the character(s) in its column header. The × symbol indicates no break, while the ÷ symbol indicated a break. The cells with × are also shaded to make it easier to scan the table. For example, in the cell at the intersection of the row headed by “CR” and the column headed by “LF”, there is a × symbol, indicating that there is no break between CR and LF.
Some column headers may be composed, reflecting “treat as” or “ignore” rules.
If your browser handles titles (tooltips), then hovering the mouse over the row header will show a sample character of that type. Hovering over a column header will show the sample character, plus its abbreviated general category and script. Hovering over the intersected cells shows the rule number that produces the break-status. For example, hovering over the cell at the intersection of H3 and JT shows ×, with the rule 26.03. Checking below the table, rule 26.03 is “JT | H3 × JT”, which is the one that applies to that case. Note that a rule is invoked only when no lower-numbered rules have applied.
@@ -125,7 +125,7 @@
21.02 | | × | HY |
21.03 | | × | NS |
21.04 | BB | × | |
-21.1 | HL (HY | BA) | × | |
+21.1 | HL (HY | BA) | × | [^HL] |
21.2 | SY | × | HL |
22.0 | | × | IN |
23.02 | (AL | HL) | × | NU |
@@ -939,7 +939,7 @@
🇷 🇺 □ 🇸 🇪
256 |
- א - א
+ א - א
|
257 |
𑼦 𑼂 𑼭 𑼦 ◌𑽂 𑼦 𑼱 𑽁
@@ -1026,9 +1026,15 @@
A n m e r k u n g : „ W h i t e “ b z w . ‚ 白 人 ‘ – i n d e r A m t l i c h e n S t a t i s t i k
|
285 |
- 🏿
+ □ J o h n ו - M i c h a e l □ ;
|
286 |
+ ו ◌ַ ◌ֽ י ◌ְ ה ◌ִ י ־ כ ◌ֵ ◌ֽ ן ׃
+ |
+287 |
+ 🏿
+ |
+288 |
© 🏿
|
diff --git a/unicodetools/data/ucd/dev/auxiliary/LineBreakTest.txt b/unicodetools/data/ucd/dev/auxiliary/LineBreakTest.txt
index 728c5cb52..9b7e0c13e 100644
--- a/unicodetools/data/ucd/dev/auxiliary/LineBreakTest.txt
+++ b/unicodetools/data/ucd/dev/auxiliary/LineBreakTest.txt
@@ -1,5 +1,5 @@
# LineBreakTest-16.0.0.txt
-# Date: 2024-05-02, 18:48:03 GMT
+# Date: 2024-05-03, 13:11:09 GMT
# © 2024 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -16661,7 +16661,7 @@
× 1F1F7 × 1F1FA ÷ 1F1F8 ÷ # × [0.3] REGIONAL INDICATOR SYMBOL LETTER R (RI_NotEastAsian) × [30.11] REGIONAL INDICATOR SYMBOL LETTER U (RI_NotEastAsian) ÷ [30.13] REGIONAL INDICATOR SYMBOL LETTER S (RI_NotEastAsian) ÷ [0.3]
× 1F1F7 × 1F1FA ÷ 1F1F8 × 1F1EA ÷ # × [0.3] REGIONAL INDICATOR SYMBOL LETTER R (RI_NotEastAsian) × [30.11] REGIONAL INDICATOR SYMBOL LETTER U (RI_NotEastAsian) ÷ [30.13] REGIONAL INDICATOR SYMBOL LETTER S (RI_NotEastAsian) × [30.11] REGIONAL INDICATOR SYMBOL LETTER E (RI_NotEastAsian) ÷ [0.3]
× 1F1F7 × 1F1FA × 200B ÷ 1F1F8 × 1F1EA ÷ # × [0.3] REGIONAL INDICATOR SYMBOL LETTER R (RI_NotEastAsian) × [30.11] REGIONAL INDICATOR SYMBOL LETTER U (RI_NotEastAsian) × [7.02] ZERO WIDTH SPACE (ZW_NotEastAsian) ÷ [8.0] REGIONAL INDICATOR SYMBOL LETTER S (RI_NotEastAsian) × [30.12] REGIONAL INDICATOR SYMBOL LETTER E (RI_NotEastAsian) ÷ [0.3]
-× 05D0 × 002D × 05D0 ÷ # × [0.3] HEBREW LETTER ALEF (HL_NotEastAsian) × [21.02] HYPHEN-MINUS (HY_NotEastAsian) × [21.1] HEBREW LETTER ALEF (HL_NotEastAsian) ÷ [0.3]
+× 05D0 × 002D ÷ 05D0 ÷ # × [0.3] HEBREW LETTER ALEF (HL_NotEastAsian) × [21.02] HYPHEN-MINUS (HY_NotEastAsian) ÷ [999.0] HEBREW LETTER ALEF (HL_NotEastAsian) ÷ [0.3]
× 11F26 ÷ 11F02 × 11F2D ÷ 11F26 × 11F42 × 11F26 ÷ 11F31 × 11F41 ÷ # × [0.3] KAWI LETTER PA (AK_NotEastAsian) ÷ [999.0] KAWI SIGN REPHA (AP_NotEastAsian) × [28.11] KAWI LETTER LA (AK_NotEastAsian) ÷ [999.0] KAWI LETTER PA (AK_NotEastAsian) × [28.12] KAWI CONJOINER (VI_NotEastAsian) × [28.13] KAWI LETTER PA (AK_NotEastAsian) ÷ [999.0] KAWI LETTER SA (AK_NotEastAsian) × [9.0] KAWI SIGN KILLER (CM1_NotEastAsian_CM) ÷ [0.3]
× 1BD7 × 1BEC ÷ 1BD2 × 1BEA × 1BC9 × 1BF3 ÷ 1BC2 × 1BE7 × 1BC9 × 1BF3 ÷ # × [0.3] BATAK LETTER NORTHERN TA (AS_NotEastAsian) × [9.0] BATAK VOWEL SIGN O (CM1_NotEastAsian_CM) ÷ [999.0] BATAK LETTER RA (AS_NotEastAsian) × [9.0] BATAK VOWEL SIGN I (CM1_NotEastAsian_CM) × [28.14] BATAK LETTER NA (AS_NotEastAsian) × [28.12] BATAK PANONGONAN (VF_NotEastAsian) ÷ [999.0] BATAK LETTER HA (AS_NotEastAsian) × [9.0] BATAK VOWEL SIGN E (CM1_NotEastAsian_CM) × [28.14] BATAK LETTER NA (AS_NotEastAsian) × [28.12] BATAK PANONGONAN (VF_NotEastAsian) ÷ [0.3]
× 1B18 ÷ 1B27 × 1B44 × 200C × 1B2B × 1B38 ÷ 1B31 × 1B44 × 1B1D × 1B36 ÷ # × [0.3] BALINESE LETTER CA (AK_NotEastAsian) ÷ [999.0] BALINESE LETTER PA (AK_NotEastAsian) × [28.12] BALINESE ADEG ADEG (VI_NotEastAsian) × [9.0] ZERO WIDTH NON-JOINER (CM1_NotEastAsian_CM) × [28.13] BALINESE LETTER MA (AK_NotEastAsian) × [9.0] BALINESE VOWEL SIGN SUKU (CM1_NotEastAsian_CM) ÷ [999.0] BALINESE LETTER SA SAPA (AK_NotEastAsian) × [28.12] BALINESE ADEG ADEG (VI_NotEastAsian) × [28.13] BALINESE LETTER TA LATIK (AK_NotEastAsian) × [9.0] BALINESE VOWEL SIGN ULU (CM1_NotEastAsian_CM) ÷ [0.3]
@@ -16690,9 +16690,11 @@
× 0032 × 0030 × 0030 × 0030 ÷ 5E74 ÷ 83B7 ÷ 5F97 ÷ 4E86 ÷ 300A × 0049 × 0047 × 004E × 300B ÷ 7684 × 201C × 0042 × 0065 × 0073 × 0074 × 0020 ÷ 0047 × 0061 × 006D × 0065 × 0020 ÷ 0042 × 006F × 0079 × 0020 ÷ 0053 × 0074 × 0072 × 0061 × 0074 × 0065 × 0067 × 0079 × 201D × 5956 × 3002 ÷ # × [0.3] DIGIT TWO (NU_NotEastAsian) × [25.15] DIGIT ZERO (NU_NotEastAsian) × [25.15] DIGIT ZERO (NU_NotEastAsian) × [25.15] DIGIT ZERO (NU_NotEastAsian) ÷ [999.0] CJK UNIFIED IDEOGRAPH-5E74 (ID) ÷ [999.0] CJK UNIFIED IDEOGRAPH-83B7 (ID) ÷ [999.0] CJK UNIFIED IDEOGRAPH-5F97 (ID) ÷ [999.0] CJK UNIFIED IDEOGRAPH-4E86 (ID) ÷ [999.0] LEFT DOUBLE ANGLE BRACKET (OP) × [14.0] LATIN CAPITAL LETTER I (AL_NotEastAsian_AL) × [28.0] LATIN CAPITAL LETTER G (AL_NotEastAsian_AL) × [28.0] LATIN CAPITAL LETTER N (AL_NotEastAsian_AL) × [13.02] RIGHT DOUBLE ANGLE BRACKET (CL) ÷ [999.0] CJK UNIFIED IDEOGRAPH-7684 (ID) × [19.11] LEFT DOUBLE QUOTATION MARK (QU_QU_Pi_QUmPf_NotEastAsian) × [19.02] LATIN CAPITAL LETTER B (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER E (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER S (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER T (AL_NotEastAsian_AL) × [7.01] SPACE (SP_NotEastAsian) ÷ [18.0] LATIN CAPITAL LETTER G (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER A (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER M (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER E (AL_NotEastAsian_AL) × [7.01] SPACE (SP_NotEastAsian) ÷ [18.0] LATIN CAPITAL LETTER B (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER O (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER Y (AL_NotEastAsian_AL) × [7.01] SPACE (SP_NotEastAsian) ÷ [18.0] LATIN CAPITAL LETTER S (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER T (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER R (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER A (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER T (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER E (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER G (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER Y (AL_NotEastAsian_AL) × [19.01] RIGHT DOUBLE QUOTATION MARK (QU_QU_Pf_QUmPi_NotEastAsian) × [19.13] CJK UNIFIED IDEOGRAPH-5956 (ID) × [13.02] IDEOGRAPHIC FULL STOP (CL) ÷ [0.3]
× 005A × 002D × 0031 × 201C × 83B1 ÷ 8D1D ÷ 96F7 ÷ 5E0C ÷ 7279 ÷ 00B7 ÷ 9A6C ÷ 65AF × 201D ÷ 53F7 ÷ 662F ÷ 5FB7 ÷ 56FD ÷ 56FD ÷ 5BB6 ÷ 6D77 ÷ 519B ÷ 66A8 ÷ 6218 ÷ 4E89 ÷ 6D77 ÷ 519B ÷ 4E8E ÷ 0031 × 0039 × 0033 × 0030 ÷ 5E74 ÷ 4EE3 ÷ # × [0.3] LATIN CAPITAL LETTER Z (AL_NotEastAsian_AL) × [21.02] HYPHEN-MINUS (HY_NotEastAsian) × [25.13] DIGIT ONE (NU_NotEastAsian) × [19.1] LEFT DOUBLE QUOTATION MARK (QU_QU_Pi_QUmPf_NotEastAsian) × [19.02] CJK UNIFIED IDEOGRAPH-83B1 (ID) ÷ [999.0] CJK UNIFIED IDEOGRAPH-8D1D (ID) ÷ [999.0] CJK UNIFIED IDEOGRAPH-96F7 (ID) ÷ [999.0] CJK UNIFIED IDEOGRAPH-5E0C (ID) ÷ [999.0] CJK UNIFIED IDEOGRAPH-7279 (ID) ÷ [999.0] MIDDLE DOT (AI_NotEastAsian_AL) ÷ [999.0] CJK UNIFIED IDEOGRAPH-9A6C (ID) ÷ [999.0] CJK UNIFIED IDEOGRAPH-65AF (ID) × [19.01] RIGHT DOUBLE QUOTATION MARK (QU_QU_Pf_QUmPi_NotEastAsian) ÷ [999.0] CJK UNIFIED IDEOGRAPH-53F7 (ID) ÷ [999.0] CJK UNIFIED IDEOGRAPH-662F (ID) ÷ [999.0] CJK UNIFIED IDEOGRAPH-5FB7 (ID) ÷ [999.0] CJK UNIFIED IDEOGRAPH-56FD (ID) ÷ [999.0] CJK UNIFIED IDEOGRAPH-56FD (ID) ÷ [999.0] CJK UNIFIED IDEOGRAPH-5BB6 (ID) ÷ [999.0] CJK UNIFIED IDEOGRAPH-6D77 (ID) ÷ [999.0] CJK UNIFIED IDEOGRAPH-519B (ID) ÷ [999.0] CJK UNIFIED IDEOGRAPH-66A8 (ID) ÷ [999.0] CJK UNIFIED IDEOGRAPH-6218 (ID) ÷ [999.0] CJK UNIFIED IDEOGRAPH-4E89 (ID) ÷ [999.0] CJK UNIFIED IDEOGRAPH-6D77 (ID) ÷ [999.0] CJK UNIFIED IDEOGRAPH-519B (ID) ÷ [999.0] CJK UNIFIED IDEOGRAPH-4E8E (ID) ÷ [999.0] DIGIT ONE (NU_NotEastAsian) × [25.15] DIGIT NINE (NU_NotEastAsian) × [25.15] DIGIT THREE (NU_NotEastAsian) × [25.15] DIGIT ZERO (NU_NotEastAsian) ÷ [999.0] CJK UNIFIED IDEOGRAPH-5E74 (ID) ÷ [999.0] CJK UNIFIED IDEOGRAPH-4EE3 (ID) ÷ [0.3]
× 0041 × 006E × 006D × 0065 × 0072 × 006B × 0075 × 006E × 0067 × 003A × 0020 ÷ 201E × 0057 × 0068 × 0069 × 0074 × 0065 × 201C × 0020 ÷ 0062 × 007A × 0077 × 002E × 0020 ÷ 201A × 767D ÷ 4EBA × 2018 × 0020 ÷ 2013 × 0020 ÷ 0069 × 006E × 0020 ÷ 0064 × 0065 × 0072 × 0020 ÷ 0041 × 006D × 0074 × 006C × 0069 × 0063 × 0068 × 0065 × 006E × 0020 ÷ 0053 × 0074 × 0061 × 0074 × 0069 × 0073 × 0074 × 0069 × 006B ÷ # × [0.3] LATIN CAPITAL LETTER A (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER N (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER M (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER E (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER R (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER K (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER U (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER N (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER G (AL_NotEastAsian_AL) × [15.4] COLON (IS_NotEastAsian) × [7.01] SPACE (SP_NotEastAsian) ÷ [18.0] DOUBLE LOW-9 QUOTATION MARK (OP_NotEastAsian_OP30) × [14.0] LATIN CAPITAL LETTER W (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER H (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER I (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER T (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER E (AL_NotEastAsian_AL) × [19.1] LEFT DOUBLE QUOTATION MARK (QU_QU_Pi_QUmPf_NotEastAsian) × [7.01] SPACE (SP_NotEastAsian) ÷ [18.0] LATIN SMALL LETTER B (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER Z (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER W (AL_NotEastAsian_AL) × [15.4] FULL STOP (IS_NotEastAsian) × [7.01] SPACE (SP_NotEastAsian) ÷ [18.0] SINGLE LOW-9 QUOTATION MARK (OP_NotEastAsian_OP30) × [14.0] CJK UNIFIED IDEOGRAPH-767D (ID) ÷ [999.0] CJK UNIFIED IDEOGRAPH-4EBA (ID) × [19.11] LEFT SINGLE QUOTATION MARK (QU_QU_Pi_QUmPf_NotEastAsian) × [7.01] SPACE (SP_NotEastAsian) ÷ [18.0] EN DASH (BA_NotEastAsian) × [7.01] SPACE (SP_NotEastAsian) ÷ [18.0] LATIN SMALL LETTER I (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER N (AL_NotEastAsian_AL) × [7.01] SPACE (SP_NotEastAsian) ÷ [18.0] LATIN SMALL LETTER D (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER E (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER R (AL_NotEastAsian_AL) × [7.01] SPACE (SP_NotEastAsian) ÷ [18.0] LATIN CAPITAL LETTER A (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER M (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER T (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER L (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER I (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER C (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER H (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER E (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER N (AL_NotEastAsian_AL) × [7.01] SPACE (SP_NotEastAsian) ÷ [18.0] LATIN CAPITAL LETTER S (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER T (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER A (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER T (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER I (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER S (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER T (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER I (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER K (AL_NotEastAsian_AL) ÷ [0.3]
+× 0020 ÷ 2067 × 004A × 006F × 0068 × 006E × 0020 ÷ 05D5 × 002D × 004D × 0069 × 0063 × 0068 × 0061 × 0065 × 006C × 2069 × 003B ÷ # × [0.3] SPACE (SP_NotEastAsian) ÷ [18.0] RIGHT-TO-LEFT ISOLATE (CM1_NotEastAsian_CM) × [28.0] LATIN CAPITAL LETTER J (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER O (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER H (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER N (AL_NotEastAsian_AL) × [7.01] SPACE (SP_NotEastAsian) ÷ [18.0] HEBREW LETTER VAV (HL_NotEastAsian) × [21.02] HYPHEN-MINUS (HY_NotEastAsian) × [21.1] LATIN CAPITAL LETTER M (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER I (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER C (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER H (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER A (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER E (AL_NotEastAsian_AL) × [28.0] LATIN SMALL LETTER L (AL_NotEastAsian_AL) × [9.0] POP DIRECTIONAL ISOLATE (CM1_NotEastAsian_CM) × [15.4] SEMICOLON (IS_NotEastAsian) ÷ [0.3]
+× 05D5 × 05B7 × 05BD × 05D9 × 05B0 × 05D4 × 05B4 × 05D9 × 05BE ÷ 05DB × 05B5 × 05BD × 05DF × 05C3 ÷ # × [0.3] HEBREW LETTER VAV (HL_NotEastAsian) × [9.0] HEBREW POINT PATAH (CM1_NotEastAsian_CM) × [9.0] HEBREW POINT METEG (CM1_NotEastAsian_CM) × [28.0] HEBREW LETTER YOD (HL_NotEastAsian) × [9.0] HEBREW POINT SHEVA (CM1_NotEastAsian_CM) × [28.0] HEBREW LETTER HE (HL_NotEastAsian) × [9.0] HEBREW POINT HIRIQ (CM1_NotEastAsian_CM) × [28.0] HEBREW LETTER YOD (HL_NotEastAsian) × [21.01] HEBREW PUNCTUATION MAQAF (BA_NotEastAsian) ÷ [999.0] HEBREW LETTER KAF (HL_NotEastAsian) × [9.0] HEBREW POINT TSERE (CM1_NotEastAsian_CM) × [9.0] HEBREW POINT METEG (CM1_NotEastAsian_CM) × [28.0] HEBREW LETTER FINAL NUN (HL_NotEastAsian) × [28.0] HEBREW PUNCTUATION SOF PASUQ (AL_NotEastAsian_AL) ÷ [0.3]
× 1F02C × 1F3FF ÷ # × [0.3] (Other) × [30.22] EMOJI MODIFIER FITZPATRICK TYPE-6 (EM) ÷ [0.3]
× 00A9 ÷ 1F3FF ÷ # × [0.3] COPYRIGHT SIGN (AL_NotEastAsian_AL) ÷ [999.0] EMOJI MODIFIER FITZPATRICK TYPE-6 (EM) ÷ [0.3]
#
-# Lines: 16670
+# Lines: 16672
#
# EOF
diff --git a/unicodetools/src/main/java/org/unicode/text/UCD/GenerateBreakTest.java b/unicodetools/src/main/java/org/unicode/text/UCD/GenerateBreakTest.java
index 0c7c44aaa..df1f72103 100644
--- a/unicodetools/src/main/java/org/unicode/text/UCD/GenerateBreakTest.java
+++ b/unicodetools/src/main/java/org/unicode/text/UCD/GenerateBreakTest.java
@@ -1732,6 +1732,9 @@ public GenerateLineBreakTest(UCD ucd, Segmenter.Target target) {
"2000年获得了《IGN》的“Best Game Boy Strategy”奖。",
"Z-1“莱贝雷希特·马斯”号是德国国家海军暨战争海军于1930年代",
"Anmerkung: „White“ bzw. ‚白人‘ – in der Amtlichen Statistik",
+ // Examples for LB21a.
+ " John ו-Michael;", // No break after ו-.
+ "וַֽיְהִי־כֵֽן׃", // Break after maqaf since Unicode 16.
});
// Additions for Unicode 14 LB30b [\p{Extended_Pictographic}&\p{Cn}] × EM
diff --git a/unicodetools/src/main/resources/org/unicode/tools/SegmenterDefault.txt b/unicodetools/src/main/resources/org/unicode/tools/SegmenterDefault.txt
index 41e05ab97..3da55ce81 100644
--- a/unicodetools/src/main/resources/org/unicode/tools/SegmenterDefault.txt
+++ b/unicodetools/src/main/resources/org/unicode/tools/SegmenterDefault.txt
@@ -295,8 +295,8 @@ $AL=($AL | ^ $CM | (?<=$Spec1_) $CM)
21.02) × $HY
21.03) × $NS
21.04) $BB ×
-# LB 21a Don't break after Hebrew + Hyphen.
-21.1) $HL ($HY | $BA) ×
+# LB 21a Do not break after the hyphen in Hebrew-hyphen-non-Hebrew.
+21.1) $HL ($HY | $BA) × [^$HL]
# LB 21b Don’t break between Solidus and Hebrew letters.
21.2) $SY × $HL
# LB 22 Do not break before ellipses.