Skip to content

Commit

Permalink
CLDR-17145 v44: specs to final
Browse files Browse the repository at this point in the history
- tr35 revsion 70
- Note, v43.1 had the wrong revision numbers, filed CLDR-17198
  • Loading branch information
srl295 committed Oct 25, 2023
1 parent e42c80c commit 81bc8c3
Show file tree
Hide file tree
Showing 8 changed files with 56 additions and 55 deletions.
10 changes: 5 additions & 5 deletions docs/ldml/tr35-collation.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@

# Unicode Locale Data Markup Language (LDML)<br/>Part 5: Collation

|Version|44 (draft) |
|Version|44 |
|-------|----------------|
|Editors|Markus Scherer (<a href="mailto:[email protected]">[email protected]</a>) and <a href="tr35.md#Acknowledgments">other CLDR committee members</a>|

Expand All @@ -21,12 +21,12 @@ See <https://cldr.unicode.org> for up-to-date CLDR release data.

### _Status_

_This is a draft document which may be updated, replaced, or superseded by other documents at any time.
<!-- _This is a draft document which may be updated, replaced, or superseded by other documents at any time.
Publication does not imply endorsement by the Unicode Consortium.
This is not a stable document; it is inappropriate to cite this document as other than a work in progress._
This is not a stable document; it is inappropriate to cite this document as other than a work in progress._ -->

<!-- _This document has been reviewed by Unicode members and other interested parties, and has been approved for publication by the Unicode Consortium.
This is a stable document and may be used as reference material or cited as a normative reference by other specifications._ -->
_This document has been reviewed by Unicode members and other interested parties, and has been approved for publication by the Unicode Consortium.
This is a stable document and may be used as reference material or cited as a normative reference by other specifications._

> _**A Unicode Technical Standard (UTS)** is an independent specification. Conformance to the Unicode Standard does not imply conformance to any UTS._
Expand Down
10 changes: 5 additions & 5 deletions docs/ldml/tr35-dates.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@

# Unicode Locale Data Markup Language (LDML)<br/>Part 4: Dates

|Version|44 (draft) |
|Version|44 |
|-------|------------------|
|Editors|Peter Edberg and <a href="tr35.md#Acknowledgments">other CLDR committee members</a>|

Expand All @@ -16,12 +16,12 @@ This is a partial document, describing only those parts of the LDML that are rel

### _Status_

_This is a draft document which may be updated, replaced, or superseded by other documents at any time.
<!-- _This is a draft document which may be updated, replaced, or superseded by other documents at any time.
Publication does not imply endorsement by the Unicode Consortium.
This is not a stable document; it is inappropriate to cite this document as other than a work in progress._
This is not a stable document; it is inappropriate to cite this document as other than a work in progress._ -->

<!-- _This document has been reviewed by Unicode members and other interested parties, and has been approved for publication by the Unicode Consortium.
This is a stable document and may be used as reference material or cited as a normative reference by other specifications._ -->
_This document has been reviewed by Unicode members and other interested parties, and has been approved for publication by the Unicode Consortium.
This is a stable document and may be used as reference material or cited as a normative reference by other specifications._

> _**A Unicode Technical Standard (UTS)** is an independent specification. Conformance to the Unicode Standard does not imply conformance to any UTS._
Expand Down
18 changes: 9 additions & 9 deletions docs/ldml/tr35-general.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@

# Unicode Locale Data Markup Language (LDML)<br/>Part 2: General

|Version|44 (draft) |
|Version|44 |
|-------|---------------------|
|Editors|Yoshito Umaoka (<a href="mailto:[email protected]">[email protected]</a>) and <a href="tr35.md#Acknowledgments">other CLDR committee members|

Expand All @@ -21,12 +21,12 @@ See <https://cldr.unicode.org> for up-to-date CLDR release data.

### _Status_

_This is a draft document which may be updated, replaced, or superseded by other documents at any time.
<!-- _This is a draft document which may be updated, replaced, or superseded by other documents at any time.
Publication does not imply endorsement by the Unicode Consortium.
This is not a stable document; it is inappropriate to cite this document as other than a work in progress._
This is not a stable document; it is inappropriate to cite this document as other than a work in progress._ -->

<!-- _This document has been reviewed by Unicode members and other interested parties, and has been approved for publication by the Unicode Consortium.
This is a stable document and may be used as reference material or cited as a normative reference by other specifications._ -->
_This document has been reviewed by Unicode members and other interested parties, and has been approved for publication by the Unicode Consortium.
This is a stable document and may be used as reference material or cited as a normative reference by other specifications._

> _**A Unicode Technical Standard (UTS)** is an independent specification. Conformance to the Unicode Standard does not imply conformance to any UTS._
Expand Down Expand Up @@ -1003,7 +1003,7 @@ Some of the constraints reference data from the unitIdComponents in [Unit_Conver
or &lt;unitIdComponent type="per"&gt;.
</li>
<li><em>Constraint:</em> must not have a prefix as an initial segment.</li>
<li><em>Constraint:</em> no two different base_components will share the first 8 letters.
<li><em>Constraint:</em> no two different base_components will share the first 8 letters.
(<b>For more information, see <a href="#Unit_Identifier_Uniqueness">Unit Identifier Uniqueness</a>.)</b>
</li>
</ul>
Expand Down Expand Up @@ -1848,7 +1848,7 @@ If the direction is `forward`, then an ID is composed from `target + "-" + sourc

The `visibility` attribute indicates whether the IDs should be externally visible, or whether they are only used internally.

Note: In CLDR v28 and before, the rules were expressed as fine-grained XML.
Note: In CLDR v28 and before, the rules were expressed as fine-grained XML.
That was discarded in CLDR version 29, in favor of a simpler format where the separate rules are simply terminated with ";".

The transform rules are similar to regular-expression substitutions, but adapted to the specific domain of text transformations. The rules and comments in this discussion will be intermixed, with # marking the comments. The simplest rule is a conversion rule, which replaces one string of characters with another. The conversion rule takes the following form:
Expand Down Expand Up @@ -2158,7 +2158,7 @@ Conversion rules can be forward, backward, or double. The complete conversion ru
> b | c ← e { f g } h ;
> ```
The `completed_result` | `result_to_revisit` is also known as the `resulting_text`. Either or both of the values can be empty. For example, the following removes any a, b, or c.
The `completed_result` | `result_to_revisit` is also known as the `resulting_text`. Either or both of the values can be empty. For example, the following removes any a, b, or c.
```
[a-c] → ;
Expand Down Expand Up @@ -2291,7 +2291,7 @@ Because the order of rules matters, the following will not work as expected
c → s;
ch → kh;
```
The second rule can never execute, because it is "masked" by the first.
The second rule can never execute, because it is "masked" by the first.
To help prevent errors, implementations should try to alert readers when this occurs, eg:
```
Rule {c > s;} masks {ch > kh;}
Expand Down
14 changes: 7 additions & 7 deletions docs/ldml/tr35-info.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@

# Unicode Locale Data Markup Language (LDML)<br/>Part 6: Supplemental

|Version|44 (draft) |
|Version|44 |
|-------|-----------|
|Editors|Steven Loomis (<a href="mailto:[email protected]">[email protected]</a>) and <a href="tr35.md#Acknowledgments">other CLDR committee members|

Expand All @@ -21,12 +21,12 @@ See <https://cldr.unicode.org> for up-to-date CLDR release data.

### _Status_

_This is a draft document which may be updated, replaced, or superseded by other documents at any time.
<!-- _This is a draft document which may be updated, replaced, or superseded by other documents at any time.
Publication does not imply endorsement by the Unicode Consortium.
This is not a stable document; it is inappropriate to cite this document as other than a work in progress._
This is not a stable document; it is inappropriate to cite this document as other than a work in progress._ -->

<!-- _This document has been reviewed by Unicode members and other interested parties, and has been approved for publication by the Unicode Consortium.
This is a stable document and may be used as reference material or cited as a normative reference by other specifications._ -->
_This document has been reviewed by Unicode members and other interested parties, and has been approved for publication by the Unicode Consortium.
This is a stable document and may be used as reference material or cited as a normative reference by other specifications._

> _**A Unicode Technical Standard (UTS)** is an independent specification. Conformance to the Unicode Standard does not imply conformance to any UTS._
Expand Down Expand Up @@ -930,9 +930,9 @@ The systems attributes also apply to compound units, and are computed in the fol
1. The intersection of {… si …} and {… si_acceptable … } is {… si_acceptable …}
2. The intersection of {… metric …} and {… metric_adjacent … } is {… metric_adjacent …}

Examples:
Examples:
```
systems(liter-per-hectare)
systems(liter-per-hectare)
= {si_acceptable metric} ∪ {si_acceptable metric}
= {si_acceptable metric}
systems(meter-per-hectare)
Expand Down
6 changes: 3 additions & 3 deletions docs/ldml/tr35-keyboards.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@

# Unicode Locale Data Markup Language (LDML)<br/>Part 7: Keyboards

|Version|44 (draft) |
|Version|44 |
|-------|-------------|
|Editors|Steven Loomis (<a href="mailto:[email protected]">[email protected]</a>) and <a href="tr35.md#Acknowledgments">other CLDR committee members</a>|

Expand All @@ -28,8 +28,8 @@ See <https://cldr.unicode.org> for up-to-date CLDR release data.
### _Status_

This document is a _technical preview_ of the Keyboard standard.
This document has _not_ been approved for publication by the Unicode Consortium,
and may be substantially altered before any publication. For the latest published version of UTS#35, see <https://www.unicode.org/reports/tr35/>

To process earlier XML files, use the data and specification from v43.1, found at <https://www.unicode.org/reports/tr35/tr35-69/tr35.html>

The CLDR [Keyboard Workgroup](https://cldr.unicode.org/index/keyboard-workgroup) is currently
developing this technical preview to the CLDR keyboard specification.
Expand Down
9 changes: 5 additions & 4 deletions docs/ldml/tr35-numbers.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@

# Unicode Locale Data Markup Language (LDML)<br/>Part 3: Numbers

|Version|44 (draft)|
|Version|44 |
|-------|----------|
|Editors|Shane F. Carr (<a href="mailto:[email protected]">[email protected]</a>) and <a href="tr35.md#Acknowledgments">other CLDR committee members|

Expand All @@ -17,11 +17,12 @@ This is a partial document, describing only those parts of the LDML that are rel
### _Status_

_This is a draft document which may be updated, replaced, or superseded by other documents at any time.
<!-- _This is a draft document which may be updated, replaced, or superseded by other documents at any time.
Publication does not imply endorsement by the Unicode Consortium.
This is not a stable document; it is inappropriate to cite this document as other than a work in progress._
This is not a stable document; it is inappropriate to cite this document as other than a work in progress._ -->

<!-- _This document has been reviewed by Unicode members and other interested parties, and has been approved for publication by the Unicode Consortium.
This is a stable document and may be used as reference material or cited as a normative reference by other specifications._ -->
_This document has been reviewed by Unicode members and other interested parties, and has been approved for publication by the Unicode Consortium.
This is a stable document and may be used as reference material or cited as a normative reference by other specifications._

> _**A Unicode Technical Standard (UTS)** is an independent specification. Conformance to the Unicode Standard does not imply conformance to any UTS._
Expand Down
22 changes: 11 additions & 11 deletions docs/ldml/tr35-personNames.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@

# Unicode Locale Data Markup Language (LDML)<br/>Part 8: Person Names

|Version|44 (draft) |
|Version|44 |
|-------|------------------------|
|Editors|Mark Davis, Peter Edberg, Rich Gillam, Alex Kolisnychenko, Mike McKenna and [other CLDR committee members](tr35.md#Acknowledgments)|

Expand All @@ -16,12 +16,12 @@ This is a partial document, describing only those parts of the LDML that are rel

### _Status_

_This is a draft document which may be updated, replaced, or superseded by other documents at any time.
<!-- _This is a draft document which may be updated, replaced, or superseded by other documents at any time.
Publication does not imply endorsement by the Unicode Consortium.
This is not a stable document; it is inappropriate to cite this document as other than a work in progress._
This is not a stable document; it is inappropriate to cite this document as other than a work in progress._ -->

<!-- _This document has been reviewed by Unicode members and other interested parties, and has been approved for publication by the Unicode Consortium.
This is a stable document and may be used as reference material or cited as a normative reference by other specifications._ -->
_This document has been reviewed by Unicode members and other interested parties, and has been approved for publication by the Unicode Consortium.
This is a stable document and may be used as reference material or cited as a normative reference by other specifications._

> _**A Unicode Technical Standard (UTS)** is an independent specification. Conformance to the Unicode Standard does not imply conformance to any UTS._
Expand Down Expand Up @@ -290,11 +290,11 @@ This would tell the formatting code, when handling person name data from an Engl
```
Many clients of the person-names functionality don’t really care about formal versus informal; they just want whatever the “normal” formality level is for the user’s language. The same goes for the default length.

This parameter provides that information, so that APIs can allow users to use default values for the formality and length. The exact form that this takes depends on the API conventions, of course.
This parameter provides that information, so that APIs can allow users to use default values for the formality and length. The exact form that this takes depends on the API conventions, of course.

### foreignSpaceReplacement Element

The `<foreignSpaceReplacement>` element is used to specify how spaces should be handled when the name language is **different from** the formatting language. It is used in languages that don't normally require spaces between words. For example, Japanese and Chinese have the value of a middle dot (‘·’ U+00B7 MIDDLE DOT or ‘・’ U+30FB KATAKANA MIDDLE DOT), so that it is used between words in a foreign name; most other languages have the value of SPACE.
The `<foreignSpaceReplacement>` element is used to specify how spaces should be handled when the name language is **different from** the formatting language. It is used in languages that don't normally require spaces between words. For example, Japanese and Chinese have the value of a middle dot (‘·’ U+00B7 MIDDLE DOT or ‘・’ U+30FB KATAKANA MIDDLE DOT), so that it is used between words in a foreign name; most other languages have the value of SPACE.

```xml
<!ELEMENT foreignSpaceReplacement ( #PCDATA ) >
Expand Down Expand Up @@ -509,10 +509,10 @@ The modifiers transform the input data as described in the following table:

#### Grammatical Modifiers for Names

The CLDR person name formatting does not itself support grammatical inflection.
The CLDR person name formatting does not itself support grammatical inflection.
However, name sources (NameObject) can support inflections, either by having additional fields or by using an inflection engine that can handle personal name parts.

In the current release, the focus is on supporting `referring` and `addressing` forms.
In the current release, the focus is on supporting `referring` and `addressing` forms.
Typically the `referring` forms will be in the most neutral (*nominative*) case, and the `addressing` forms will be in the *vocative* case.
Some modifiers have been added to facilitate this, so that there can be patterns like: {given-vocative} {surname-vocative}.

Expand All @@ -527,7 +527,7 @@ For example:
Notice that the `surname` in Latvian needs to change to the genitive case with that pattern:

Ozoliņš ➡︎ **Ozoliņa**

That is accomplished by changing the pattern to be {surname<b>-genitive</b>} {title}. In this case the {surname} should only be genitive if followed by the {title}.

#### Future Modifiers
Expand Down Expand Up @@ -579,7 +579,7 @@ Let the **full formatting locale** be the maximal likely locale for the formatte
A few script values represent a set of scripts, such as Jpan = {Hani, Kana, Hira}. Two script codes are said to _match_ when they are either identical, or one represents a set which contains the other, or they both represent sets which intersect. For example, Hani and Jpan match, because {Hani, Kana, Hira} contains Hani.

If the **name script** doesn't match the **formatting script**:
1. If the name locale has name formatting data, then set the formatting locale to the name locale.
1. If the name locale has name formatting data, then set the formatting locale to the name locale.
2. Otherwise, set the formatting locale to the maximal likely locale for the the locale formed from und, plus the name script plus the region of the nameLocale.

For example, when a Hindi (Devanagari) formatter is called upon to format a name object that has the locale Ukrainian (Cyrillic):
Expand Down
Loading

0 comments on commit 81bc8c3

Please sign in to comment.