From 43ca9c2fdaeddfa9b5422e60b56af815266f8d6d Mon Sep 17 00:00:00 2001 From: Conrad Nied Date: Tue, 17 Sep 2024 17:25:24 -0700 Subject: [PATCH 1/5] CLDR-10478 Add latest data from Macau census This updates the Macau population language data with up-to-date census information. --- common/supplemental/likelySubtags.xml | 2 +- common/supplemental/supplementalData.xml | 20 +++++++++++-------- .../util/data/country_language_population.tsv | 12 +++++++---- 3 files changed, 21 insertions(+), 13 deletions(-) diff --git a/common/supplemental/likelySubtags.xml b/common/supplemental/likelySubtags.xml index ce4e105a831..85fa983ed02 100644 --- a/common/supplemental/likelySubtags.xml +++ b/common/supplemental/likelySubtags.xml @@ -1200,7 +1200,7 @@ not be patched by hand, as any changes made in that fashion may be lost. - + diff --git a/common/supplemental/supplementalData.xml b/common/supplemental/supplementalData.xml index 5d09af73a59..e4f3d8faddc 100644 --- a/common/supplemental/supplementalData.xml +++ b/common/supplemental/supplementalData.xml @@ -1540,7 +1540,7 @@ XXX Code for transations where no currency is involved - + @@ -1565,7 +1565,7 @@ XXX Code for transations where no currency is involved - + @@ -1980,7 +1980,7 @@ XXX Code for transations where no currency is involved - + @@ -2409,7 +2409,7 @@ XXX Code for transations where no currency is involved - + @@ -3628,9 +3628,13 @@ XXX Code for transations where no currency is involved - + + + - + + + @@ -5566,7 +5570,6 @@ XXX Code for transations where no currency is involved Ethnologue lists 1 million 2nd lang users of English; no other good figures found. also: http://en.wikipedia.org/wiki/Bosnian_language French is a minority official language. Crude estimate of usage based on import partner data. - Macao reported 5% native Portuguese speakers. 5% writing pop estimated in absence of other data [missing] Crude estimate based on import partner data. @@ -5754,7 +5757,6 @@ XXX Code for transations where no currency is involved Mainly unwritten Vai script is the main script for this language. Latin listed as being used (Scriptsource) but no pop figures available. - and https://en.wikipedia.org/wiki/Macau but no literacy data Including 1st and 2nd lang speakers [missing] @@ -5818,5 +5820,7 @@ XXX Code for transations where no currency is involved [missing] Greek population in Russia -- most ancestrally used Pontic Greek -- modern usage almost certainly has dropped off but we don't have clear statistics on current usage. [missing] + 2021 Census, counting people who are fluent in the language + 2011 Census -- the language is not distinguished in the 2021 census diff --git a/tools/cldr-code/src/main/resources/org/unicode/cldr/util/data/country_language_population.tsv b/tools/cldr-code/src/main/resources/org/unicode/cldr/util/data/country_language_population.tsv index 1c423b54848..623ed1eec19 100644 --- a/tools/cldr-code/src/main/resources/org/unicode/cldr/util/data/country_language_population.tsv +++ b/tools/cldr-code/src/main/resources/org/unicode/cldr/util/data/country_language_population.tsv @@ -787,10 +787,14 @@ Luxembourg LU "605,764" 100% "62,110,000,000" official French fr 87% Luxembourg LU "605,764" 100% "62,110,000,000" official German de 63% Luxembourg LU "605,764" 100% "62,110,000,000" official Luxembourgish lb 67% 5% "http://www.ethnologue.com/show_language.asp?code=ltz Some 99% of users are literate in French or German. For languages not customarily written, the writing population is artificially set to 5% in the absence of better information." Luxembourg LU "605,764" 100% "62,110,000,000" Portuguese pt 16% https://en.wikipedia.org/wiki/Portuguese_Luxembourger -Macao SAR China MO "606,340" 96% "71,820,000,000" Chinese zh 5% Hans literacy is unknown; set to 5% artificially pending better or official figures. -Macao SAR China MO "606,340" 96% "71,820,000,000" official Chinese (Traditional) zh_Hant 98% -Macao SAR China MO "606,340" 96% "71,820,000,000" English en "13,900" https://www.cia.gov/library/publications/the-world-factbook/geos/mc.html and https://en.wikipedia.org/wiki/Macau -Macao SAR China MO "606,340" 96% "71,820,000,000" official Portuguese pt 5% http://en.wikipedia.org/wiki/Geographic_distribution_of_Portuguese Macao reported 5% native Portuguese speakers. +Macao SAR China MO "682,070" 96% "71,820,000,000" Chinese zh 5% Hans literacy is unknown; set to 5% artificially pending better or official figures. +Macao SAR China MO "682,070" 96% "71,820,000,000" official Chinese (Traditional) zh_Hant 98% +Macao SAR China MO "682,070" 96% "71,820,000,000" English en 22.7% https://www.dsec.gov.mo/getAttachment/6cb29f2f-524a-488f-aed3-4d7207bb109e/E_CEN_PUB_2021_Y.aspx 2021 Census, counting people who are fluent in the language +Macao SAR China MO "682,070" 96% "71,820,000,000" official Portuguese pt 2.3% https://www.dsec.gov.mo/getAttachment/6cb29f2f-524a-488f-aed3-4d7207bb109e/E_CEN_PUB_2021_Y.aspx 2021 Census, counting people who are fluent in the language +Macao SAR China MO "682,070" 96% "71,820,000,000" official Cantonese yue 86.2% https://www.dsec.gov.mo/getAttachment/6cb29f2f-524a-488f-aed3-4d7207bb109e/E_CEN_PUB_2021_Y.aspx 2021 Census, counting people who are fluent in the language +Macao SAR China MO "682,070" 96% "71,820,000,000" official Mandarin cmn 45% https://www.dsec.gov.mo/getAttachment/6cb29f2f-524a-488f-aed3-4d7207bb109e/E_CEN_PUB_2021_Y.aspx 2021 Census, counting people who are fluent in the language +Macao SAR China MO "682,070" 96% "71,820,000,000" official Filipino fil "20,879" https://www.dsec.gov.mo/getAttachment/6cb29f2f-524a-488f-aed3-4d7207bb109e/E_CEN_PUB_2021_Y.aspx 2021 Census, counting people who are fluent in the language +Macao SAR China MO "682,070" 96% "71,820,000,000" official Hokkien nan 3.7% https://www.dsec.gov.mo/getAttachment/7a3b17c2-22cc-4197-9bd5-ccc6eec388a2/E_CEN_PUB_2011_Y.aspx 2011 Census -- the language is not distinguished in the 2021 census Madagascar MG "25,683,610" 65% "39,850,000,000" official English en 18% No literacy figure available for English in Madagascar; newly adopted official language; 5% is an estimate. Madagascar MG "25,683,610" 65% "39,850,000,000" official French fr 69% Madagascar MG "25,683,610" 65% "39,850,000,000" official Malagasy mg 90% http://www.wildmadagascar.org/overview/loc/27-minorities.html From 5060ad3a0c39c9901b4b0b7b2482a4f759c62e75 Mon Sep 17 00:00:00 2001 From: Conrad Nied Date: Wed, 18 Sep 2024 17:42:52 -0700 Subject: [PATCH 2/5] CLDR-10478 Update GenerateLikelyTestData java -jar tools/cldr-code/target/cldr-code.jar GenerateLikelyTestData --- common/testData/localeIdentifiers/likelySubtags.txt | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/common/testData/localeIdentifiers/likelySubtags.txt b/common/testData/localeIdentifiers/likelySubtags.txt index d7f06199e44..ebfa73bd8c5 100644 --- a/common/testData/localeIdentifiers/likelySubtags.txt +++ b/common/testData/localeIdentifiers/likelySubtags.txt @@ -1435,7 +1435,7 @@ und-Latn-MG ; mg-Latn-MG ; mg ; und-Latn-MH ; en-Latn-MH ; en-MH ; und-Latn-MK ; sq-Latn-MK ; sq-MK ; und-Latn-ML ; bm-Latn-ML ; bm ; -und-Latn-MO ; pt-Latn-MO ; pt-MO ; +und-Latn-MO ; en-Latn-MO ; en-MO ; und-Latn-MP ; en-Latn-MP ; en-MP ; und-Latn-MQ ; fr-Latn-MQ ; fr-MQ ; und-Latn-MR ; fr-Latn-MR ; fr-MR ; From 8b848e914f9d4dbbd622316b7af376a1fda005aa Mon Sep 17 00:00:00 2001 From: Conrad Nied Date: Thu, 19 Sep 2024 10:12:51 -0700 Subject: [PATCH 3/5] CLDR-10478 Fix official languages for Macau --- common/supplemental/supplementalData.xml | 14 +++++++------- .../cldr/util/data/country_language_population.tsv | 8 ++++---- 2 files changed, 11 insertions(+), 11 deletions(-) diff --git a/common/supplemental/supplementalData.xml b/common/supplemental/supplementalData.xml index e4f3d8faddc..347a4ead7ee 100644 --- a/common/supplemental/supplementalData.xml +++ b/common/supplemental/supplementalData.xml @@ -1565,7 +1565,7 @@ XXX Code for transations where no currency is involved - + @@ -1980,7 +1980,7 @@ XXX Code for transations where no currency is involved - + @@ -2421,7 +2421,7 @@ XXX Code for transations where no currency is involved - + @@ -3628,12 +3628,12 @@ XXX Code for transations where no currency is involved - - + + - - + + diff --git a/tools/cldr-code/src/main/resources/org/unicode/cldr/util/data/country_language_population.tsv b/tools/cldr-code/src/main/resources/org/unicode/cldr/util/data/country_language_population.tsv index 623ed1eec19..671559691cb 100644 --- a/tools/cldr-code/src/main/resources/org/unicode/cldr/util/data/country_language_population.tsv +++ b/tools/cldr-code/src/main/resources/org/unicode/cldr/util/data/country_language_population.tsv @@ -791,10 +791,10 @@ Macao SAR China MO "682,070" 96% "71,820,000,000" Chinese zh 5% Hans literacy Macao SAR China MO "682,070" 96% "71,820,000,000" official Chinese (Traditional) zh_Hant 98% Macao SAR China MO "682,070" 96% "71,820,000,000" English en 22.7% https://www.dsec.gov.mo/getAttachment/6cb29f2f-524a-488f-aed3-4d7207bb109e/E_CEN_PUB_2021_Y.aspx 2021 Census, counting people who are fluent in the language Macao SAR China MO "682,070" 96% "71,820,000,000" official Portuguese pt 2.3% https://www.dsec.gov.mo/getAttachment/6cb29f2f-524a-488f-aed3-4d7207bb109e/E_CEN_PUB_2021_Y.aspx 2021 Census, counting people who are fluent in the language -Macao SAR China MO "682,070" 96% "71,820,000,000" official Cantonese yue 86.2% https://www.dsec.gov.mo/getAttachment/6cb29f2f-524a-488f-aed3-4d7207bb109e/E_CEN_PUB_2021_Y.aspx 2021 Census, counting people who are fluent in the language -Macao SAR China MO "682,070" 96% "71,820,000,000" official Mandarin cmn 45% https://www.dsec.gov.mo/getAttachment/6cb29f2f-524a-488f-aed3-4d7207bb109e/E_CEN_PUB_2021_Y.aspx 2021 Census, counting people who are fluent in the language -Macao SAR China MO "682,070" 96% "71,820,000,000" official Filipino fil "20,879" https://www.dsec.gov.mo/getAttachment/6cb29f2f-524a-488f-aed3-4d7207bb109e/E_CEN_PUB_2021_Y.aspx 2021 Census, counting people who are fluent in the language -Macao SAR China MO "682,070" 96% "71,820,000,000" official Hokkien nan 3.7% https://www.dsec.gov.mo/getAttachment/7a3b17c2-22cc-4197-9bd5-ccc6eec388a2/E_CEN_PUB_2011_Y.aspx 2011 Census -- the language is not distinguished in the 2021 census +Macao SAR China MO "682,070" 96% "71,820,000,000" de_facto_official Cantonese yue 86.2% https://www.dsec.gov.mo/getAttachment/6cb29f2f-524a-488f-aed3-4d7207bb109e/E_CEN_PUB_2021_Y.aspx 2021 Census, counting people who are fluent in the language +Macao SAR China MO "682,070" 96% "71,820,000,000" Mandarin cmn 45% https://www.dsec.gov.mo/getAttachment/6cb29f2f-524a-488f-aed3-4d7207bb109e/E_CEN_PUB_2021_Y.aspx 2021 Census, counting people who are fluent in the language +Macao SAR China MO "682,070" 96% "71,820,000,000" Filipino fil "20,879" https://www.dsec.gov.mo/getAttachment/6cb29f2f-524a-488f-aed3-4d7207bb109e/E_CEN_PUB_2021_Y.aspx 2021 Census, counting people who are fluent in the language +Macao SAR China MO "682,070" 96% "71,820,000,000" Hokkien nan 3.7% https://www.dsec.gov.mo/getAttachment/7a3b17c2-22cc-4197-9bd5-ccc6eec388a2/E_CEN_PUB_2011_Y.aspx 2011 Census -- the language is not distinguished in the 2021 census Madagascar MG "25,683,610" 65% "39,850,000,000" official English en 18% No literacy figure available for English in Madagascar; newly adopted official language; 5% is an estimate. Madagascar MG "25,683,610" 65% "39,850,000,000" official French fr 69% Madagascar MG "25,683,610" 65% "39,850,000,000" official Malagasy mg 90% http://www.wildmadagascar.org/overview/loc/27-minorities.html From 851cfd6ff6d4ed3ce58d570483900f087ee2576d Mon Sep 17 00:00:00 2001 From: Conrad Nied Date: Thu, 19 Sep 2024 10:17:46 -0700 Subject: [PATCH 4/5] CLDR-10478 Remove `cmn` from Macau because of overlap with `zh` Even though `cmn` knowledge is at 45% of Macau, since `zh` is implied to be `cmn` it ends up being double counted. Potentially we can separate `zh` from `cmn` -- but that's a whole new discussion that's best saved for later. --- common/supplemental/supplementalData.xml | 3 +-- .../org/unicode/cldr/util/data/country_language_population.tsv | 1 - 2 files changed, 1 insertion(+), 3 deletions(-) diff --git a/common/supplemental/supplementalData.xml b/common/supplemental/supplementalData.xml index 347a4ead7ee..3688b022e8c 100644 --- a/common/supplemental/supplementalData.xml +++ b/common/supplemental/supplementalData.xml @@ -2421,7 +2421,7 @@ XXX Code for transations where no currency is involved - + @@ -3629,7 +3629,6 @@ XXX Code for transations where no currency is involved - diff --git a/tools/cldr-code/src/main/resources/org/unicode/cldr/util/data/country_language_population.tsv b/tools/cldr-code/src/main/resources/org/unicode/cldr/util/data/country_language_population.tsv index 671559691cb..1614cef6042 100644 --- a/tools/cldr-code/src/main/resources/org/unicode/cldr/util/data/country_language_population.tsv +++ b/tools/cldr-code/src/main/resources/org/unicode/cldr/util/data/country_language_population.tsv @@ -792,7 +792,6 @@ Macao SAR China MO "682,070" 96% "71,820,000,000" official Chinese (Traditional) Macao SAR China MO "682,070" 96% "71,820,000,000" English en 22.7% https://www.dsec.gov.mo/getAttachment/6cb29f2f-524a-488f-aed3-4d7207bb109e/E_CEN_PUB_2021_Y.aspx 2021 Census, counting people who are fluent in the language Macao SAR China MO "682,070" 96% "71,820,000,000" official Portuguese pt 2.3% https://www.dsec.gov.mo/getAttachment/6cb29f2f-524a-488f-aed3-4d7207bb109e/E_CEN_PUB_2021_Y.aspx 2021 Census, counting people who are fluent in the language Macao SAR China MO "682,070" 96% "71,820,000,000" de_facto_official Cantonese yue 86.2% https://www.dsec.gov.mo/getAttachment/6cb29f2f-524a-488f-aed3-4d7207bb109e/E_CEN_PUB_2021_Y.aspx 2021 Census, counting people who are fluent in the language -Macao SAR China MO "682,070" 96% "71,820,000,000" Mandarin cmn 45% https://www.dsec.gov.mo/getAttachment/6cb29f2f-524a-488f-aed3-4d7207bb109e/E_CEN_PUB_2021_Y.aspx 2021 Census, counting people who are fluent in the language Macao SAR China MO "682,070" 96% "71,820,000,000" Filipino fil "20,879" https://www.dsec.gov.mo/getAttachment/6cb29f2f-524a-488f-aed3-4d7207bb109e/E_CEN_PUB_2021_Y.aspx 2021 Census, counting people who are fluent in the language Macao SAR China MO "682,070" 96% "71,820,000,000" Hokkien nan 3.7% https://www.dsec.gov.mo/getAttachment/7a3b17c2-22cc-4197-9bd5-ccc6eec388a2/E_CEN_PUB_2011_Y.aspx 2011 Census -- the language is not distinguished in the 2021 census Madagascar MG "25,683,610" 65% "39,850,000,000" official English en 18% No literacy figure available for English in Madagascar; newly adopted official language; 5% is an estimate. From f0360d01538ec2d2b78041adf8a05e9dfae36159 Mon Sep 17 00:00:00 2001 From: Conrad Nied Date: Sat, 21 Sep 2024 10:49:08 -0700 Subject: [PATCH 5/5] CLDR-10478 Add Cantonese (Macau) locale xml Since I added a new locale that has "de_facto_official" status I need to add a new xml -- easy enough, I'll just have it inherit from root for now. I also re-generated the test data with `java -jar tools/cldr-code/target/cldr-code.jar GenerateLikelyTestData` --- common/main/yue_Hant_MO.xml | 15 +++++++++++++++ .../testData/localeIdentifiers/likelySubtags.txt | 1 + 2 files changed, 16 insertions(+) create mode 100644 common/main/yue_Hant_MO.xml diff --git a/common/main/yue_Hant_MO.xml b/common/main/yue_Hant_MO.xml new file mode 100644 index 00000000000..1a44ad133f2 --- /dev/null +++ b/common/main/yue_Hant_MO.xml @@ -0,0 +1,15 @@ + + + + + + + +