-
Notifications
You must be signed in to change notification settings - Fork 385
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CLDR-11155 Test for ST pages with too many rows #3519
CLDR-11155 Test for ST pages with too many rows #3519
Conversation
I took the code I made to find out the page sizes, and made it into a test. Once the emoji pages are handled, the number of failures will go down. (We can also raise the error threshold or make some exceptions if we want.) |
final long minError = 600; // above this, emit error | ||
final long minLog = 500; // otherwise above this, emit warning | ||
Factory factory = CLDRConfig.getInstance().getCommonAndSeedAndMainAndAnnotationsFactory(); | ||
// "en", "cs", "ar", "pl" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
// "en", "cs", "ar", "pl" |
+ "\t" | ||
+ entry.getSectionId() | ||
+ "\t" | ||
+ entry | ||
+ "\thas too many entries:\t" | ||
+ count); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- String.format will avoid taking up 6 lines
- include
minError
also so it's clear what the target value is
#3508 is merged, so I'll restart the tests on this one |
The test still fails, including:
@macchiati should I keep dividing pages until the test passes with the max being 600? Anyway I was about to start on Volume/Volume2 |
I think we identified the pages that we wanted to address. At this point we probably want to set the limit a bit higher so we just knock down the bad cases. For volume, I recommend that we split by the system. Volume Metric and Volume Other. Some notes: From any unit you can get its system, and check it. make a static final UnicodeSystems METRIC = Sets.of(UnitSystem.metric, UnitSystem.metric_adjacent); extract the unit from the path get a UnitConverter from the SupplementalDataInfo |
OK! I was in the middle of another approach, trying to split about halfway down starting Volume2 with "Gallon" but what you suggest sounds more meaningful |
@btangmu Please integrate this in, setting the error size limit to a value that works, and incorporating into your work. You can reset the error size limit down after further work. |
@macchiati This was the basis for #3563 and #3573 -- now redundant; OK to close? |
CLDR-11155
ALLOW_MANY_COMMITS=true