8000 CLDR-18843 Remove organization Special by btangmu · Pull Request #5116 · unicode-org/cldr · GitHub
[go: up one dir, main page]

Skip to content

Conversation

btangmu
Copy link
Member
@btangmu btangmu commented Oct 1, 2025

-Remove Organization.special and special.visibleOnFrontEnd

-Remove all lines from Locales.txt referencing special organization

-Remove lines from Locales.txt referencing Cldr organization where same locale was included for special, with 3 exceptions: sr_Latn, qu, to, which are needed to pass test TestCLDRLocaleCoverage.TestCldrSuperset

-Remove SubmissionLocales.SPECIAL_ORG_LOCALES, no longer need to subtract from CLDR locales

-Revise SubmissionLocales.HIGH_LEVEL_LOCALES, removing some special locales

-Remove ChartLocaleGrowth.SpecialLocales, isSpecial

-No longer need to subtract special from CLDR locales in ListCoverageLevels, ShowInconsistentAvailable, DiffLanguageGroups, ListProblemDates, TestUnits

-In ShowLocaleCoverage, remove specialFlag

-In VettingViewer, remove HC = high coverage for special locales

-In TestStandardCodes.testTargetCoverageLevel, remove assertion for locale br

CLDR-18843

  • This PR completes the ticket.

ALLOW_MANY_COMMITS=true

-Remove Organization.special and special.visibleOnFrontEnd

-Remove all lines from Locales.txt referencing special organization

-Remove lines from Locales.txt referencing Cldr organization where same locale was included for special, with 3 exceptions: sr_Latn, qu, to, which are needed to pass test TestCLDRLocaleCoverage.TestCldrSuperset

-Remove SubmissionLocales.SPECIAL_ORG_LOCALES, no longer need to subtract from CLDR locales

-Revise SubmissionLocales.HIGH_LEVEL_LOCALES, removing some special locales

-Remove ChartLocaleGrowth.SpecialLocales, isSpecial

-No longer need to subtract special from CLDR locales in ListCoverageLevels, ShowInconsistentAvailable, DiffLanguageGroups, ListProblemDates, TestUnits

-In ShowLocaleCoverage, remove specialFlag

-In VettingViewer, remove HC = high coverage for special locales

-In TestStandardCodes.testTargetCoverageLevel, remove assertion for locale br
Cldr ; yue ; modern ; T5 Cantonese

#Cldr Tier generated
Cldr ; sr_Latn ; modern ; Tx Serbian (auto-generated)
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This line for "sr_Latn" is moved, not removed. It is essentially unchanged, except for the addition of a comment explaining the relation to TestCLDRLocaleCoverage.TestCldrSuperset. The same applies to "qu" and "to".

@btangmu
Copy link
Member Author
btangmu commented Oct 1, 2025

The strategy that resulted in these changes was essentially: (1) remove all "special" from Locales.txt; (2) remove those same locales from "Cldr" in Locales.txt, except for sr_Latn, qu, to, which are needed to pass test TestCLDRLocaleCoverage.TestCldrSuperset; (3) revise Java code as needed for tests to pass and to eliminate references to special org

@btangmu btangmu requested review from macchiati and AEApple October 1, 2025 15:27
// Note: ALL of these were found in Locales.txt under cldr.
"chr", // Cherokee
"gd", // Scottish Gaelic, Gaelic
"fo", // Faroese
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm confused by this since ha, kok, pcm are TC locales.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is the confusing thing (1) that chr and fo were removed, or (2) that ha, kok, and pcm were not removed (or were present in the first place)?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

chr and fo were removed since they were special.

ha, kok, and pcm were not removed since they weren't special. Why they were present in the first place, I'm not sure, though I suspect they might be required for TestCldrSuperset to pass.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There is a test that CLDR org is a superset of the TC orgs' locales. We were tracking Specials as part of that, but that was a hack so we're removing it. So locales that were only in CLDR org by virtual of being Special are being removed.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think the HIGH_LEVEL_LOCALES should also go away; they were intended to align with the Specials above basic, but we are no longer tracking specials. However, that can be in a different PR.


Cldr ; fo ; moderate ; Faroese
Cldr ; qu ; modern ; Quechua
Cldr ; br ; moderate ; Breton
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I thought that the CLDR list was a superset of all the locales across all of the organizations?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The test TestCldrSuperset ensures that, if I understand it.

These are deleted from Cldr because they were "special", and there was a lot of code that relied on the special locales being subtracted from the Cldr locales. The general strategy I employed was to remove the special locales from Cldr, with the 3 exceptions mentioned

"doi,BASIC", // CLDR locale
"nn,MODERN", // CLDR locale
"hnj,MODERN", // Maximum coverage (hmong)
"br,MODERATE", // Maximum (Breton)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't understand this change?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

br was removed from Cldr in Locales.txt because it was one of the special locales. Consequently this test failed, unless br was removed from the test.

// Note: ALL of these were found in Locales.txt under cldr.
"chr", // Cherokee
"gd", // Scottish Gaelic, Gaelic
"fo", 8000 // Faroese
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think the HIGH_LEVEL_LOCALES should also go away; they were intended to align with the Specials above basic, but we are no longer tracking specials. However, that can be in a different PR.

@macchiati macchiati marked this pull request as draft October 2, 2025 18:32
@macchiati
Copy link
Member

Marking as draft because we don't want to merge before branch.

@AEApple
Copy link
Contributor
AEApple commented Oct 2, 2025

Okay, in that case in a follow-on PR, can we move the CLDR org to be the first organization in the locales.txt file and add an explanation of what it contains in the comment at the top of the file?

@macchiati
Copy link
Member

We should add t A354 he comment, but what I'd like to do is move to using the reformatting tool.

@macchiati
Copy link
Member

That is, copy in https://github.com/unicode-org/cldr-staging/blob/main/docs/charts/48/tsv/coverage_goals.tsv instead (it reformats with each charts build), and do the ticket to clean out the deadwood orgs (that haven't contributed in (say) 5 years.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants
0