The Unicode CLDR 47 Alpha is now available for integration testing.
CLDR provides key building blocks for software to support the world's languages (dates, times, numbers, sort-order, etc.) For example, all major browsers and all modern mobile phones use CLDR for language support. (See Who uses CLDR?)
Via the online Survey Tool, contributors supply data for their languages — data that is widely used to support much of the world’s software. This data is also a factor in determining which languages are supported on mobile phones and computer operating systems.
The alpha has already been integrated into the development version of ICU. We would especially appreciate feedback from non-ICU consumers of CLDR data and on Migration issues. Feedback can be filed at CLDR Tickets.
CLDR 47 focused on MessageFormat 2.0 and tooling for an expansion of DDL support. It was a closed cycle: locale data changes were limited to bug fixes and the addition of new locales, mostly regional variants.
RBNF improvements and Transforms
CLDR added Gujarati RBNF support, which provides number spell out functionality, and made improvements to many other languages.
Transforms were also improved in both CLDR 46.1 and 47 releases which included:
Adding a Hant-Latn transliterator
Aliasing Hans-Latn to Hani-Latn
Improvements to several other transliterators
More regional variants
Over the past few years there have been an increasing number of requests for locales to be added to languages, such as English, when they are commonly used in a region as a lingua franca.
CLDR has been adding additional child locales to support these requests and has begun restructuring inheritance to allow for better maintenance of shared regional data, such as currency symbols and metazone names.
46.1 Improvements
CLDR 46.1 was a special interim release of CLDR that focused on MessageFormat 2.0. It included a few additional changes:
More explicit well-formedness and validity constraints for unit of measurement identifiers
Addition of derived emoji annotations that were missing: emoji with skin tones facing right
Fixes to make the ja, ko, yue, zh datetimeSkeletons useful for generating the standard patterns
Improved date/time test data
For more information, see 46.1 Changes
Tooling changes
Many tooling changes are difficult to accommodate in a data-submission release, including performance work and UI improvements. The changes in CLDR 47 provide faster turn-around for linguists, and higher data quality. They are targeted at the CLDR 48 submission period, starting in April, 2025.
For more information
See the draft CLDR v47 Release Note, which has information on accessing the data, reviewing charts of the changes, and — importantly — Migration issues.