11660 common.UTF-8.src should be compiled from CLDR data

Review Request #2282 — Created Sept. 5, 2019 and submitted

yuripv
illumos-gate
master
11660
7f9deb6...
general

Compile data/common.UTF-8.src from UnicodeData.txt and UTF-8 charmap.

tools/utf8-rollup.pl comes from FreeBSD, where I did modify it to do the same. It didn't have any license/copyrights there, so I'm not adding for illumos.

Add basic README while here (to be updated when next CLDR release comes in).

Just using it. Same was done in FreeBSD almost an year ago and doesn't seem to introduce any issues.

  • 0
  • 0
  • 0
  • 1
  • 1
Description From Last Updated
richlowe
  1. 
      
  2. This didn't need an any updates too, right?

    1. No, UTF-8.cm is updated only for new CLDR releases. Script gets character class values from UnicodeData.txt, and uses UTF-8.cm to do the wide-character code -> name mapping.

    2. Or it's possible I just don't understand what you are asking :)

      The script is run from usr/src/data/locale like perl tools/utf8-rollup.pl --unidata=/path/to/UnicodeData.txt once after updating CLDR to new release.

    3. You understood, I just wanted to make sure nothing was missing (and that I understood why nothing was missing) :)
      Thanks.

  3. 
      
yuripv
richlowe
  1. Ship It!
  2. 
      
yuripv
yuripv
Review request changed

Status: Closed (submitted)

Loading...