I18n:Updating Unicode version

From MozillaWiki
Revision as of 12:53, 15 May 2006 by Smontagu (talk | contribs)
Jump to navigation Jump to search

I18n:Home Page

This document describes the process of updating the files in the Mozilla codebase that are generated from Unicode data files.

Case conversion

  1. Download the latest version of UnicodeData.txt from the Unicode website. The current version can be found at http://www.unicode.org/Public/UNIDATA/UnicodeData.txt.
  2. Copy this file to intl/unicharutil/tools/UnicodeData-Latest.txt in the mozilla source tree
  3. Until bug 210501 is fixed you will have to edit UnicodeData-Latest.txt by hand and delete all the lines for codepoints above FFFF
  4. Run perl gencasetable.pl in intl/unichar/tools. This creates a new version of intl/unicharutil/src/casetable.h

Character properties

  1. Download the latest version of UnicodeData.txt from the Unicode website. The current version can be found at http://www.unicode.org/Public/UNIDATA/UnicodeData.txt.
  2. Copy this file to intl/unicharutil/tools/UnicodeData-Latest.txt in the mozilla source tree
  3. Run perl gencattable.pl in intl/unichar/tools. This creates a new version of intl/unicharutil/src/cattable.h
  4. The previous step will probably issue warnings like the following:
WARNING, Unicode Database now contain characters which we have not considered.
change this program !!!
Problem- U+010900 - U+010907 range

In this case, you will need to edit @range in gencattable.pl to include the new ranges

Normalization

Transliteration

  1. Download the latest version of UnicodeData.txt from the Unicode website. The current version can be found at http://www.unicode.org/Public/UNIDATA/UnicodeData.txt.
  2. Copy this file to intl/unicharutil/tools/UnicodeData-Latest.txt in the mozilla source tree
  3. Run perl gentransliterate.pl in intl/unichar/tools. This creates a new version of intl/unicharutil/tables/transliterate.properties

Bidi

  1. Download the latest version of BidiMirroring.txt from the Unicode website. The current version can be found at http://www.unicode.org/Public/UNIDATA/BidiMirroring.txt
  2. Copy this file to layout/tools/BidiMirroring.txt in the mozilla source tree
  3. Run perl gensymmtable.pl in layout/tools. This creates a new version of layout/base/symmtable.h
  4. Download the latest version of UnicodeData.txt from the Unicode website. The current version can be found at http://www.unicode.org/Public/UNIDATA/UnicodeData.txt.
  5. Copy this file to layout/tools/UnicodeData-Latest.txt in the mozilla source tree
  6. Run perl genbidicattable.pl in layout/tools. This creates a new version of layout/base/bidicattable.h
  7. The previous step will probably issue warnings like the following:
WARNING, Unicode Database now contain characters which we have not considered.
change this program !!!
Problem- U+010900 - U+010907 range

In this case, you will need to edit @range in genbidicattable.pl to include the new ranges