I18n:Updating Unicode version: Difference between revisions

From MozillaWiki
Jump to navigation Jump to search
mNo edit summary
No edit summary
Line 8: Line 8:
#Copy this file to intl/unicharutil/tools/UnicodeData-Latest.txt in the mozilla source tree
#Copy this file to intl/unicharutil/tools/UnicodeData-Latest.txt in the mozilla source tree
#'''Until {{Bug|210501}} is fixed''' you will have to edit UnicodeData-Latest.txt by hand and delete all the lines for codepoints above FFFF
#'''Until {{Bug|210501}} is fixed''' you will have to edit UnicodeData-Latest.txt by hand and delete all the lines for codepoints above FFFF
#Run <tt>perl gencasetable.pl</tt> This creates a new version of intl/unicharutil/src/casetable.h
#Run <tt>perl gencasetable.pl</tt> in intl/unichar/tools. This creates a new version of intl/unicharutil/src/casetable.h


== Character properties ==
== Character properties ==
Line 14: Line 14:
#Download the latest version of UnicodeData.txt from the Unicode website. The current version can be found at http://www.unicode.org/Public/UNIDATA/UnicodeData.txt.
#Download the latest version of UnicodeData.txt from the Unicode website. The current version can be found at http://www.unicode.org/Public/UNIDATA/UnicodeData.txt.
#Copy this file to intl/unicharutil/tools/UnicodeData-Latest.txt in the mozilla source tree
#Copy this file to intl/unicharutil/tools/UnicodeData-Latest.txt in the mozilla source tree
#Run <tt>perl gencattable.pl</tt> This creates a new version of intl/unicharutil/src/cattable.h
#Run <tt>perl gencattable.pl</tt> in intl/unichar/tools.  This creates a new version of intl/unicharutil/src/cattable.h
#The previous step will probably issue warnings like the following:
#The previous step will probably issue warnings like the following:
  WARNING, Unicode Database now contain characters which we have not considered.
  WARNING, Unicode Database now contain characters which we have not considered.
Line 24: Line 24:
== Normalization ==
== Normalization ==
== Transliteration ==
== Transliteration ==
#Download the latest version of UnicodeData.txt from the Unicode website. The current version can be found at http://www.unicode.org/Public/UNIDATA/UnicodeData.txt.
#Copy this file to intl/unicharutil/tools/UnicodeData-Latest.txt in the mozilla source tree
#Run <tt>perl gentransliterate.pl</tt> in intl/unichar/tools. This creates a new version of intl/unicharutil/tables/transliterate.properties
== Bidi ==
== Bidi ==


#Download the latest version of BidiMirroring.txt from the Unicode website. The current version can be found at http://www.unicode.org/Public/UNIDATA/BidiMirroring.txt
#Copy this file to layout/tools/BidiMirroring.txt in the mozilla source tree
#Run <tt>perl gensymmtable.pl</tt> in layout/tools. This creates a new version of layout/base/symmtable.h
#Download the latest version of UnicodeData.txt from the Unicode website. The current version can be found at http://www.unicode.org/Public/UNIDATA/UnicodeData.txt.
#Copy this file to layout/tools/UnicodeData-Latest.txt in the mozilla source tree
#Run <tt>perl genbidicattable.pl</tt> in layout/tools. This creates a new version of layout/base/bidicattable.h
#The previous step will probably issue warnings like the following:
WARNING, Unicode Database now contain characters which we have not considered.
change this program !!!
Problem- U+010900 - U+010907 range
In this case, you will need to edit <tt>@range</tt> in genbidicattable.pl to include the new ranges
[[Category:I18n]]
[[Category:I18n]]

Revision as of 12:53, 15 May 2006

I18n:Home Page

This document describes the process of updating the files in the Mozilla codebase that are generated from Unicode data files.

Case conversion

  1. Download the latest version of UnicodeData.txt from the Unicode website. The current version can be found at http://www.unicode.org/Public/UNIDATA/UnicodeData.txt.
  2. Copy this file to intl/unicharutil/tools/UnicodeData-Latest.txt in the mozilla source tree
  3. Until bug 210501 is fixed you will have to edit UnicodeData-Latest.txt by hand and delete all the lines for codepoints above FFFF
  4. Run perl gencasetable.pl in intl/unichar/tools. This creates a new version of intl/unicharutil/src/casetable.h

Character properties

  1. Download the latest version of UnicodeData.txt from the Unicode website. The current version can be found at http://www.unicode.org/Public/UNIDATA/UnicodeData.txt.
  2. Copy this file to intl/unicharutil/tools/UnicodeData-Latest.txt in the mozilla source tree
  3. Run perl gencattable.pl in intl/unichar/tools. This creates a new version of intl/unicharutil/src/cattable.h
  4. The previous step will probably issue warnings like the following:
WARNING, Unicode Database now contain characters which we have not considered.
change this program !!!
Problem- U+010900 - U+010907 range

In this case, you will need to edit @range in gencattable.pl to include the new ranges

Normalization

Transliteration

  1. Download the latest version of UnicodeData.txt from the Unicode website. The current version can be found at http://www.unicode.org/Public/UNIDATA/UnicodeData.txt.
  2. Copy this file to intl/unicharutil/tools/UnicodeData-Latest.txt in the mozilla source tree
  3. Run perl gentransliterate.pl in intl/unichar/tools. This creates a new version of intl/unicharutil/tables/transliterate.properties

Bidi

  1. Download the latest version of BidiMirroring.txt from the Unicode website. The current version can be found at http://www.unicode.org/Public/UNIDATA/BidiMirroring.txt
  2. Copy this file to layout/tools/BidiMirroring.txt in the mozilla source tree
  3. Run perl gensymmtable.pl in layout/tools. This creates a new version of layout/base/symmtable.h
  4. Download the latest version of UnicodeData.txt from the Unicode website. The current version can be found at http://www.unicode.org/Public/UNIDATA/UnicodeData.txt.
  5. Copy this file to layout/tools/UnicodeData-Latest.txt in the mozilla source tree
  6. Run perl genbidicattable.pl in layout/tools. This creates a new version of layout/base/bidicattable.h
  7. The previous step will probably issue warnings like the following:
WARNING, Unicode Database now contain characters which we have not considered.
change this program !!!
Problem- U+010900 - U+010907 range

In this case, you will need to edit @range in genbidicattable.pl to include the new ranges