I18n:Updating Unicode version: Difference between revisions
Jump to navigation
Jump to search
mNo edit summary |
No edit summary |
||
Line 8: | Line 8: | ||
#Copy this file to intl/unicharutil/tools/UnicodeData-Latest.txt in the mozilla source tree | #Copy this file to intl/unicharutil/tools/UnicodeData-Latest.txt in the mozilla source tree | ||
#'''Until {{Bug|210501}} is fixed''' you will have to edit UnicodeData-Latest.txt by hand and delete all the lines for codepoints above FFFF | #'''Until {{Bug|210501}} is fixed''' you will have to edit UnicodeData-Latest.txt by hand and delete all the lines for codepoints above FFFF | ||
#Run <tt>perl gencasetable.pl</tt> This creates a new version of intl/unicharutil/src/casetable.h | #Run <tt>perl gencasetable.pl</tt> in intl/unichar/tools. This creates a new version of intl/unicharutil/src/casetable.h | ||
== Character properties == | == Character properties == | ||
Line 14: | Line 14: | ||
#Download the latest version of UnicodeData.txt from the Unicode website. The current version can be found at http://www.unicode.org/Public/UNIDATA/UnicodeData.txt. | #Download the latest version of UnicodeData.txt from the Unicode website. The current version can be found at http://www.unicode.org/Public/UNIDATA/UnicodeData.txt. | ||
#Copy this file to intl/unicharutil/tools/UnicodeData-Latest.txt in the mozilla source tree | #Copy this file to intl/unicharutil/tools/UnicodeData-Latest.txt in the mozilla source tree | ||
#Run <tt>perl gencattable.pl</tt> This creates a new version of intl/unicharutil/src/cattable.h | #Run <tt>perl gencattable.pl</tt> in intl/unichar/tools. This creates a new version of intl/unicharutil/src/cattable.h | ||
#The previous step will probably issue warnings like the following: | #The previous step will probably issue warnings like the following: | ||
WARNING, Unicode Database now contain characters which we have not considered. | WARNING, Unicode Database now contain characters which we have not considered. | ||
Line 24: | Line 24: | ||
== Normalization == | == Normalization == | ||
== Transliteration == | == Transliteration == | ||
#Download the latest version of UnicodeData.txt from the Unicode website. The current version can be found at http://www.unicode.org/Public/UNIDATA/UnicodeData.txt. | |||
#Copy this file to intl/unicharutil/tools/UnicodeData-Latest.txt in the mozilla source tree | |||
#Run <tt>perl gentransliterate.pl</tt> in intl/unichar/tools. This creates a new version of intl/unicharutil/tables/transliterate.properties | |||
== Bidi == | == Bidi == | ||
#Download the latest version of BidiMirroring.txt from the Unicode website. The current version can be found at http://www.unicode.org/Public/UNIDATA/BidiMirroring.txt | |||
#Copy this file to layout/tools/BidiMirroring.txt in the mozilla source tree | |||
#Run <tt>perl gensymmtable.pl</tt> in layout/tools. This creates a new version of layout/base/symmtable.h | |||
#Download the latest version of UnicodeData.txt from the Unicode website. The current version can be found at http://www.unicode.org/Public/UNIDATA/UnicodeData.txt. | |||
#Copy this file to layout/tools/UnicodeData-Latest.txt in the mozilla source tree | |||
#Run <tt>perl genbidicattable.pl</tt> in layout/tools. This creates a new version of layout/base/bidicattable.h | |||
#The previous step will probably issue warnings like the following: | |||
WARNING, Unicode Database now contain characters which we have not considered. | |||
change this program !!! | |||
Problem- U+010900 - U+010907 range | |||
In this case, you will need to edit <tt>@range</tt> in genbidicattable.pl to include the new ranges | |||
[[Category:I18n]] | [[Category:I18n]] |
Revision as of 12:53, 15 May 2006
This document describes the process of updating the files in the Mozilla codebase that are generated from Unicode data files.
Case conversion
- Download the latest version of UnicodeData.txt from the Unicode website. The current version can be found at http://www.unicode.org/Public/UNIDATA/UnicodeData.txt.
- Copy this file to intl/unicharutil/tools/UnicodeData-Latest.txt in the mozilla source tree
- Until bug 210501 is fixed you will have to edit UnicodeData-Latest.txt by hand and delete all the lines for codepoints above FFFF
- Run perl gencasetable.pl in intl/unichar/tools. This creates a new version of intl/unicharutil/src/casetable.h
Character properties
- Download the latest version of UnicodeData.txt from the Unicode website. The current version can be found at http://www.unicode.org/Public/UNIDATA/UnicodeData.txt.
- Copy this file to intl/unicharutil/tools/UnicodeData-Latest.txt in the mozilla source tree
- Run perl gencattable.pl in intl/unichar/tools. This creates a new version of intl/unicharutil/src/cattable.h
- The previous step will probably issue warnings like the following:
WARNING, Unicode Database now contain characters which we have not considered. change this program !!! Problem- U+010900 - U+010907 range
In this case, you will need to edit @range in gencattable.pl to include the new ranges
Normalization
Transliteration
- Download the latest version of UnicodeData.txt from the Unicode website. The current version can be found at http://www.unicode.org/Public/UNIDATA/UnicodeData.txt.
- Copy this file to intl/unicharutil/tools/UnicodeData-Latest.txt in the mozilla source tree
- Run perl gentransliterate.pl in intl/unichar/tools. This creates a new version of intl/unicharutil/tables/transliterate.properties
Bidi
- Download the latest version of BidiMirroring.txt from the Unicode website. The current version can be found at http://www.unicode.org/Public/UNIDATA/BidiMirroring.txt
- Copy this file to layout/tools/BidiMirroring.txt in the mozilla source tree
- Run perl gensymmtable.pl in layout/tools. This creates a new version of layout/base/symmtable.h
- Download the latest version of UnicodeData.txt from the Unicode website. The current version can be found at http://www.unicode.org/Public/UNIDATA/UnicodeData.txt.
- Copy this file to layout/tools/UnicodeData-Latest.txt in the mozilla source tree
- Run perl genbidicattable.pl in layout/tools. This creates a new version of layout/base/bidicattable.h
- The previous step will probably issue warnings like the following:
WARNING, Unicode Database now contain characters which we have not considered. change this program !!! Problem- U+010900 - U+010907 range
In this case, you will need to edit @range in genbidicattable.pl to include the new ranges