Which character set for Big 5 plus HKSCS is used by ICU?


by Yingying Zhao


Mr. Rhoten:

In your last post, you said: "ICU has more than one HKSCS mapping table. The Big5-HKSCS converter in ICU uses the Unicode 4.0 or 4.1 mappings by default (I forget which one)." Do you mean that ICU uses different mapping tables for different components? Do they coexist in one version? And do those mapping tables change when ICU is updated to a new version?

I searched Google over the past few days and found several charset tables used in ICU, but I don't know which one is used by default.

http://source.icu-project.org/repos/icu/data/trunk/charset/data/ucm/macos-2566-10.2.ucm

http://source.icu-project.org/repos/icu/data/trunk/charset/data/ucm/windows-950_hkscs-2001.ucm

http://source.icu-project.org/repos/icu/data/trunk/charset/data/ucm/ibm-1375_P100-2004.ucm

Regards

 

Zhao Yingying

Internationalization Development Engineer

SAS Research and Development (Beijing) Co., Ltd.

TEL: +86 10 63103355

Email:yingying.zhao@...

Web: www.sas.com

SAS ... The Power to Know


-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
icu-support mailing list - icu-support@...
To Un/Subscribe: https://lists.sourceforge.net/lists/listinfo/icu-support

Re: Which character set for Big 5 plus HKSCS is used by ICU?

by George Rhoten

ICU comes with 2 Big5-HKSCS variants by default.  I count at least 7
variant Big5-HKSCS tables in ICU's charset repository, which is what you
found.  The 2 Big5-HKSCS tables in ICU coexist with each other in much the
same manner that ICU has 3 variant Shift-JIS tables, 3 variant EUC-JP
tables, 2 variant Big5 (non-HKSCS) tables, and about 6 variant
KSC_5601/EUC-KR tables.  Since ICU is multi-platform in nature, ICU has to
work with all the quirks on each platform.  The variant table that ICU
uses depends on the alias you use to request the table (don't use the
canonical table name, since those names can change when the tables are
updated).
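
As a rough illustration of requesting a converter by alias rather than by
canonical name, here is a minimal sketch. It does not use ICU: it relies on
Python's standard `codecs` registry as an analogy, and the alias spellings
and the `canonical_name` helper are assumptions chosen for the example.
ICU's own aliases and canonical table names differ from Python's.

```python
import codecs

# Several alias spellings that Python's registry maps to one Big5-HKSCS codec.
# (These are Python's aliases, used here only as an analogy for ICU's.)
aliases = ["big5-hkscs", "Big5_HKSCS", "hkscs"]


def canonical_name(alias: str) -> str:
    """Return the canonical codec name the registry resolves an alias to."""
    return codecs.lookup(alias).name


# Callers ask for a stable alias; the registry decides which table backs it.
resolved = {alias: canonical_name(alias) for alias in aliases}

for alias, name in resolved.items():
    print(f"{alias!r} -> {name!r}")
```

The point of the sketch is that calling code only ever names an alias, so
the table behind that alias can be renamed or replaced without the caller
changing.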

If you think this is all too complicated, just use the latest version of
Unicode (ISO-10646).  Unicode is more portable than Big5-HKSCS and many
other charsets.
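
A minimal sketch of that suggestion, again using Python's standard codecs
rather than ICU (an assumption made to keep the example self-contained):
legacy Big5-HKSCS bytes are decoded to Unicode once at the boundary, and
everything after that is stored and exchanged as UTF-8. The sample string is
hypothetical data.

```python
# Hypothetical sample text ("Hong Kong"), made of common Big5 characters.
text = "香港"

# Simulate legacy input: bytes as they might arrive in Big5-HKSCS.
legacy_bytes = text.encode("big5hkscs")

# Decode once at the boundary; the mapping table only matters at this step.
decoded = legacy_bytes.decode("big5hkscs")

# From here on, keep the data in a Unicode encoding such as UTF-8.
utf8_bytes = decoded.encode("utf-8")

print(len(legacy_bytes), len(utf8_bytes))
```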

If you want to explore what charset conversion tables are in ICU, you may
find ICU's Converter Explorer helpful:
http://demo.icu-project.org/icu-bin/convexp

You may also find this old presentation helpful to understand some of the
problems during the charset conversion process:
http://www.icu-project.org/docs/papers/conversions_and_mappings_iuc19.ppt

George Rhoten
IBM Globalization Center of Competency/ICU  San José, CA, USA
http://www.icu-project.org/



"Yingying Zhao" <yingying.zhao@...>
Sent by: icu-support-bounces@...
05/11/2007 12:40 AM
Please respond to: ICU support mailing list <icu-support@...>
To: George Rhoten/San Jose/IBM@IBMUS, <icu-support@...>
Subject: [icu-support] Which character set for Big 5 plus HKSCS is used by ICU?


Mr. Rhoten:
In your last post, you said: "ICU has more than one HKSCS mapping table. The
Big5-HKSCS converter in ICU uses the Unicode 4.0 or 4.1 mappings by
default (I forget which one)."  Do you mean that ICU uses different mapping
tables for different components? Do they coexist in one version? And do
those mapping tables change when ICU is updated to a new version?
I searched Google over the past few days and found several charset tables
used in ICU, but I don't know which one is used by default.


