CJK Support Notes
Introduction
Chinese, Japanese, and Korean (CJK) all have more than 256 characters
that could not be represented using single byte character sets.
Instead, double bytes are used to represent CJK characters.
Several double-byte character sets are for used Chinese (GB, BIG5, HZ), Japanese
(Shift-JIS, JIS, EUC-JIS), and Korean (KSC). Universal
character sets that include all CJK characters are under development
(ISO2022, UTF7, UTF8).
Different input methods are also required to input CJK files.
The most popular ones include PY for Chinese, KK for Japanese, and HG
for Korean. Intelligent Mode are frequently used for Chinese and Japanese
input. It 'guesses' the next word that the user is likely to input
based on common phrases. This greatly increases inputting speed.
CJK versions of Windows support CJK applications, however,
they are quite expansive. Instead, a CJK system is frequently
used on top of regular Windows. There are different CJK systems for
different platforms and operating systems. These CJK systems
allow users to use any application to view/edit CJK files, as if
the applications are CJK applications. Thus once a CJK system is
installed, the user no longer need to use specific programs to do
editing/viewing, all existing applications will support CJK.
Listed below is a summary of several CJK systems that are commercially available.
Blanks denote information that are not currently available.
For more detailed information on CJK encodings, input methods, and
various CJK applications, please see CJK Notes.
CJK Systems
[
UnionWay |
CStar |
MView |
TwinBridge |
AsianBridge |
NJStar |
WinMASS Lite
]
UnionWay (http://www.unionway.com/)
- Platform/OS:
- Win 3.x, Win95, NT, 3.x for WorkGroups, compatible with CJK versions of Win 3.x and Win95.
- Fonts:
- GB, BIG5, HZ, JIS, Shift-JIS, KSC
- Character Sets:
- TSJK, can add other sets
- Input Methods:
- PY, ZY, CJ, CAN, EC, RK, KK, HK, HG
- Intelligent Mode:
- Pinyin, Zhuyin, Rama-Kanji
- TrueType Fonts:
- Provides TTF Fonts at addtional costs
- Graphics Compatibility:
- PageMaker, PhotoShop and CorelDraw
- Warranty:
- Technical Support:
- Pricing:
- http://www.unionway.com/purchase.htm
- Licensing:
- sales@unionway.com
- Notes:
- Supports MS Office
Chinese Star (http://www.suntendyusa.com/)
- Platform/OS:
- Win 3.x Win95, NT
- Fonts:
- GB, BIG5, Unicode on NT version
- Character Sets:
- TS
- Input Methods:
- PY, NPY, QW, ZM, PZM, WBS, WBD, WBB, EC, Tabular.
- Intelligent Mode:
- Pinyin
- TrueType Fonts:
- 42 TTF Fonts
- Graphics Compatibility:
- PageMaker, Powerpoint, PhotoShop, PaintBrush.
- Warranty:
- Lifetime Replacment Warranty For Defective Parts.
- Technical Support:
- Lifetime support via e-mail.
- Pricing: http://www.suntendyusa.com/buycstar.html
- Licensing:
- http://www.suntendyusa.com/license.html
- Notes:
MView (http://www.ifcss.org/ftp-pub/software/ms-win/c-sys/msystem.txt)
- Platform/OS:
- Win 3.x, Win95
- Fonts:
- GB, HZ, BIG5, JIS, EUC-JIS, Shift-JIS, KSC, UTF-7, UTF-8
- Character Sets:
- TSJKU
- Input Methods:
- PY, NPY, CAN, CJ, WB, 4C, KK, FC, ST, HG
- Intelligent Mode:
- Chinese, Japanese
- TrueType Fonts:
- None
- Graphics Compatibility:
- Warranty:
- SHAREWARE, NO WARRANTY
- Technical Support:
- Pricing:
- 30 day Free Trial,
- Licensing:
- Notes:
TwinBridge (http://www.twinbridge.com)
- Platform/OS:
- Win 3.x, Win 95, NT, Mac
- Fonts: BIG5, GB, JIS, KSC
- Character Sets: TSJK
- Input Methods: PY, ZY, CJ, QW, etc.
- Intelligent Mode:
- Pinyin
- TrueType Fonts:
- 27 Chinese fonts, 15 Japanese fonts, 4 Korean fonts, support 3rd party TTF fonts.
- Graphics Compatibility:
- PageMaker, CorelDraw
- Warranty:
- Technical Support:
- http://www.twinbridge.com/html/techsup.html, techsup@twinbridge.com
- Pricing:
- $149 Each Partner ($447 total) tbsales@twinbridge.com
- Licensing:
- Notes:
- Supports Microsoft Office, AutoDetect GB/BIG5, AutoDetect Shift-JIS/EUC-JIS
AsianBridge (http://www.twinbridge.com/html/csuite/csdoc.html)
- Platform/OS:
- Win 3.1x, Win 95 (also asian versions), OS/2, PowerMac
- Fonts:
- GB, BIG5, HZ, Shift-JIS, JIS, EUC-JIS, KSC, ISO2022, MIME
- Character Sets:
- TSJK
- Input Methods:
- PY, ZY, etc
- Intelligent Mode:
- Pinyin, Zhuyin
- TrueType Fonts:
- Graphics Compatibility:
- Warranty:
- Technical Support:
- techsup@twinbridge.com
- Pricing:
- $149, $79 sale tbsales@twinbridge.com
- Licensing:
- Notes:
- Supports viewing of web pages, email, news group and Internet applications, Auto Code Detection
NJStar (http://www.njstar.com.au)
- Platform/OS:
- Win 3.1x, Win 95, Win 98, NT
- Fonts:
- GB, BIG5, HZ, JIS, Shift-JIS, EUC-JIS, KSC, ISO2022, Support Unicode
- Character Sets:
- TSJK
- Input Methods:
- 20 Chinese methods, PY, ZY, CAN, WB, CJ, BS, ... RK, KK, FC, ST, NI, UI, RL, EJ
- Intelligent Modes:
- Chinese
- TrueType Fonts:
- in Professional Version
- Graphics Compatibility:
- Warranty:
- Technical Support:
- support@njstar.com
- Pricing: $99 Chinese/Japanese Word Processor
- Licensing:
- orders@njstar.com
- Note:
- AutoDetection of Chinese and Japanese Codes
WinMASS Lite (http://www.starglobe.com.sg/)
- Platform/OS:
- Win 3.1, Win 3.22 running Win32s, UNIX
- Fonts:
- UTF-7, UTF-8, EACC(Library Edition), GB, HZ, HZX, BIG5, CNS, Shift-JIS, EUC-JIS, ISO2022, KSC, ISO8859-1
- Character Sets:
- TSJKU, can add other sets
- Input Methods:
- PY, ZY, CJ, JBS, SN, EC
- Intelligent Modes:
- TrueType Fonts:
- Graphics Compatibility:
- Warranty:
- Technical Support:
- http://www.starglobe.com.sg/cgi-bin/multi/techsupp/home.html
- Pricing:
- Licensing:
- Notes: Website very slow
A few other systems that are designed specifically for one character set, or can only
support viewing but no editing, or are hard to find information from, are listed
under separate document.
Recommandations
Below is a price comparison chart of 4 most frequently used systems/programs:
UW-Asian StdPack 97 (Demo 60 Free)
TSJK bmp fonts $59 / 1 user
$325 / 10 user
$12 shipping
Chinese Star Overseas Edition v2.97 (Demo Free)
TS ttf fonts, $100 / 5-10 users license
$9.5 shipping
NJWIN CJK Internet Viewer (viewer only)
TSJK $49 / 1 user
MView System V1.00 (16-bit Windows)
TSJKU bmp fonts $18 / 1 user
$38 / 3 user
$48 / 5 user
NJWin is viewer only, and Chinese Star(CStar) only supports Chinese.
The MView System V1.00 is clearly much cheaper, however, it only
supports 16-bit Windows, and could not work well on Win95 or Windows
NT.
For cross platform support, UW-Asian StdPack 97 is strongly
recommended.
UnionWay has all CJK fonts, the ability to add more
character sets, has several popular input methods, supports intelligent
mode, and are compatible to MS Office and several graphics programs.
NJWIN is the best CJK viewing system, it autodetects between different
character sets, thus allows user to view files that uses multiple
encodings.
Chinese Star is the best system for Chinese, however,
it does not support JK.
Glossary
- 4C
- Four Corners Method, for Chinese inputting. Each character is divided
into 4 parts,
- CAN
- Cantonese input method, for Chinese. This is essentially pinyin method using Cantonese sounds.
- CJ
- Cangjie Method, for Chinese inputting.
- CJK Character Set
- A set of characters defined for one or more languages. In most
cases one character set defined one language, although there are
exceptions (ie, ISO-8859-1).
- Chinese, Japanese, and Korean.
- EC
- English-Chinese input method. For Chinese.
- EJ
- English-Japanese input method. For Japanese.
- Encoding
- Encoding is a method by which a document or message converts to
computerized data. One encoding can be used by multiple languages,
and one language may have several different encodings.
- EUC-JIS
- Japanese encoding. 8-bit. Used mostly on Unix.
- FC
- Four Corners input method for Japanese.
- GB
- GuoBiao encoding for Chinese. This is the most common encoding
for places using simplified Chinese. Typically used in mainland
China and Singapore. 8-bit.
- HG
- Hangul input method for Korean. Most common Korean input method.
- HJ
- Hanja input method. For Korean.
- HZ
- HanZi encoding for Chinese, a variation of GB. 7-bit. Created mostly to support mixed ASCII/GB network file exchange and editing.
- J
- Japanese Character Set.
- JBS
- Jianyi Bushou (Simplified Radical Lookup) input method, for Chinese.
- JIS
- Japanese encoding. 7-bit. Used mostly to support 7-bit internet mail/news.
- K
- Korean Character Set.
- KK
- Kana-Kanji input method. Most common Japanese input method.
- NDPY
- New Double Pinyin input method, for Chinese.
- NPY
- No-tune Pinyin input method, or New Pinyin input method. For Chinese inputting.
- NI
- Nelson Index input method, for Japanese.
- PY
- Pinyin input method. Most popular Chinese input method.
- PZM
- Popularized Zhengma input method, for Chinese.
- QW
- Quwei input method, for Chinese.
- RK
- Roma-Kanji input method, for Japanese.
- RL
- Radical Lookup input method, for Japanese.
- S
- Simplified Chinese Character Set.
- Shift-JIS (SJIS, or S-JIS)
- Japanese encoding. 8-bit. Used mostly on Mac/PC.
- SN
- Stroke Number input method, for Chinese.
- ST
- Strokes input method, for Japanese.
- T
- Traditional Chinese Character Set.
- TC
- Telecode input method. For Chinese. Each character is represented by a telegraph code in Mainland China.
- U
- Unicode Character Set.
- UI
- Unicode input method, for Chinese, Japanese, Korean.
- WB
- Wubi (also called Five Strokes Method) input method, for Chinese.
- WBB
- Wubi Bridge input method, for Chinese.
- WBD
- Wubi Drawing input method, for Chinese.
- WBS
- Wubi Shape input method, for Chinese.
- WM
- WangMa input method, for Chinese.
- ZM
- ZhengMa input method, for Chinese.
- ZY
- Zhuyin input method, for Chinese. Popular in Taiwan.
jeanz@rice.edu