About us
Contact us

C. Character Encodings

Contents | Previous Chapter

Framework ID MIBenum Standard Name Java Name Description
UTF_8 106 UTF-8 UTF8 UTF-8
UTF_16 -1 UTF-16 Unicode UTF-16
ISO_8859_1 4 ISO-8859-1 ISO8859_1 ISO 8859-1, Latin alphabet No. 1
ISO_8859_2 5 ISO-8859-2 ISO8859_2 ISO 8859-2, Latin alphabet No. 2
ISO_8859_3 6 ISO-8859-3 ISO8859_3 ISO 8859-3, Latin alphabet No. 3
ISO_8859_4 7 ISO-8859-4 ISO8859_4 ISO 8859-4, Latin alphabet No. 4
ISO_8859_5 8 ISO-8859-5 ISO8859_5 ISO 8859-5, Latin/Cyrillic alphabet
ISO_8859_6 9 ISO-8859-6 ISO8859_6 ISO 8859-6, Latin/Arabic alphabet
ISO_8859_7 10 ISO-8859-7 ISO8859_7 ISO 8859-7, Latin/Greek alphabet
ISO_8859_8 11 ISO-8859-8 ISO8859_8 ISO 8859-8, Latin/Hebrew alphabet
ISO_8859_9 12 ISO-8859-9 ISO8859_9 ISO 8859-9, Latin alphabet No. 5
ISO_8859_13 -1 ISO-8859-13 ISO8859_13 ISO 8859-13, Latin alphabet No. 7
ISO_8859_15 -1 ISO-8859-15 ISO8859_15_FDIS ISO 8859-15, Latin alphabet No. 9
Big5 2026 Big5 Big5 Big5, Traditional Chinese
IBM037 2028 IBM037 Cp037 USA, Canada (Bilingual, French), Netherlands, Portugal, Brazil, Australia
IBM273 2030 IBM273 Cp273 IBM Austria, Germany
IBM277 2033 IBM277 Cp277 IBM Denmark, Norway
IBM278 2034 IBM278 Cp278 IBM Finland, Sweden
IBM280 2035 IBM280 Cp280 IBM Italy
IBM284 2037 IBM284 Cp284 IBM Catalan/Spain, Spanish Latin America
IBM285 2038 IBM285 Cp285 IBM United Kingdom, Ireland
IBM297 2040 IBM297 Cp297 IBM France
IBM420 2041 IBM420 Cp420 IBM Arabic
IBM424 2043 IBM424 Cp424 IBM Hebrew
IBM437 2011 IBM437 Cp437 MS-DOS United States, Australia, New Zealand, South Africa
IBM500 2044 IBM500 Cp500 EBCDIC 500V1
IBM737 -1 IBM737 Cp737 PC Greek
IBM775 2087 IBM775 Cp775 PC Baltic
IBM838 -1 IBM838 Cp838 IBM Thailand extended SBCS
IBM850 2009 IBM850 Cp850 MS-DOS Latin-1
IBM852 2010 IBM852 Cp852 MS-DOS Latin-2
IBM855 2046 IBM855 Cp855 IBM Cyrillic
IBM856 -1 IBM856 Cp856 IBM Hebrew
IBM857 2047 IBM857 Cp857 IBM Turkish
IBM858 -1 IBM858 Cp858 Variant of IBM850 with Euro character
IBM860 2048 IBM860 Cp860 MS-DOS Portuguese
IBM861 2049 IBM861 Cp861 MS-DOS Icelandic
IBM862 2013 IBM862 Cp862 PC Hebrew
IBM863 2050 IBM863 Cp863 MS-DOS Canadian French
IBM864 2051 IBM864 Cp864 PC Arabic
IBM865 2052 IBM865 Cp865 MS-DOS Nordic
IBM866 2086 IBM866 Cp866 MS-DOS Russian
IBM868 2053 IBM868 Cp868 MS-DOS Pakistan
IBM869 2054 IBM869 Cp869 IBM Modern Greek
IBM870 2055 IBM870 Cp870 IBM Multilingual Latin-2
IBM871 2056 IBM871 Cp871 IBM Iceland
IBM874 -1 IBM874 Cp874 IBM Thai
IBM875 -1 IBM875 Cp875 IBM Greek
IBM918 2062 IBM918 Cp918 IBM Pakistan (Urdu)
IBM921 -1 IBM921 Cp921 IBM Latvia, Lithuania (AIX, DOS)
IBM922 -1 IBM922 Cp922 IBM Estonia (AIX, DOS)
IBM930 -1 IBM930 Cp930 Japanese Katakana-Kanji mixed with 4370 UDC, superset of 5026
IBM933 -1 IBM933 Cp933 Korean Mixed with 1880 UDC, superset of 5029
IBM935 -1 IBM935 Cp935 Simplified Chinese Host mixed with 1880 UDC, superset of 5031
IBM937 -1 IBM937 Cp937 Traditional Chinese Host mixed with 6204 UDC, superset of 5033
IBM939 -1 IBM939 Cp939 Japanese Latin Kanji mixed with 4370 UDC, superset of 5035
IBM942 -1 IBM942 Cp942 Japanese (OS/2) superset of 932
IBM942C -1 IBM942C Cp942C Variant of IBM942
IBM943 -1 IBM943 Cp943 Japanese (OS/2) superset of 932 and Shift-JIS
IBM943C -1 IBM943C Cp943C Variant of IBM943
IBM948 -1 IBM948 Cp948 OS/2 Chinese (Taiwan) superset of 938
IBM949 -1 IBM949 Cp949 PC Korean
IBM949C -1 IBM949C Cp949C Variant of IBM949
IBM950 -1 IBM950 Cp950 PC Chinese (Hong Kong, Taiwan)
IBM964 -1 IBM964 Cp964 AIX Chinese (Taiwan)
IBM970 -1 IBM970 Cp970 AIX Korean
IBM1006 -1 IBM1006 Cp1006 IBM AIX Pakistan (Urdu)
IBM1025 -1 IBM1025 Cp1025 IBM Multilingual Cyrillic: Bulgaria, Bosnia, Herzegovina, Macedonia (FYR)
IBM1026 2063 IBM1026 Cp1026 IBM Latin-5, Turkey
IBM1046 -1 IBM1046 Cp1046 IBM Open Edition US EBCDIC
IBM1097 -1 IBM1097 Cp1097 IBM Iran (Farsi)/Persian
IBM1098 -1 IBM1098 Cp1098 IBM Iran (Farsi)/Persian (PC)
IBM1112 -1 IBM1112 Cp1112 IBM Latvia, Lithuania
IBM1122 -1 IBM1122 Cp1122 IBM Estonia
IBM1123 -1 IBM1123 Cp1123 IBM Ukraine
IBM1124 -1 IBM1124 Cp1124 IBM AIX Ukraine
IBM1140 -1 IBM1140 Cp1140 Variant of IBM037 with Euro character
IBM1141 -1 IBM1141 Cp1141 Variant of IBM273 with Euro character
IBM1142 -1 IBM1142 Cp1142 Variant of IBM277 with Euro character
IBM1143 -1 IBM1143 Cp1143 Variant of IBM278 with Euro character
IBM1144 -1 IBM1144 Cp1144 Variant of IBM280 with Euro character
IBM1145 -1 IBM1145 Cp1145 Variant of IBM284 with Euro character
IBM1146 -1 IBM1146 Cp1146 Variant of IBM285 with Euro character
IBM1147 -1 IBM1147 Cp1147 Variant of IBM297 with Euro character
IBM1148 -1 IBM1148 Cp1148 Variant of IBM500 with Euro character
IBM1149 -1 IBM1149 Cp1149 Variant of IBM871 with Euro character
Windows_1250 2250 windows-1250 Cp1250 Windows Eastern European
Windows_1251 2251 windows-1251 Cp1251 Windows Cyrillic
Windows_1252 2252 windows-1252 Cp1252 Windows Latin-1
Windows_1253 2253 windows-1253 Cp1253 Windows Greek
Windows_1254 2254 windows-1254 Cp1254 Windows Turkish
Windows_1255 2255 windows-1255 Cp1255 Windows Hebrew
Windows_1256 2256 windows-1256 Cp1256 Windows Arabic
Windows_1257 2257 windows-1257 Cp1257 Windows Baltic
Windows_1258 2258 windows-1258 Cp1258 Windows Vietnamese
IBM1381 -1 IBM1381 Cp1381 IBM OS/2, DOS People's Republic of China (PRC)
IBM1383 -1 IBM1383 Cp1383 IBM AIX People's Republic of China (PRC)
IBM33722 -1 IBM33722 Cp33722 IBM-eucJP - Japanese (superset of 5050)
GB2312 2025 GB2312 EUC_CN GB2312, EUC encoding, Simplified Chinese
EUC_JP 18 EUC-JP EUC_JP JIS0201, 0208, 0212, EUC Encoding, Japanese
EUC_KR 38 EUC-KR EUC_KR KS C 5601, EUC Encoding, Korean
CNS11643 -1 CNS11643 EUC_TW CNS11643 (Plane 1-3), Traditional Chinese, EUC encoding
GBK -1 GBK GBK GBK, Simplified Chinese
ISO_2022_CN 104 ISO-2022-CN ISO2022CN ISO 2022 CN, Chinese (conversion to Unicode only)
ISO_2022_CN_CNS -1 ISO-2022-CN-CNS ISO2022CN_CNS CNS 11643 in ISO-2022-CN form, Traditional Chinese (conversion from Unicode only)
ISO_2022_CN_GB -1 ISO-2022-CN-GB ISO2022CN_GB GB 2312 in ISO-2022-CN form, Simplified Chinese (conversion from Unicode only)
ISO_2022_JP 39 ISO-2022-JP ISO2022JP JIS0201, 0208, 0212, ISO2022 Encoding, Japanese
ISO_2022_KR 37 ISO-2022-KR ISO2022KR ISO 2022 KR, Korean
JIS_X0201 15 JIS_X0201 JIS0201 JIS 0201, Japanese
JIS_X0208_1983 63 JIS_X0208-1983 JIS0208 JIS 0208, Japanese
JIS_X0212_1990 98 JIS_X0212-1990 JIS0212 JIS 0212, Japanese
JISAutoDetect -1 JISAutoDetect JISAutoDetect Detects and converts from Shift-JIS, EUC-JP, ISO 2022 JP (conversion to Unicode only)
Johab -1 Johab Johab Johab, Korean
KOI8_R 2084 KOI8-R KOI8_R KOI8-R, Russian
Windows_874 -1 windows-874 MS874 Windows Thai
Windows_932 -1 windows-932 MS932 Windows Japanese
Windows_936 -1 windows-936 MS936 Windows Simplified Chinese
Windows_949 -1 windows-949 MS949 Windows Korean
Windows_950 -1 windows-950 MS950 Windows Traditional Chinese
MacArabic -1 MacArabic MacArabic Macintosh Arabic
MacCentralEurope -1 MacCentralEurope MacCentralEurope Macintosh Latin-2
MacCroatian -1 MacCroatian MacCroatian Macintosh Croatian
MacCyrillic -1 MacCyrillic MacCyrillic Macintosh Cyrillic
MacDingbat -1 MacDingbat MacDingbat Macintosh Dingbat
MacGreek -1 MacGreek MacGreek Macintosh Greek
MacHebrew -1 MacHebrew MacHebrew Macintosh Hebrew
MacIceland -1 MacIceland MacIceland Macintosh Iceland
MacRoman -1 MacRoman MacRoman Macintosh Roman
MacRomania -1 MacRomania MacRomania Macintosh Romania
MacSymbol -1 MacSymbol MacSymbol Macintosh Symbol
MacThai -1 MacThai MacThai Macintosh Thai
MacTurkish -1 MacTurkish MacTurkish Macintosh Turkish
MacUkraine -1 MacUkraine MacUkraine Macintosh Ukraine
Shift_JIS 17 Shift_JIS SJIS Shift-JIS, Japanese
TIS_620 2259 TIS-620 TIS620 TIS-620, Thai
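
The Standard Name and Java Name columns can be checked against a running JVM. The following is a minimal sketch, not part of the framework: the class `EncodingLookup` and its two helper methods are hypothetical names introduced here for illustration. It uses only the standard `java.nio.charset.Charset` API and assumes that the JVM recognizes the historical Java name as an alias of the standard name. That holds for common entries such as `UTF8` and `ISO8859_1`, but many of the IBM, Macintosh and East Asian encodings in the table are optional and may be missing from a given runtime.

```java
import java.nio.charset.Charset;

// Hypothetical helper class, not part of the framework described in this appendix.
final class EncodingLookup {

    // Returns the canonical java.nio name for a charset name or alias,
    // or null if this JVM does not support the name or the name is malformed.
    static String canonicalName(String name) {
        try {
            return Charset.isSupported(name) ? Charset.forName(name).name() : null;
        } catch (IllegalArgumentException e) {
            // Thrown for null names and for syntactically illegal charset names.
            return null;
        }
    }

    // Number of bytes the given text occupies in the named encoding.
    // Charset.forName throws an unchecked exception if the name is not supported.
    static int encodedLength(String text, String charsetName) {
        return text.getBytes(Charset.forName(charsetName)).length;
    }

    public static void main(String[] args) {
        // A few Java Name values taken from the table above.
        String[] javaNames = {"UTF8", "ISO8859_1", "Cp1252", "SJIS"};
        for (String javaName : javaNames) {
            String canonical = canonicalName(javaName);
            System.out.println(javaName + " -> "
                    + (canonical != null ? canonical : "(not supported by this JVM)"));
        }

        // U+00E9 (e with acute accent) takes one byte in ISO-8859-1 and two in UTF-8.
        System.out.println(encodedLength("\u00e9", "ISO-8859-1"));
        System.out.println(encodedLength("\u00e9", "UTF-8"));
    }
}
```

Checking `canonicalName` before using an encoding avoids an `UnsupportedCharsetException` at the point where text is actually converted, which matters for the less common rows of the table.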

Copyright © 2000-2009 Devsphere