NAME

Encode::ISO2022::CCS - coded character sets for ISO/IEC 2022 character encoding scheme

DESCRIPTION

This document lists encodings which may be used as coded character set (CCS) for ISO/IEC 2022 character encoding scheme.

From view of Perl, a CCS is the Encode::Encoding object performing stateless and fixed-length conversion over 94^n or 96^n code table (By now, Encode::ISO2022 does not support variable length multibyte CCS).

In tables below, "reg#" column indicates the number of registry in ISO International Registry of Coded Character Sets [ISO-IR].

Built-in encodings

reg# Name                  Description
----------------------------------------------------------------
  6  ascii                 ANSI X3.4-1986, equivalent to ISO 646 IRV
 -   ascii-ctrl            ISO 6429 control characters (C0)
 -   c1-ctrl               ISO 6429 control characters (C1)
----------------------------------------------------------------

Encode::ISOIRSingle

reg# Name                  Description
----------------------------------------------------------------
 13  jis-x-0201-right      JIS X 0201 katakana set
 14  iso-646-jp            JIS X 0201 Latin set, ISO/IEC 646 JP
100  iso-8859-1-right      ISO/IEC 8859 no. 1, Latin 1
101  iso-8859-2-right      ISO/IEC 8859 no. 2, Latin 2
109  iso-8859-3-right      ISO/IEC 8859 no. 3, Latin 3
110  iso-8859-4-right      ISO/IEC 8859 no. 4, Latin 4
144  iso-8859-5-right      ISO/IEC 8859 no. 5, Cyrilic set
127  iso-8859-6-right      ISO/IEC 8859 no. 6, Arabic set
126  iso-8859-7-right      ISO/IEC 8859 no. 7, Greek set
138  iso-8859-8-right      ISO/IEC 8859 no. 8, Hebrew set
148  iso-8859-9-right      ISO/IEC 8859 no. 9, Latin 5
157  iso-8859-10-right     ISO/IEC 8859 no. 10, Latin 6
166  iso-8859-11-right     ISO/IEC 8859 no. 11, Thai set
179  iso-8859-13-right     ISO/IEC 8859 no. 13, Latin 7
199  iso-8859-14-right     ISO/IEC 8859 no. 14, Latin 8
203  iso-8859-15-right     ISO/IEC 8859 no. 15, Latin 9
226  iso-8859-16-right     ISO/IEC 8859 no. 16, Latin 10
----------------------------------------------------------------

Encode::JISLegacy

 reg# Name                  Description
 ----------------------------------------------------------------
  42  jis-x-0208-1978       JIS C 6226-1978, 1st rev. of JIS X 0208
(159) jis-x-0212-ascii      JIS X 0212-1990 cooperable with ASCII
(159) jis-x-0212-cp932      JIS X 0212-1990 cooperable with CP932
 ----------------------------------------------------------------

Encodings provided by external modules

Encode::CN

reg# Name                  Description
----------------------------------------------------------------
 58  gb2312-raw            GB2312-80
n/a  gb12345-raw           GB12345-90
165  iso-ir-165            CCITT Chinese set
----------------------------------------------------------------

Encode::HanExtra

reg# Name                  Description
----------------------------------------------------------------
n/a  cccii                 CCCII
171  cns11643-1            CNS 11643 plane 1
172  cns11643-2            CNS 11643 plane 2
183  cns11643-3            CNS 11643 plane 3
184  cns11643-4            CNS 11643 plane 4
185  cns11643-5            CNS 11643 plane 5
186  cns11643-6            CNS 11643 plane 6
187  cns11643-7            CNS 11643 plane 7
n/a  cns11643-f            CNS 11643 plane 15 or 8
----------------------------------------------------------------

Encode::JISX0213::CCS

reg# Name                  Description
----------------------------------------------------------------
 87  jis-x-0208            JIS X 0208-1983, 2nd rev. of JIS X 0208
168      ditto             JIS X 0208-1990, 3rd rev. of JIS X 0208
     jis-x-0208-ascii
     jis-x-0208-jis
228  jis-x-0213-2000-plane1  JIS X 0213:2000 level 3 (plane 1)
     jis-x-0213-2000-plane1-ascii
     jis-x-0213-2000-plane1-jis
233  jis-x-0213-plane1     JIS X 0213:2004 level 3 (plane 1)
     jis-x-0213-plane1-ascii
     jis-x-0213-plane1-jis
229  jis-x-0213-plane2     JIS X 0213:2000/2004 level 4 (plane 2)
----------------------------------------------------------------
  • About "-ascii" and "-jis" variants see documentation of module.

Encode::JP

reg# Name                  Description
----------------------------------------------------------------
 87  jis0208-raw           JIS X 0208-1983, 2nd rev. of JIS X 0208
168      ditto             JIS X 0208-1990, 3rd rev. of JIS X 0208
159  jis0212-raw           JIS X 0212-1990
----------------------------------------------------------------
  • About compatible encodings taking care of characters with combining marks, see Encode::JISX0213::CCS.

  • Note that GL mapping of jis0201-raw provided by this module is broken. Use iso-646-jp provided by Encode::ISOIRSingle.

Encode::KR

reg# Name                  Description
----------------------------------------------------------------
149  ksc5601-raw           KS C5601-1987, 1st rev. of KS X 1001
----------------------------------------------------------------

SEE ALSO

See modules described above for specific CCS.

[ISO-IR] International Register of Coded Character Sets to be Used with Escape Sequences, http://www.itscj.ipsj.or.jp/ISO-IR/.

Encode, Encode::ISO2022.

AUTHOR

Hatuka*nezumi - IKEDA Soji, <nezumi@cpan.org>

COPYRIGHT AND LICENSE

Copyright (C) 2013 by Hatuka*nezumi - IKEDA Soji

This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.