The Perl and Raku Conference 2025: Greenville, South Carolina - June 27-29 Learn more

NAME

String::Multibyte::EUC_JP - internally used by String::Multibyte for EUC-JP

SYNOPSIS

$eucjp = String::Multibyte->new('EUC_JP');
$eucjp_length = $eucjp->length($eucjp_string);

DESCRIPTION

String::Multibyte::EUC_JP is used for manipulation of strings in EUC-JP family encodings.

Byte range of single-byte characters: 0x00..0x7F.

First and second byte range of double-byte characters: 0xA1..0xFE.

Legal byte range for two bytes preceded by SS3: 0xA1..0xFE.

Character order (invalid code points are excluded): 0x00..0x7F, 0x8EA1..0x8EFE, 0xA1A1..0xFEFE, 0x8FA1A1..0x8FFEFE.

EUC-JP EUC-JISX0213
1. 0x00..0x7F C0/G0 ASCII ASCII
2. 0x8EA1..0x8EFE G2 JIS kana JIS kana
3. 0xA1A1..0xFEFE G1 JIS X 0208 JIS X 0213 plane-1
4. 0x8FA1A1..0x8FFEFE G3 JIS X 0212 JIS X 0213 plane-2

CAVEAT

C1 controls other than SS2/SS3 (0x80..0x8D and 0x90..0x9E) are not supported.

Unassigned G2 (0x8EE0..0x8EFE) are now included.

SEE ALSO

String::Multibyte