NAME
jacode4e - jacode.pl-like program for enterprise
SYNOPSIS
use FindBin;
use lib "$FindBin::Bin/lib";
require 'jacode4e.pl';
$return =
    jacode4e::convert(\$line, $OUTPUT_encoding, $INPUT_encoding [, { %option }]);
$return
Number of characters in $line
$line
String variable to convert
After conversion, this variable is overwritten with the converted string
$OUTPUT_encoding and $INPUT_encoding
To convert, you must specify both $OUTPUT_encoding and $INPUT_encoding.
The encodings you can specify are as follows:
mnemonic    means
-----------------------------------------------------------------------
cp932x      CP932X, Extended CP932 to JIS X 0213 using 0x9C5A as single shift
cp932       CP932
sjis2004    Shift_JIS-2004
cp00930     IBM CP00930(CP00290+CP00300), CCSID 5026 katakana
keis78      HITACHI KEIS78
keis83      HITACHI KEIS83
keis90      HITACHI KEIS90
jef         FUJITSU JEF
jipsj       NEC JIPS(J)
jipse       NEC JIPS(E)
utf8        UTF-8
utf8jp      UTF-8-SPUA-JP, JIS X 0213 on SPUA ordered by JIS level, plane, row, cell
-----------------------------------------------------------------------
%option
The options you can specify are as follows:
key mnemonic      value means
-----------------------------------------------------------------------
INPUT_LAYOUT      input record layout as a sequence of 'S' and 'D'
                  'S' means one SBCS character, 'D' means one DBCS character
OUTPUT_SHIFTING   true means use output shift codes, false means do not use them
                  default is false
SPACE             DBCS code to output for space
GETA              DBCS code to output for geta
-----------------------------------------------------------------------
SAMPLES
use FindBin;
use lib "$FindBin::Bin/lib";
require 'jacode4e.pl';
# Version check for jacode4e.pl
jacode4e::VERSION('2.13.6.5');
# Convert each input record from CP00930 to CP932X; $_ is overwritten in place
while (<>) {
    $return =
        jacode4e::convert(\$_, 'cp932x', 'cp00930', {
            'INPUT_LAYOUT'    => 'SSSDDDSSDDSDSD',
            'OUTPUT_SHIFTING' => 0,
            'SPACE'           => "\x81\xA2",
            'GETA'            => "\x81\xA1",
        });
    print $_;
}
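For records with many fields, typing the INPUT_LAYOUT string by hand is error-prone. The following is a minimal sketch of one way to assemble it from field definitions. build_layout() is a hypothetical helper written for this illustration and is not part of jacode4e.pl; it only builds the string and does not call jacode4e::convert().

```perl
use strict;
use warnings;

# build_layout() is a hypothetical helper, not part of jacode4e.pl.
# Each argument is [ type, count ]: type is 'S' (SBCS) or 'D' (DBCS),
# count is the number of characters of that type in the field.
sub build_layout {
    my @fields = @_;
    my $layout = '';
    for my $field (@fields) {
        my ($type, $count) = @$field;
        die "unknown field type '$type'" unless $type eq 'S' or $type eq 'D';
        $layout .= $type x $count;
    }
    return $layout;
}

# Rebuild the layout string used in the sample above.
my $layout = build_layout(
    [ S => 3 ], [ D => 3 ], [ S => 2 ], [ D => 2 ],
    [ S => 1 ], [ D => 1 ], [ S => 1 ], [ D => 1 ],
);
print "$layout\n";    # prints SSSDDDSSDDSDSD
```

The resulting string can then be passed as the value of 'INPUT_LAYOUT'.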
ABSTRACT
This software was developed to promote the use of JIS X 0213.
JIS X 0213 contains almost all of the characters used in everyday life in
Japan. In addition, JIS X 0213 was designed with consideration for use
in Shift_JIS environments. However, because it did not take CP932 and
GAIJI into account, it is not very popular.
The "JIS X 0213 bridge" still needs its last one meter.
To solve this problem, jacode4e.pl proposes CP932X [shi:pi:kju:san'nikai].
CP932X implements JIS X 0213 over CP932.
With CP932X, we can continue to use CP932 data, application programs,
databases, and networks.
WHAT IS "CP932X"?
"cp932x" as mnemonic
CP932X is CP932
Pronounced [si: pi: nain thri: tu: kai] in English
Pronounced [shi: pi: kju: san' ni kai] in Japanese
[si: pi: nain thri: tu: iks] is reserved for Microsoft Corporation ;-P
Upward compatible with CP932
Supports the JIS X 0213 character set
Uses the ghost character "\x9C\x5A" as the single shift code
Uses "\x9C\x5A\x9C\x5A" to represent a single literal "\x9C\x5A"
You can use the private use characters you have made
You can keep using your operating system, network, and database.
In most cases, application programs can be used as they are.
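The doubling rule for a literal "\x9C\x5A" can be sketched as follows. This is an illustration only, not the code used inside jacode4e.pl. escape_ghost() is a hypothetical name, and the sketch assumes the input is a plain CP932 byte string whose double-byte lead bytes are 0x81-0x9F and 0xE0-0xFC. It walks the string character by character, because the byte pair 0x9C 0x5A can also appear across a character boundary (a trail byte 0x9C followed by the single-byte character "Z").

```perl
use strict;
use warnings;

# escape_ghost() is a hypothetical helper, not part of jacode4e.pl.
# It doubles every literal "\x9C\x5A" character in a CP932 byte string.
sub escape_ghost {
    my ($cp932) = @_;
    my $out = '';
    # Take one character per iteration: a double-byte character
    # (assumed lead bytes 0x81-0x9F, 0xE0-0xFC) or a single byte.
    while ($cp932 =~ /\G([\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x00-\xFF])/g) {
        my $char = $1;
        $out .= ($char eq "\x9C\x5A") ? "\x9C\x5A\x9C\x5A" : $char;
    }
    return $out;
}

# A real "\x9C\x5A" character between "A" and "B" is doubled.
print unpack('H*', escape_ghost("A\x9C\x5AB")), "\n";    # prints 419c5a9c5a42

# Here 0x9C is the trail byte of "\x88\x9C" and 0x5A is "Z": left unchanged.
print unpack('H*', escape_ghost("\x88\x9CZ")), "\n";     # prints 889c5a
```

A plain substring replacement would wrongly double the second case, which is why the sketch scans by character rather than by byte pair.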
WHAT IS "UTF-8-SPUA-JP"?
"utf8jp" as mnemonic
UTF-8-SPUA-JP is UTF-8
Used universally as the internal character encoding of jacode4e.pl
Implements the JIS X 0213 character set on the Unicode Supplementary Private Use Area-A
Code points ordered by JIS level, plane, row, cell
Uniform-length encoding
No grapheme clustering; one character per unique code point
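Every code point in Supplementary Private Use Area-A (U+F0000 to U+FFFFD) takes exactly four bytes in UTF-8, which is one way to read the uniform-length property for characters placed there. The sketch below encodes such code points by hand with the four-byte form from RFC 3629. spua_a_utf8() is a hypothetical helper, and the sample code points are arbitrary; the sketch says nothing about which JIS X 0213 character jacode4e.pl assigns to which code point.

```perl
use strict;
use warnings;

# spua_a_utf8() is a hypothetical helper, not part of jacode4e.pl.
# It encodes one code point of Supplementary Private Use Area-A
# (U+F0000..U+FFFFD) as UTF-8 using the four-byte form of RFC 3629.
sub spua_a_utf8 {
    my ($cp) = @_;
    die sprintf("U+%X is outside SPUA-A", $cp)
        if $cp < 0xF0000 or $cp > 0xFFFFD;
    return pack('C4',
        0xF0 | ($cp >> 18),
        0x80 | (($cp >> 12) & 0x3F),
        0x80 | (($cp >>  6) & 0x3F),
        0x80 | ( $cp        & 0x3F),
    );
}

# Arbitrary sample code points: first, middle, and last of SPUA-A.
for my $cp (0xF0000, 0xF1234, 0xFFFFD) {
    my $bytes = spua_a_utf8($cp);
    printf "U+%05X -> %s (%d bytes)\n",
        $cp, unpack('H*', $bytes), length($bytes);
}
```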
DEPENDENCIES
This software requires perl version 5.00503 or later to run. (To all Perl4 users in the world: pardon me!)
SOFTWARE LIFE CYCLE
                                         Jacode.pm Jacode4e.pm
                  jcode.pl   Encode.pm   jacode.pl jacode4e.pl
--------------------------------------------------------------
1993 Perl4.036       |                       |
  :     :            :                       :
1999 Perl5.00503     |                       |          |
2000 Perl5.6         |                       |          |
2002 Perl5.8         |         Born          |          |
2007 Perl5.10        V           |           |          |
2010 Perl5.12       EOL          |         Born         |
2011 Perl5.14                    |           |          |
2012 Perl5.16                    |           |          |
2013 Perl5.18                    |           |          |
2014 Perl5.20                    |           |          |
2015 Perl5.22                    |           |          |
2016 Perl5.24                    |           |          |
2017 Perl5.26                    |           |          |
2018 Perl5.28                    :           :        Born
2019 Perl5.30                    :           :          :
2020 Perl5.32                    :           :          :
  :     :                        V           V          V
--------------------------------------------------------------
SOFTWARE COVERAGE
When you lose your way, consult this matrix to find it again.
Skill/Use   Amateur     Semipro     Pro         Enterprise
---------------------------------------------------------
Expert      jacode.pl   Encode.pm   Encode.pm   jacode4e.pl
---------------------------------------------------------
Middle      jacode.pl   jacode.pl   Encode.pm   jacode4e.pl
---------------------------------------------------------
Beginner    jacode.pl   jacode.pl   jacode.pl   jacode4e.pl
---------------------------------------------------------
AUTHOR
INABA Hitoshi <ina@cpan.org> in a CPAN
This project was originated by INABA Hitoshi.
LICENSE AND COPYRIGHT
This software is free software; you can redistribute it and/or modify it under the same terms as Perl itself. See perlartistic.
This software is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
SEE ALSO
CPGID 00290
https://www-01.ibm.com/software/globalization/cdra/
https://www-01.ibm.com/software/globalization/cp/cp00290.html
ftp://ftp.software.ibm.com/software/globalization/gcoc/attachments/CP00290.pdf
ftp://ftp.software.ibm.com/software/globalization/gcoc/attachments/CP00290.txt
HiRDB Datareplicator Version 8 manuals, Hitachi, Ltd.
http://itdoc.hitachi.co.jp/manuals/3020/3020636050/W3600001.HTM
http://itdoc.hitachi.co.jp/manuals/3020/3020636050/W3600166.HTM
http://itdoc.hitachi.co.jp/manuals/3020/30203J3820/ISUS0268.HTM
http://itdoc.hitachi.co.jp/manuals/3000/30003D5820/CLNT0235.HTM
Linkexpress, FUJITSU LIMITED
http://software.fujitsu.com/jp/manual/manualfiles/M080093/J2X15930/03Z200/index.html
http://software.fujitsu.com/jp/manual/manualfiles/M080093/J2X15930/03Z200/unyo05/unyo0413.html
http://software.fujitsu.com/jp/manual/manualfiles/m130010/b1fw5992/01z200/b5992-c-00-00.html
iDIVO Ver.1.4.0
https://www.hulft.com/shukka/files/iDIVO/SP-DV1-CC-02-01.pdf
cp932 to Unicode table
ftp://ftp.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/CP932.TXT
https://support.microsoft.com/ja-jp/help/170559/prb-conversion-problem-between-shift-jis-and-unicode
Shift_JIS-2004 to Unicode table
http://x0213.org/codetable/sjis-0213-2004-std.txt
IBM Japanese Graphic Character Set, Kanji DBCS Host and DBCS - PC
https://www-01.ibm.com/software/globalization/cdra/
ftp://ftp.software.ibm.com/software/globalization/gcoc/attachments/CP00300.pdf
IBM Kanji code list
N:GC18-2040-3
KEIS code book, Culti Co.,Ltd.
http://www.culti.co.jp/2016/02/01/%e3%82%ab%e3%83%ab%e3%83%81%e7%99%ba%e8%a1%8c%e6%9b%b8%e7%b1%8d/
JIS X 0208 (1990) to Unicode
ftp://ftp.unicode.org/Public/MAPPINGS/OBSOLETE/EASTASIA/JIS/JIS0208.TXT
Appendix B.2 Character code differences
http://itdoc.hitachi.co.jp/manuals/3020/3020759580/G5950334.HTM
Appendix E Handling of character codes in PDE - Form Designer (applies only to distributed type PDE)
http://itdoc.hitachi.co.jp/manuals/3020/30203p0360/PDEF0203.HTM
HITAC Character code table (KEIS83)
Document number 8080-2-100-10
JEF code book, Culti Co.,Ltd.
http://www.culti.co.jp/2016/02/01/jef%e3%82%b3%e3%83%bc%e3%83%89%e3%83%96%e3%83%83%e3%82%af/
Linkexpress operation manual J2X1-5930-03Z2(00) FUJITSU LIMITED
http://software.fujitsu.com/jp/manual/manualfiles/M080093/J2X15930/03Z200/index.html
http://software.fujitsu.com/jp/manual/manualfiles/M070086/J2X15930/01Z200/unyo05/unyo0416.html
http://software.fujitsu.com/jp/manual/manualfiles/M070086/J2X15930/01Z200/unyo05/unyo0420.html
http://software.fujitsu.com/jp/manual/manualfiles/M070086/J2X15930/01Z200/unyo05/unyo0421.html
http://software.fujitsu.com/jp/manual/manualfiles/m120010/b1fw5691/05z200/index.html
http://software.fujitsu.com/jp/manual/manualfiles/m120010/b1fw5691/05z200/b5691-g-00-00.html
hidekatsu-izuno/jef4j
https://github.com/hidekatsu-izuno/jef4j
JHTc(JHT command edition)
http://www.vector.co.jp/soft/winnt/util/se094205.html
FACOM JEF Character code index dictionary
Manual code 99FR-0012-3
JIPS code book, Culti Co.,Ltd.
http://www.culti.co.jp/2016/02/01/jips%e3%82%b3%e3%83%bc%e3%83%89%e3%83%96%e3%83%83%e3%82%af/
NEC Corporation Standard character set dictionary <BASIC>
ZBB10-3
NEC Corporation Standard character set dictionary <EXTENSION>
ZBB11-2
UTF-8, a transformation format of ISO 10646
https://www.rfc-editor.org/rfc/rfc3629.txt
Kanji shift code
https://ja.wikipedia.org/wiki/%E6%BC%A2%E5%AD%97%E3%82%B7%E3%83%95%E3%83%88%E3%82%B3%E3%83%BC%E3%83%89
Very old fj.kanji discussion
http://www.ie.u-ryukyu.ac.jp/~kono/fj/fj.kanji/index.html
BackPAN
http://backpan.perl.org/authors/id/I/IN/INA/
ACKNOWLEDGEMENTS
I was able to make this software thanks to good luck. I thank all stakeholders.
I received the character code tables of KEIS, JEF, and JIPS as electronic data
from Culti Co.,Ltd. Moreover, Culti Co.,Ltd. has allowed me to use them to
make open source software.
I thank Culti Co.,Ltd. once again.
HELLO WORLD
To support JIS X 0213:2004,
Using ghost character 1-55-27(it's me!),
Found by JIS X 0208:1997,
Was born in JIS C 6226-1978.
Hello world,
What do we hack, today?
-- 1-55-27, 2018-01-27