NAME
mb - run Perl script written in MBCS
SYNOPSIS
$ perl mb.pm MBCS_Perl_script.pl
$ perl mb.pm -e big5 MBCS_Perl_script.pl
$ perl mb.pm -e big5hkscs MBCS_Perl_script.pl
$ perl mb.pm -e eucjp MBCS_Perl_script.pl
$ perl mb.pm -e gb18030 MBCS_Perl_script.pl
$ perl mb.pm -e gbk MBCS_Perl_script.pl
$ perl mb.pm -e sjis MBCS_Perl_script.pl
$ perl mb.pm -e uhc MBCS_Perl_script.pl
$ perl mb.pm -e utf8 MBCS_Perl_script.pl
MBCS subroutines:
mb::chop(...);
mb::chr(...);
mb::dosglob(...);
mb::getc(...);
mb::index(...);
mb::index_byte(...);
mb::length(...);
mb::ord(...);
mb::reverse(...);
mb::rindex(...);
mb::rindex_byte(...);
mb::substr(...);
supported encodings:
Big5, Big5-HKSCS, EUC-JP, GB18030, GBK, Sjis, UHC, UTF-8
supported operating systems:
Apple Inc. OS X,
Hewlett-Packard Development Company, L.P. HP-UX,
International Business Machines Corporation AIX,
Microsoft Corporation Windows,
Oracle Corporation Solaris,
and Other Systems
supported perl versions:
perl version 5.005_03 to newest perl
INSTALLATION BY MAKE (for UNIX-like system)
To install this software by make, type the following:
perl Makefile.PL
make
make test
make install
INSTALLATION WITHOUT MAKE (for DOS-like system)
To install this software without make, type the following:
pmake.bat test
pmake.bat install
DESCRIPTION
This software is a source code filter, transpiler-modulino.
Perl is said to have been able to handle Unicode since version 5.8. However,
unlike JPerl, "Easy jobs easy" has been lost. (but we have got it again :-D)
In Shift_JIS and similar encodings(Big5, Big5HKSCS, GB18030, GBK, Sjis, UHC)
have any DAMEMOJI at second octet in double-byte codepoint. DAMEMOJI are
metacharacters. Which octets are DAMEMOJI depends on whether the enclosing
delimiter is single quote or double quote. This software escapes DAMEMOJI in
your script, generate a new script and run it.
There are some MBCS encodings in the world.
in Japan since 1978, JIS C 6226-1978,
in China since 1980, GB 2312-80,
in Taiwan since 1984, Big5,
in South Korea since 1991, KS X 1002:1991, and more.
Even if you are an avid Unicode proponent, you cannot change this fact. These
encodings are still used today in most areas except the world wide web.
This software ...
* supports MBCS literals in Perl scripts
* supports Big5, Big5HKSCS, EUC-JP, GB18030, GBK, Sjis, UHC, and UTF-8
* does not use the UTF8 flag to avoid MOJIBAKE
* escapes DAMEMOJI in scripts
* handles raw encoding to support GAIJI
* adds multibyte anchoring to regular expressions
* rewrites character classes in regular expressions to work as MBCS codepoint
* supports special variables $`, $&, and $'
* does not change features of octet-oriented built-in functions
* lc(), lcfirst(), uc(), and ucfirst() convert US-ASCII only
* character ranges by hyphen of regular expression supports US-ASCII only
* tr/// and y/// doesn't support ranges by hyphen
* You have to write mb::* subroutines if you want codepoint semantics
Let's enjoy MBSC scripting in Perl, together!!
TERMINOLOGY
To understand and use this software, you must know some terminologies.
But now I have no time for write them. So today is July 7th, I have to go to
meet Juliet.
The necessary terms are listed below. Maybe world wide web will help you.
byte
octet
encoding
decode
character
code point
grapheme
SBCS(Single Byte Character Set)
DBCS(Double Byte Character Set)
MBCS(Multibyte Character Set)
multibyte anchoring
character class
MOJIBAKE
DAMEMOJI
GAIJI
GETA, GETA-MOJI, GETA-MARK
MBCS Encodings supported by this software
The encodings supported by this software and their range of octets are as
follows.
------------------------------------------------------------------------------
big5
1st 2nd
81..FE 00..FF
00..7F
https://en.wikipedia.org/wiki/Big5
------------------------------------------------------------------------------
big5hkscs
1st 2nd
81..FE 00..FF
00..7F
https://en.wikipedia.org/wiki/Hong_Kong_Supplementary_Character_Set
------------------------------------------------------------------------------
eucjp
1st 2nd
A1..FE 00..FF
00..7F
https://en.wikipedia.org/wiki/Extended_Unix_Code#EUC-JP
------------------------------------------------------------------------------
gb18030
1st 2nd
81..FE 30..39 81..FE 30..39
81..FE 00..FF
00..7F
https://en.wikipedia.org/wiki/GB_18030
------------------------------------------------------------------------------
gbk
1st 2nd
81..FE 00..FF
00..7F
https://en.wikipedia.org/wiki/GBK_(character_encoding)
------------------------------------------------------------------------------
sjis
1st 2nd
81..9F 00..FF
E0..FC 00..FF
80..FF
00..7F
https://en.wikipedia.org/wiki/Shift_JIS
------------------------------------------------------------------------------
uhc
1st 2nd
81..FE 00..FF
00..7F
https://en.wikipedia.org/wiki/Unified_Hangul_Code
------------------------------------------------------------------------------
utf8
1st 2nd 3rd 4th
E1..EC 80..BF 80..BF
C2..DF 80..BF
EE..EF 80..BF 80..BF
F0..F0 90..BF 80..BF 80..BF
E0..E0 A0..BF 80..BF
ED..ED 80..9F 80..BF
F1..F3 80..BF 80..BF 80..BF
F4..F4 80..8F 80..BF 80..BF
00..7F
https://en.wikipedia.org/wiki/UTF-8
------------------------------------------------------------------------------
MBCS subroutines provided by this software
This software provides traditional feature "as was." The new MBCS features
are provided by subroutines with new names. If you like utf8 pragma, mb::*
subroutines will help you. On other hand, If you love JPerl, those
subroutines will not help you very much. Traditional functions of Perl are
useful still now in octet-oriented semantics.
-----------------------------------------------------------------
bare Perl4 JPerl4
bare Perl5 JPerl5 use utf8; mb.pm
bare Perl7 pragma modulino
-----------------------------------------------------------------
chop --- --- chop
chr chr bytes::chr chr
getc getc --- getc
index --- bytes::index index
lc lc --- lc
lcfirst lcfirst --- lcfirst
length length bytes::length length
ord ord bytes::ord ord
reverse reverse --- reverse
rindex --- bytes::rindex rindex
substr substr bytes::substr substr
uc uc --- uc
ucfirst ucfirst --- ucfirst
--- chop chop mb::chop
--- index --- mb::index_byte
--- rindex --- mb::rindex_byte
--- --- chr mb::chr
--- --- getc mb::getc
--- --- index mb::index
--- --- lc ---
--- --- lcfirst ---
--- --- length mb::length
--- --- ord mb::ord
--- --- reverse mb::reverse
--- --- rindex mb::rindex
--- --- substr mb::substr
--- --- uc ---
--- --- ucfirst ---
-----------------------------------------------------------------
DOS-like glob() as MBCS subroutine
-----------------------------------------------------------------
MBCS semantics broken function, not so useful
-----------------------------------------------------------------
mb::dosglob glob, and <globbing*>
-----------------------------------------------------------------
index brothers
------------------------------------------------------------------------------------------
functions or subs works returns considered
------------------------------------------------------------------------------------------
index as octet as octet useful, bare Perl like
rindex as octet as octet useful, bare Perl like
mb::index as codepoint as codepoint not so useful, utf8 pragma like
mb::rindex as codepoint as codepoint not so useful, utf8 pragma like
mb::index_byte as codepoint as octet useful, JPerl like
mb::rindex_byte as codepoint as octet useful, JPerl like
------------------------------------------------------------------------------------------
Porting from script in bare Perl4, bare Perl5, and bare Perl7
-----------------------------------------------------------------
original script in script with
Perl4, Perl5, Perl7 mb.pm modulino
-----------------------------------------------------------------
chop chop
chr chr
getc getc
index index
lc lc
lcfirst lcfirst
length length
no Your::Module; no Your::Module;
ord ord
reverse reverse
rindex rindex
substr substr
uc uc
ucfirst ucfirst
use Your::Module; use Your::Module;
-----------------------------------------------------------------
Porting from script in JPerl4, and JPerl5
-----------------------------------------------------------------
original script in script with
JPerl4, JPerl5 mb.pm modulino
-----------------------------------------------------------------
chop mb::chop
index mb::index_byte
no Your::MBCS::Module; no mb::mode Your::MBCS::Module; *1
rindex mb::rindex_byte
use Your::MBCS::Module; use mb::mode Your::MBCS::Module; *1
-----------------------------------------------------------------
*1 mb::mode module comes later
Porting from script with utf8 pragma
-----------------------------------------------------------------
original script with script with
utf8 pragma mb.pm modulino
-----------------------------------------------------------------
chop mb::chop
chr mb::chr
getc mb::getc
index mb::index
lc ---
lcfirst ---
length mb::length
no Your::MBCS::Module; no mb::mode Your::MBCS::Module; *2
ord mb::ord
reverse mb::reverse
rindex mb::rindex
substr mb::substr
uc ---
ucfirst ---
use Your::MBCS::Module; use mb::mode Your::MBCS::Module; *2
-----------------------------------------------------------------
*2 mb::mode module comes later, and module must be without utf8 pragma.
What are DAMEMOJI
In single quote, DAMEMOJI are double-byte characters that include the
following metacharacters ('', q{}, <<'END', qw{}, m'', s''', split(''),
split(m''), and qr'')
------------------------------------------------------------------
hex character
------------------------------------------------------------------
5C [\] backslashed escapes
------------------------------------------------------------------
In double quote, DAMEMOJI are double-byte characters that include the
following metacharacters ("", qq{}, <<END, <<"END", ``, qx{}, <<`END`, //,
m//, ??, s///, split(//), split(m//), and qr//)
------------------------------------------------------------------
hex character
------------------------------------------------------------------
21 [!]
22 ["]
23 [#] regexp comment
24 [$] sigil of scalar variable
25 [%]
26 [&]
27 [']
28 [(] regexp group and capture
29 [)] regexp group and capture
2A [*] regexp matches zero or more times
2B [+] regexp matches one or more times
2C [,]
2D [-]
2E [.] regexp matches any octet
2F [/]
3A [:]
3B [;]
3C [<]
3D [=]
3E [>]
3F [?] regexp matches zero or one times
40 [@] sigil of array variable
5B [[] regexp bracketed character class
5C [\] backslashed escapes
5D []] regexp bracketed character class
5E [^] regexp true at beginning of string
60 [`] command execution
7B [{] regexp quantifier
7C [|] regexp alternation
7D [}] regexp quantifier
7E [~]
------------------------------------------------------------------
How to escape DAMEMOJI
ex. Japanese KATAKANA "SO" like [ `/ ] code is "\x83\x5C" in Sjis
see hex dump
-----------------------------------------
source script "`/" [83 5c]
-----------------------------------------
using mb.pm,
hex dump
-----------------------------------------
escaped script "`\/" [83 [5c] 5c]
-----------------------------------------
^--- escape by mb.pm
by the by see hex dump
-----------------------------------------
your eye's "`/\" [83 5c] [5c]
-----------------------------------------
perl eye's "`\/" [83] \[5c]
-----------------------------------------
hex dump
-----------------------------------------
in the perl "`/" [83] [5c]
-----------------------------------------
What converts to what by this software?
This software automatically converts MBCS literal strings in scripts to
octet-oriented strings(OO-quotee).
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
in your script script transpiled by this software
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
do "string"; mb::do "string";
mb::do { block } do { block }
eval "string"; mb::eval "string";
mb::eval { block } eval { block }
require mb::require
mb::require mb::require
chop chop
lc mb::lc
lcfirst mb::lcfirst
uc mb::uc
ucfirst mb::ucfirst
index index
rindex rindex
'MBCS-quotee' 'OO-quotee'
"MBCS-quotee" "OO-quotee"
`MBCS-quotee` `OO-quotee`
/MBCS-quotee/cgimosx m{\G${mb::_anchor}@{[mb::_ignorecase(qr/OO-quotee/mosx)]}@{[mb::_m_passed()]}}cg
/MBCS-quotee/cgmosx m{\G${mb::_anchor}@{[qr/OO-quotee/mosx ]}@{[mb::_m_passed()]}}cg
?MBCS-quotee?cgimosx m{\G${mb::_anchor}@{[mb::_ignorecase(qr?OO-quotee?mosx)]}@{[mb::_m_passed()]}}cg
?MBCS-quotee?cgmosx m{\G${mb::_anchor}@{[qr?OO-quotee?mosx ]}@{[mb::_m_passed()]}}cg
<MBCS-quotee> <OO-quotee>
q/MBCS-quotee/ q/OO-quotee/
qx'MBCS-quotee' qx'OO-quotee'
qw/MBCS-quotee/ qw/OO-quotee/
m'MBCS-quotee'cgimosx m{\G${mb::_anchor}@{[mb::_ignorecase(qr'OO-quotee'mosx)]}@{[mb::_m_passed()]}}cg
m'MBCS-quotee'cgmosx m{\G${mb::_anchor}@{[qr'OO-quotee'mosx ]}@{[mb::_m_passed()]}}cg
s'MBCS-regexp'MBCS-replacement'eegimosxr s{(\G${mb::_anchor})@{[mb::_ignorecase(qr'OO-regexp'mosx)]}@{[mb::_s_passed()]}}{$1 . mb::eval mb::eval q'OO-replacement'}egr
s'MBCS-regexp'MBCS-replacement'eegmosxr s{(\G${mb::_anchor})@{[qr'OO-regexp'mosx ]}@{[mb::_s_passed()]}}{$1 . mb::eval mb::eval q'OO-replacement'}egr
tr/MBCS-search/MBCS-replacement/cdsr s{[\x00-\xFF]*}{mb::tr($&,q/OO-search/,q/OO-replacement/,'cdsr')}er
tr/MBCS-search/MBCS-replacement/cds s{[\x00-\xFF]*}{mb::tr($&,q/OO-search/,q/OO-replacement/,'cdsr')}e
tr/MBCS-search/MBCS-replacement/ds s{[\x00-\xFF]*}{mb::tr($&,q/OO-search/,q/OO-replacement/,'dsr')}e
y/MBCS-search/MBCS-replacement/cdsr s{[\x00-\xFF]*}{mb::tr($&,q/OO-search/,q/OO-replacement/,'cdsr')}er
y/MBCS-search/MBCS-replacement/cds s{[\x00-\xFF]*}{mb::tr($&,q/OO-search/,q/OO-replacement/,'cdsr')}e
y/MBCS-search/MBCS-replacement/ds s{[\x00-\xFF]*}{mb::tr($&,q/OO-search/,q/OO-replacement/,'dsr')}e
qr'MBCS-quotee'cgimosx qr{\G${mb::_anchor}@{[mb::_ignorecase(qr'OO-quotee'mosx)]}@{[mb::_m_passed()]}}cg
qr'MBCS-quotee'cgmosx qr{\G${mb::_anchor}@{[qr'OO-quotee'mosx ]}@{[mb::_m_passed()]}}cg
split m'^' mb::_split qr{@{[qr'^'m ]}}
split m'MBCS-quotee'cgimosx mb::_split qr{@{[mb::_ignorecase(qr'OO-quotee'mosx)]}}cg
split m'MBCS-quotee'cgmosx mb::_split qr{@{[qr'OO-quotee'mosx ]}}cg
split qr'^' mb::_split qr{@{[qr'^'m ]}}
split qr'MBCS-quotee'cgimosx mb::_split qr{@{[mb::_ignorecase(qr'OO-quotee'mosx)]}}cg
split qr'MBCS-quotee'cgmosx mb::_split qr{@{[qr'OO-quotee'mosx ]}}cg
qq/MBCS-quotee/ qq/OO-quotee/
qq'MBCS-quotee' qq'OO-quotee'
qx/MBCS-quotee/ qx/OO-quotee/
m/MBCS-quotee/cgimosx m{\G${mb::_anchor}@{[mb::_ignorecase(qr/OO-quotee/mosx)]}@{[mb::_m_passed()]}}cg
m/MBCS-quotee/cgmosx m{\G${mb::_anchor}@{[qr/OO-quotee/mosx ]}@{[mb::_m_passed()]}}cg
s/MBCS-regexp/MBCS-replacement/eegimosxr s{(\G${mb::_anchor})@{[mb::_ignorecase(qr/OO-regexp/mosx)]}@{[mb::_s_passed()]}}{$1 . mb::eval mb::eval q/OO-replacement/}egr
s/MBCS-regexp/MBCS-replacement/eegmosxr s{(\G${mb::_anchor})@{[qr/OO-regexp/mosx ]}@{[mb::_s_passed()]}}{$1 . mb::eval mb::eval q/OO-replacement/}egr
qr/MBCS-quotee/cgimosx qr{\G${mb::_anchor}@{[mb::_ignorecase(qr/OO-quotee/mosx)]}@{[mb::_m_passed()]}}cg
qr/MBCS-quotee/cgmosx qr{\G${mb::_anchor}@{[qr/OO-quotee/mosx ]}@{[mb::_m_passed()]}}cg
split /^/ mb::_split qr{@{[qr/^/m ]}}
split /MBCS-quotee/cgimosx mb::_split qr{@{[mb::_ignorecase(qr/OO-quotee/mosx)]}}cg
split /MBCS-quotee/cgmosx mb::_split qr{@{[qr/OO-quotee/mosx ]}}cg
split m/^/ mb::_split qr{@{[qr/^/m ]}}
split m/MBCS-quotee/cgimosx mb::_split qr{@{[mb::_ignorecase(qr/OO-quotee/mosx)]}}cg
split m/MBCS-quotee/cgmosx mb::_split qr{@{[qr/OO-quotee/mosx ]}}cg
split qr/^/ mb::_split qr{@{[qr/^/m ]}}
split qr/MBCS-quotee/cgimosx mb::_split qr{@{[mb::_ignorecase(qr/OO-quotee/mosx)]}}cg
split qr/MBCS-quotee/cgmosx mb::_split qr{@{[qr/OO-quotee/mosx ]}}cg
m:MBCS-quotee:cgimosx m{\G${mb::_anchor}@{[mb::_ignorecase(qr`OO-quotee`mosx)]}@{[mb::_m_passed()]}}cg
m:MBCS-quotee:cgmosx m{\G${mb::_anchor}@{[qr`OO-quotee`mosx ]}@{[mb::_m_passed()]}}cg
s:MBCS-regexp:MBCS-replacement:eegimosxr s{(\G${mb::_anchor})@{[mb::_ignorecase(qr`OO-regexp`mosx)]}@{[mb::_s_passed()]}}{$1 . mb::eval mb::eval q:OO-replacement:}egr
s:MBCS-regexp:MBCS-replacement:eegmosxr s{(\G${mb::_anchor})@{[qr`OO-regexp`mosx ]}@{[mb::_s_passed()]}}{$1 . mb::eval mb::eval q:OO-replacement:}egr
qr:MBCS-quotee:cgimosx qr{\G${mb::_anchor}@{[mb::_ignorecase(qr`OO-quotee`mosx)]}@{[mb::_m_passed()]}}cg
qr:MBCS-quotee:cgmosx qr{\G${mb::_anchor}@{[qr`OO-quotee`mosx ]}@{[mb::_m_passed()]}}cg
split m:^: mb::_split qr{@{[qr`^`m ]}}
split m:MBCS-quotee:cgimosx mb::_split qr{@{[mb::_ignorecase(qr`OO-quotee`mosx)]}}cg
split m:MBCS-quotee:cgmosx mb::_split qr{@{[qr`OO-quotee`mosx ]}}cg
split qr:^: mb::_split qr{@{[qr`^`m ]}}
split qr:MBCS-quotee:cgimosx mb::_split qr{@{[mb::_ignorecase(qr`OO-quotee`mosx)]}}cg
split qr:MBCS-quotee:cgmosx mb::_split qr{@{[qr`OO-quotee`mosx ]}}cg
m@MBCS-quotee@cgimosx m{\G${mb::_anchor}@{[mb::_ignorecase(qr`OO-quotee`mosx)]}@{[mb::_m_passed()]}}cg
m@MBCS-quotee@cgmosx m{\G${mb::_anchor}@{[qr`OO-quotee`mosx ]}@{[mb::_m_passed()]}}cg
s@MBCS-regexp@MBCS-replacement@eegimosxr s{(\G${mb::_anchor})@{[mb::_ignorecase(qr`OO-regexp`mosx)]}@{[mb::_s_passed()]}}{$1 . mb::eval mb::eval q@OO-replacement@}egr
s@MBCS-regexp@MBCS-replacement@eegmosxr s{(\G${mb::_anchor})@{[qr`OO-regexp`mosx ]}@{[mb::_s_passed()]}}{$1 . mb::eval mb::eval q@OO-replacement@}egr
qr@MBCS-quotee@cgimosx qr{\G${mb::_anchor}@{[mb::_ignorecase(qr`OO-quotee`mosx)]}@{[mb::_m_passed()]}}cg
qr@MBCS-quotee@cgmosx qr{\G${mb::_anchor}@{[qr`OO-quotee`mosx ]}@{[mb::_m_passed()]}}cg
split m@^@ mb::_split qr{@{[qr`^`m ]}}
split m@MBCS-quotee@cgimosx mb::_split qr{@{[mb::_ignorecase(qr`OO-quotee`mosx)]}}cg
split m@MBCS-quotee@cgmosx mb::_split qr{@{[qr`OO-quotee`mosx ]}}cg
split qr@^@ mb::_split qr{@{[qr`^`m ]}}
split qr@MBCS-quotee@cgimosx mb::_split qr{@{[mb::_ignorecase(qr`OO-quotee`mosx)]}}cg
split qr@MBCS-quotee@cgmosx mb::_split qr{@{[qr`OO-quotee`mosx ]}}cg
$` mb::_PREMATCH()
${`} mb::_PREMATCH()
$PREMATCH mb::_PREMATCH()
${PREMATCH} mb::_PREMATCH()
${^PREMATCH} mb::_PREMATCH()
$& mb::_MATCH()
${&} mb::_MATCH()
$MATCH mb::_MATCH()
${MATCH} mb::_MATCH()
${^MATCH} mb::_MATCH()
$1 mb::_CAPTURE(1)
$2 mb::_CAPTURE(2)
$3 mb::_CAPTURE(3)
@{^CAPTURE} mb::_CAPTURE()
${^CAPTURE}[0] mb::_CAPTURE(0+1)
${^CAPTURE}[1] mb::_CAPTURE(1+1)
${^CAPTURE}[2] mb::_CAPTURE(2+1)
@- mb::_LAST_MATCH_START()
@LAST_MATCH_START mb::_LAST_MATCH_START()
@{LAST_MATCH_START} mb::_LAST_MATCH_START()
@{^LAST_MATCH_START} mb::_LAST_MATCH_START()
$-[1] mb::_LAST_MATCH_START(1)
$LAST_MATCH_START[1] mb::_LAST_MATCH_START(1)
${LAST_MATCH_START}[1] mb::_LAST_MATCH_START(1)
${^LAST_MATCH_START}[1] mb::_LAST_MATCH_START(1)
@+ mb::_LAST_MATCH_END()
@LAST_MATCH_END mb::_LAST_MATCH_END()
@{LAST_MATCH_END} mb::_LAST_MATCH_END()
@{^LAST_MATCH_END} mb::_LAST_MATCH_END()
$+[1] mb::_LAST_MATCH_END(1)
$LAST_MATCH_END[1] mb::_LAST_MATCH_END(1)
${LAST_MATCH_END}[1] mb::_LAST_MATCH_END(1)
${^LAST_MATCH_END}[1] mb::_LAST_MATCH_END(1)
"$`" "@{[mb::_PREMATCH()]}"
"${`}" "@{[mb::_PREMATCH()]}"
"$PREMATCH" "@{[mb::_PREMATCH()]}"
"${PREMATCH}" "@{[mb::_PREMATCH()]}"
"${^PREMATCH}" "@{[mb::_PREMATCH()]}"
"$&" "@{[mb::_MATCH()]}"
"${&}" "@{[mb::_MATCH()]}"
"$MATCH" "@{[mb::_MATCH()]}"
"${MATCH}" "@{[mb::_MATCH()]}"
"${^MATCH}" "@{[mb::_MATCH()]}"
"$1" "@{[mb::_CAPTURE(1)]}"
"$2" "@{[mb::_CAPTURE(2)]}"
"$3" "@{[mb::_CAPTURE(3)]}"
"@{^CAPTURE}" "@{[join $", mb::_CAPTURE()]}"
"${^CAPTURE}[0]" "@{[mb::_CAPTURE(0)]}"
"${^CAPTURE}[1]" "@{[mb::_CAPTURE(1)]}"
"${^CAPTURE}[2]" "@{[mb::_CAPTURE(2)]}"
"@-" "@{[mb::_LAST_MATCH_START()]}"
"@LAST_MATCH_START" "@{[mb::_LAST_MATCH_START()]}"
"@{LAST_MATCH_START}" "@{[mb::_LAST_MATCH_START()]}"
"@{^LAST_MATCH_START}" "@{[mb::_LAST_MATCH_START()]}"
"$-[1]" "@{[mb::_LAST_MATCH_START(1)]}"
"$LAST_MATCH_START[1]" "@{[mb::_LAST_MATCH_START(1)]}"
"${LAST_MATCH_START}[1]" "@{[mb::_LAST_MATCH_START(1)]}"
"${^LAST_MATCH_START}[1]" "@{[mb::_LAST_MATCH_START(1)]}"
"@+" "@{[mb::_LAST_MATCH_END()]}"
"@LAST_MATCH_END" "@{[mb::_LAST_MATCH_END()]}"
"@{LAST_MATCH_END}" "@{[mb::_LAST_MATCH_END()]}"
"@{^LAST_MATCH_END}" "@{[mb::_LAST_MATCH_END()]}"
"$+[1]" "@{[mb::_LAST_MATCH_END(1)]}"
"$LAST_MATCH_END[1]" "@{[mb::_LAST_MATCH_END(1)]}"
"${LAST_MATCH_END}[1]" "@{[mb::_LAST_MATCH_END(1)]}"
"${^LAST_MATCH_END}[1]" "@{[mb::_LAST_MATCH_END(1)]}"
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
The following conversions are for Microsoft Windows, but will always be
converted so that the converted script will work on any system.
-----------------------------------------------------------------------------
in your script script transpiled by this software
-----------------------------------------------------------------------------
chdir mb::_chdir
lstat mb::_lstat
opendir mb::_opendir
stat mb::_stat
unlink mb::_unlink
filetest -B mb::_B
filetest -C mb::_C
filetest -M mb::_M
filetest -T mb::_T
filetest -d mb::_d
filetest -e mb::_e
filetest -f mb::_f
filetest -r mb::_r
filetest -s mb::_s
filetest -w mb::_w
filetest -x mb::_x
filetest -z mb::_z
-----------------------------------------------------------------------------
Each elements in strings or regular expressions that are double-quote like are
converted as follows.
---------------------------------------------------------------------------------------------
in your script script transpiled by this software
---------------------------------------------------------------------------------------------
\L\u MBCS-quotee \E\E \L\u OO-quotee \E\E
\U\l MBCS-quotee \E\E \U\l OO-quotee \E\E
\L MBCS-quotee \E \L OO-quotee \E
\U MBCS-quotee \E \U OO-quotee \E
\l MBCS-quotee \E \l OO-quotee \E
\u MBCS-quotee \E \u OO-quotee \E
\Q MBCS-quotee \E \Q OO-quotee \E
---------------------------------------------------------------------------------------------
Each elements in regular expressions are converted as follows.
----------------------------------------------------------------------------------------------------------------------
in your script script transpiled by this software (on sjis encoding)
----------------------------------------------------------------------------------------------------------------------
qr'.' qr{\G${mb::_anchor}@{[qr'(?:(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|.)' ]}@{[mb::_m_passed()]}}
qr'\B' qr{\G${mb::_anchor}@{[qr'(?:(?<![ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789_])(?![ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789_])|(?<=[ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789_])(?=[ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789_]))' ]}@{[mb::_m_passed()]}}
qr'\D' qr{\G${mb::_anchor}@{[qr'(?:(?![0123456789])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'\H' qr{\G${mb::_anchor}@{[qr'(?:(?![\x09\x20])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'\N' qr{\G${mb::_anchor}@{[qr'(?:(?!\n)(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'\R' qr{\G${mb::_anchor}@{[qr'(?>\r\n|[\x0A\x0B\x0C\x0D])' ]}@{[mb::_m_passed()]}}
qr'\S' qr{\G${mb::_anchor}@{[qr'(?:(?![\t\n\f\r\x20])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'\V' qr{\G${mb::_anchor}@{[qr'(?:(?![\x0A\x0B\x0C\x0D])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'\W' qr{\G${mb::_anchor}@{[qr'(?:(?![ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789_])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'\b' qr{\G${mb::_anchor}@{[qr'(?:(?<![ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789_])(?=[ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789_])|(?<=[ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789_])(?![ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789_]))' ]}@{[mb::_m_passed()]}}
qr'\d' qr{\G${mb::_anchor}@{[qr'[0123456789]' ]}@{[mb::_m_passed()]}}
qr'\h' qr{\G${mb::_anchor}@{[qr'[\x09\x20]' ]}@{[mb::_m_passed()]}}
qr'\s' qr{\G${mb::_anchor}@{[qr'[\t\n\f\r\x20]' ]}@{[mb::_m_passed()]}}
qr'\v' qr{\G${mb::_anchor}@{[qr'[\x0A\x0B\x0C\x0D]' ]}@{[mb::_m_passed()]}}
qr'\w' qr{\G${mb::_anchor}@{[qr'[ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789_]' ]}@{[mb::_m_passed()]}}
qr'[\b]' qr{\G${mb::_anchor}@{[qr'(?:(?=[\x08])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'[[:alnum:]]' qr{\G${mb::_anchor}@{[qr'(?:(?=[\x30-\x39\x41-\x5A\x61-\x7A])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'[[:alpha:]]' qr{\G${mb::_anchor}@{[qr'(?:(?=[\x41-\x5A\x61-\x7A])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'[[:ascii:]]' qr{\G${mb::_anchor}@{[qr'(?:(?=[\x00-\x7F])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'[[:blank:]]' qr{\G${mb::_anchor}@{[qr'(?:(?=[\x09\x20])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'[[:cntrl:]]' qr{\G${mb::_anchor}@{[qr'(?:(?=[\x00-\x1F\x7F])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'[[:digit:]]' qr{\G${mb::_anchor}@{[qr'(?:(?=[\x30-\x39])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'[[:graph:]]' qr{\G${mb::_anchor}@{[qr'(?:(?=[\x21-\x7F])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'[[:lower:]]' qr{\G${mb::_anchor}@{[qr'(?:(?=[abcdefghijklmnopqrstuvwxyz])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'[[:print:]]' qr{\G${mb::_anchor}@{[qr'(?:(?=[\x20-\x7F])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'[[:punct:]]' qr{\G${mb::_anchor}@{[qr'(?:(?=[\x21-\x2F\x3A-\x3F\x40\x5B-\x5F\x60\x7B-\x7E])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'[[:space:]]' qr{\G${mb::_anchor}@{[qr'(?:(?=[\s\x0B])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'[[:upper:]]' qr{\G${mb::_anchor}@{[qr'(?:(?=[ABCDEFGHIJKLMNOPQRSTUVWXYZ])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'[[:word:]]' qr{\G${mb::_anchor}@{[qr'(?:(?=[\x30-\x39\x41-\x5A\x5F\x61-\x7A])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'[[:xdigit:]]' qr{\G${mb::_anchor}@{[qr'(?:(?=[\x30-\x39\x41-\x46\x61-\x66])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'[[:^alnum:]]' qr{\G${mb::_anchor}@{[qr'(?:(?=(?:(?![\x30-\x39\x41-\x5A\x61-\x7A])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F]))))(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'[[:^alpha:]]' qr{\G${mb::_anchor}@{[qr'(?:(?=(?:(?![\x41-\x5A\x61-\x7A])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F]))))(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'[[:^ascii:]]' qr{\G${mb::_anchor}@{[qr'(?:(?=(?:(?![\x00-\x7F])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F]))))(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'[[:^blank:]]' qr{\G${mb::_anchor}@{[qr'(?:(?=(?:(?![\x09\x20])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F]))))(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'[[:^cntrl:]]' qr{\G${mb::_anchor}@{[qr'(?:(?=(?:(?![\x00-\x1F\x7F])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F]))))(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'[[:^digit:]]' qr{\G${mb::_anchor}@{[qr'(?:(?=(?:(?![\x30-\x39])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F]))))(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'[[:^graph:]]' qr{\G${mb::_anchor}@{[qr'(?:(?=(?:(?![\x21-\x7F])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F]))))(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'[[:^lower:]]' qr{\G${mb::_anchor}@{[qr'(?:(?=(?:(?![abcdefghijklmnopqrstuvwxyz])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F]))))(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'[[:^print:]]' qr{\G${mb::_anchor}@{[qr'(?:(?=(?:(?![\x20-\x7F])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F]))))(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'[[:^punct:]]' qr{\G${mb::_anchor}@{[qr'(?:(?=(?:(?![\x21-\x2F\x3A-\x3F\x40\x5B-\x5F\x60\x7B-\x7E])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F]))))(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'[[:^space:]]' qr{\G${mb::_anchor}@{[qr'(?:(?=(?:(?![\s\x0B])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F]))))(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'[[:^upper:]]' qr{\G${mb::_anchor}@{[qr'(?:(?=(?:(?![ABCDEFGHIJKLMNOPQRSTUVWXYZ])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F]))))(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'[[:^word:]]' qr{\G${mb::_anchor}@{[qr'(?:(?=(?:(?![\x30-\x39\x41-\x5A\x5F\x61-\x7A])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F]))))(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr'[[:^xdigit:]]' qr{\G${mb::_anchor}@{[qr'(?:(?=(?:(?![\x30-\x39\x41-\x46\x61-\x66])(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F]))))(?^:(?>(?>[\x81-\x9F\xE0-\xFC][\x00-\xFF]|[\x80-\xFF])|[\x00-\x7F])))' ]}@{[mb::_m_passed()]}}
qr/./ qr{\G${mb::_anchor}@{[qr/(?:@{mb::_dot})/ ]}@{[mb::_m_passed()]}}
qr/\B/ qr{\G${mb::_anchor}@{[qr/(?:@{mb::_B})/ ]}@{[mb::_m_passed()]}}
qr/\D/ qr{\G${mb::_anchor}@{[qr/(?:@{mb::_D})/ ]}@{[mb::_m_passed()]}}
qr/\H/ qr{\G${mb::_anchor}@{[qr/(?:@{mb::_H})/ ]}@{[mb::_m_passed()]}}
qr/\N/ qr{\G${mb::_anchor}@{[qr/(?:@{mb::_N})/ ]}@{[mb::_m_passed()]}}
qr/\R/ qr{\G${mb::_anchor}@{[qr/(?:@{mb::_R})/ ]}@{[mb::_m_passed()]}}
qr/\S/ qr{\G${mb::_anchor}@{[qr/(?:@{mb::_S})/ ]}@{[mb::_m_passed()]}}
qr/\V/ qr{\G${mb::_anchor}@{[qr/(?:@{mb::_V})/ ]}@{[mb::_m_passed()]}}
qr/\W/ qr{\G${mb::_anchor}@{[qr/(?:@{mb::_W})/ ]}@{[mb::_m_passed()]}}
qr/\b/ qr{\G${mb::_anchor}@{[qr/(?:@{mb::_b})/ ]}@{[mb::_m_passed()]}}
qr/\d/ qr{\G${mb::_anchor}@{[qr/(?:@{mb::_d})/ ]}@{[mb::_m_passed()]}}
qr/\h/ qr{\G${mb::_anchor}@{[qr/(?:@{mb::_h})/ ]}@{[mb::_m_passed()]}}
qr/\s/ qr{\G${mb::_anchor}@{[qr/(?:@{mb::_s})/ ]}@{[mb::_m_passed()]}}
qr/\v/ qr{\G${mb::_anchor}@{[qr/(?:@{mb::_v})/ ]}@{[mb::_m_passed()]}}
qr/\w/ qr{\G${mb::_anchor}@{[qr/(?:@{mb::_w})/ ]}@{[mb::_m_passed()]}}
qr/[\b]/ qr{\G${mb::_anchor}@{[qr/@{[mb::_cc(qq[\\b])]}/ ]}@{[mb::_m_passed()]}}
qr/[[:alnum:]]/ qr{\G${mb::_anchor}@{[qr/@{[mb::_cc(qq[[:alnum:]])]}/ ]}@{[mb::_m_passed()]}}
qr/[[:alpha:]]/ qr{\G${mb::_anchor}@{[qr/@{[mb::_cc(qq[[:alpha:]])]}/ ]}@{[mb::_m_passed()]}}
qr/[[:ascii:]]/ qr{\G${mb::_anchor}@{[qr/@{[mb::_cc(qq[[:ascii:]])]}/ ]}@{[mb::_m_passed()]}}
qr/[[:blank:]]/ qr{\G${mb::_anchor}@{[qr/@{[mb::_cc(qq[[:blank:]])]}/ ]}@{[mb::_m_passed()]}}
qr/[[:cntrl:]]/ qr{\G${mb::_anchor}@{[qr/@{[mb::_cc(qq[[:cntrl:]])]}/ ]}@{[mb::_m_passed()]}}
qr/[[:digit:]]/ qr{\G${mb::_anchor}@{[qr/@{[mb::_cc(qq[[:digit:]])]}/ ]}@{[mb::_m_passed()]}}
qr/[[:graph:]]/ qr{\G${mb::_anchor}@{[qr/@{[mb::_cc(qq[[:graph:]])]}/ ]}@{[mb::_m_passed()]}}
qr/[[:lower:]]/ qr{\G${mb::_anchor}@{[qr/@{[mb::_cc(qq[[:lower:]])]}/ ]}@{[mb::_m_passed()]}}
qr/[[:print:]]/ qr{\G${mb::_anchor}@{[qr/@{[mb::_cc(qq[[:print:]])]}/ ]}@{[mb::_m_passed()]}}
qr/[[:punct:]]/ qr{\G${mb::_anchor}@{[qr/@{[mb::_cc(qq[[:punct:]])]}/ ]}@{[mb::_m_passed()]}}
qr/[[:space:]]/ qr{\G${mb::_anchor}@{[qr/@{[mb::_cc(qq[[:space:]])]}/ ]}@{[mb::_m_passed()]}}
qr/[[:upper:]]/ qr{\G${mb::_anchor}@{[qr/@{[mb::_cc(qq[[:upper:]])]}/ ]}@{[mb::_m_passed()]}}
qr/[[:word:]]/ qr{\G${mb::_anchor}@{[qr/@{[mb::_cc(qq[[:word:]])]}/ ]}@{[mb::_m_passed()]}}
qr/[[:xdigit:]]/ qr{\G${mb::_anchor}@{[qr/@{[mb::_cc(qq[[:xdigit:]])]}/ ]}@{[mb::_m_passed()]}}
qr/[[:^alnum:]]/ qr{\G${mb::_anchor}@{[qr/@{[mb::_cc(qq[[:^alnum:]])]}/ ]}@{[mb::_m_passed()]}}
qr/[[:^alpha:]]/ qr{\G${mb::_anchor}@{[qr/@{[mb::_cc(qq[[:^alpha:]])]}/ ]}@{[mb::_m_passed()]}}
qr/[[:^ascii:]]/ qr{\G${mb::_anchor}@{[qr/@{[mb::_cc(qq[[:^ascii:]])]}/ ]}@{[mb::_m_passed()]}}
qr/[[:^blank:]]/ qr{\G${mb::_anchor}@{[qr/@{[mb::_cc(qq[[:^blank:]])]}/ ]}@{[mb::_m_passed()]}}
qr/[[:^cntrl:]]/ qr{\G${mb::_anchor}@{[qr/@{[mb::_cc(qq[[:^cntrl:]])]}/ ]}@{[mb::_m_passed()]}}
qr/[[:^digit:]]/ qr{\G${mb::_anchor}@{[qr/@{[mb::_cc(qq[[:^digit:]])]}/ ]}@{[mb::_m_passed()]}}
qr/[[:^graph:]]/ qr{\G${mb::_anchor}@{[qr/@{[mb::_cc(qq[[:^graph:]])]}/ ]}@{[mb::_m_passed()]}}
qr/[[:^lower:]]/ qr{\G${mb::_anchor}@{[qr/@{[mb::_cc(qq[[:^lower:]])]}/ ]}@{[mb::_m_passed()]}}
qr/[[:^print:]]/ qr{\G${mb::_anchor}@{[qr/@{[mb::_cc(qq[[:^print:]])]}/ ]}@{[mb::_m_passed()]}}
qr/[[:^punct:]]/ qr{\G${mb::_anchor}@{[qr/@{[mb::_cc(qq[[:^punct:]])]}/ ]}@{[mb::_m_passed()]}}
qr/[[:^space:]]/ qr{\G${mb::_anchor}@{[qr/@{[mb::_cc(qq[[:^space:]])]}/ ]}@{[mb::_m_passed()]}}
qr/[[:^upper:]]/ qr{\G${mb::_anchor}@{[qr/@{[mb::_cc(qq[[:^upper:]])]}/ ]}@{[mb::_m_passed()]}}
qr/[[:^word:]]/ qr{\G${mb::_anchor}@{[qr/@{[mb::_cc(qq[[:^word:]])]}/ ]}@{[mb::_m_passed()]}}
qr/[[:^xdigit:]]/ qr{\G${mb::_anchor}@{[qr/@{[mb::_cc(qq[[:^xdigit:]])]}/ ]}@{[mb::_m_passed()]}}
----------------------------------------------------------------------------------------------------------------------
DEPENDENCIES
This software requires perl5.00503 or later.
BUGS, LIMITATIONS, and COMPATIBILITY
I have tested and verified this software using the best of my ability. However, a software containing much regular expression is bound to contain some bugs. Thus, if you happen to find a bug that's in this software and not your own program, you can try to reduce it to a minimal test case and then report it to the following author's address. If you have an idea that could make this a more useful tool, please let everyone share it.
Special Variables $` and $& need /( Capture All )/
Because $` and $& use $1. ---------------------------------------------------------------------------------------------------------------------- in your script after m//, works as after s///, works as ---------------------------------------------------------------------------------------------------------------------- $` CORE::substr($&, 0, -CORE::length($1)) $1 ${`} CORE::substr($&, 0, -CORE::length($1)) $1 $PREMATCH CORE::substr($&, 0, -CORE::length($1)) $1 ${^PREMATCH} CORE::substr($&, 0, -CORE::length($1)) $1 $& $1 CORE::substr($&, CORE::length($1)) ${&} $1 CORE::substr($&, CORE::length($1)) $MATCH $1 CORE::substr($&, CORE::length($1)) ${^MATCH} $1 CORE::substr($&, CORE::length($1)) ----------------------------------------------------------------------------------------------------------------------
return value from tr///s
tr/// (or y///) operator with /s modifier returns 1 always. If you need right number, you can use mb::tr().
chdir
Function chdir() cannot work if path is ended by chr(0x5C).
see also, Bug #81839 chdir does not work with chr(0x5C) at end of path http://bugs.activestate.com/show_bug.cgi?id=81839
mb::substr as Lvalue
If Perl version is older than 5.14, mb::substr differs from CORE::substr, and cannot be used as a lvalue. To change part of a string, you need use the optional fourth argument which is the replacement string.
mb::substr($string, 13, 4, "JPerl");
Limitation of Regular Expression
This software has limitation from \G in multibyte anchoring. Only Perl 5.30.0 or later can treat the codepoint string which exceeds 65534 octets with a regular expression, and only Perl 5.10.1 or later can 32766 octets.
see also, The upper limit "n" specifiable in a regular expression quantifier of the form "{m,n}" has been doubled to 65534 https://metacpan.org/pod/release/XSAWYERX/perl-5.30.0/pod/perldelta.pod#The-upper-limit-%22n%22-specifiable-in-a-regular-expression-quantifier-of-the-form-%22%7Bm,n%7D%22-has-been-doubled-to-65534 In 5.10.0, the * quantifier in patterns was sometimes treated as {0,32767} http://perldoc.perl.org/perl5101delta.html [perl #116379] \G can't treat over 32767 octet http://www.nntp.perl.org/group/perl.perl5.porters/2013/01/msg197320.html perlre - Perl regular expressions http://perldoc.perl.org/perlre.html perlre length limit http://stackoverflow.com/questions/4592467/perlre-length-limit
fc(), lc(), lcfirst(), uc(), and ucfirst()
fc() not supported. lc(), lcfirst(), uc(), and ucfirst() support US-ASCII only.
character ranges by hyphen
Character ranges by hyphen of regular expression supports US-ASCII only. And tr///, y/// doesn't support ranges by hyphen.
cloister of regular expression
The cloister (?s) and (?i) of a regular expression will not be implemented for the time being. Cloister (?s) can be substituted with the .(dot) and \N on /s modifier.
Empty Variable in Regular Expression
Unlike literal null string, an interpolated variable evaluated to the empty string can't use the most recent pattern from a previous successful regular expression.
Limitation of ?? and m??
Multibyte character needs ( ) which is before {n,m}, {n,}, {n}, *, and + in ?? or m??. As a result, you need to rewrite a script about $1,$2,$3,... You cannot use (?: ), ?, {n,m}?, {n,}?, and {n}? in ?? and m??, because delimiter of m?? is '?'.
Look-behind Assertion
The look-behind assertion like (?<=[A-Z]) is not prevented from matching trail octet of the previous MBCS codepoint.
Modifier /a /d /l and /u of Regular Expression
The concept of this software is not to use two or more encoding methods as literal string and literal of regexp in one Perl script. Therefore, modifier /a, /d, /l, and /u are not supported. \d means [0-9] universally.
Named Codepoint
A named codepoint, such \N{GREEK SMALL LETTER EPSILON}, \N{greek:epsilon}, or \N{epsilon} is not supported.
Unicode Properties (aka Codepoint Properties) of Regular Expression
Unicode properties (aka codepoint properties) of regexp are not available. Also (?[]) in regexp of Perl 5.18 is not available. There is no plans to currently support these.
${^WIN32_SLOPPY_STAT} is ignored
Even if ${^WIN32_SLOPPY_STAT} is set to a true value, file test functions mb::*(), mb::lstat(), and mb::stat() on Microsoft Windows open the file for the path which has chr(0x5c) at end.
Delimiter of String and Regexp
qq//, q//, qw//, qx//, qr//, m//, s///, tr///, and y/// can't use a wide codepoint as the delimiter.
\b{...} Boundaries in Regular Expressions
Following \b{...} available starting in v5.22 are not supported.
\b{gcb} or \b{g} Unicode "Grapheme Cluster Boundary" \b{sb} Unicode "Sentence Boundary" \b{wb} Unicode "Word Boundary" \B{gcb} or \B{g} Unicode "Grapheme Cluster Boundary" doesn't match \B{sb} Unicode "Sentence Boundary" doesn't match \B{wb} Unicode "Word Boundary" doesn't match
format
Function "format" can't handle MBCS codepoints unlike JPerl.
Mac OS 9
Apple Inc. Mac OS 9 not supported. sorry about it.
UTF8 Flag Considered Harmful, and Our Goal
P.401 See chapter 15: Unicode of ISBN 0-596-00027-8 Programming Perl Third Edition.
Before the introduction of Unicode support in perl, The eq operator just compared the byte-strings represented by two scalars. Beginning with perl 5.8, eq compares two byte-strings with simultaneous consideration of the UTF8 flag.
-- we have been taught so for a long time.
Perl is a powerful language for everyone, but UTF8 flag is a barrier for common beginners. Because everyone can only one task on one time. So calling Encode::encode() and Encode::decode() in application program is not better way. Making two scripts for information processing and encoding conversion may be better. Please trust me.
/*
* You are not expected to understand this.
*/
Information processing model beginning with perl 5.8
+----------------------+---------------------+
| Text strings | |
+----------+-----------| Binary strings |
| UTF-8 | Latin-1 | |
+----------+-----------+---------------------+
| UTF8 | Not UTF8 |
| Flagged | Flagged |
+--------------------------------------------+
http://perl-users.jp/articles/advent-calendar/2010/casual/4
Confusion of Perl string model is made from double meanings of
"Binary string."
Meanings of "Binary string" are
1. Non-Text string
2. Digital octet string
Let's draw again using those term.
+----------------------+---------------------+
| Text strings | |
+----------+-----------| Non-Text strings |
| UTF-8 | Latin-1 | |
+----------+-----------+---------------------+
| UTF8 | Not UTF8 |
| Flagged | Flagged |
+--------------------------------------------+
| Digital octet string |
+--------------------------------------------+
There are people who don't agree to change in the character string processing model at Perl 5.8. It is impossible to get agreement it from majority of Perl programmers who are not heavy users. How to solve it by returning to an original Perl, let's read page 402 of the Programming Perl, 3rd edition, again.
Information processing model beginning with perl3 or this software
of UNIX/C-ism.
+--------------------------------------------+
| Text string as Digital octet string |
| Digital octet string as Text string |
+--------------------------------------------+
| Not UTF8 Flagged, No MOJIBAKE |
+--------------------------------------------+
In UNIX Everything is a File
- In UNIX everything is a stream of bytes
- In UNIX the filesystem is used as a universal name space
Native Encoding Scripting
- native encoding of file contents
- native encoding of file name on filesystem
- native encoding of command line
- native encoding of environment variable
- native encoding of API
- native encoding of network packet
- native encoding of database
Ideally, We'd like to achieve these five Goals:
Goal #1:
Old byte-oriented programs should not spontaneously break on the old byte-oriented data they used to work on.
This software attempts to achieve this goal by embedded functions work as traditional and stably.
Goal #2:
Old byte-oriented programs should magically start working on the new character-oriented data when appropriate.
This software is not a magician, so cannot see your mind and run it.
You must decide and write octet semantics or codepoint semantics yourself in case by case.
figure of Goal #1 and Goal #2.
Goal #1 Goal #2 (a) (b) (c) (d) (e) +--------------+-------+-------+-------+-------+-------+ | data | Old | Old | New | Old | New | +--------------+-------+-------+-------+-------+-------+ | script | Old | Old | New | +--------------+-------+---------------+---------------+ | interpreter | Old | New | +--------------+-------+-------------------------------+ Old --- Old byte-oriented New --- New character-oriented
There is a combination from (a) to (e) in data, script, and interpreter of old and new. Let's add JPerl, utf8 pragma, and this software.
(a) (b) (c) (d) (e) JPerl,mb utf8 +--------------+-------+-------+-------+-------+-------+ | data | Old | Old | New | Old | New | +--------------+-------+-------+-------+-------+-------+ | script | Old | Old | New | +--------------+-------+---------------+---------------+ | interpreter | Old | New | +--------------+-------+-------------------------------+ Old --- Old byte-oriented New --- New character-oriented
The reason why JPerl is very excellent is that it is at the position of (c). That is, it is almost not necessary to write a special code to process new codepoint oriented script.
Goal #3:
Programs should run just as fast in the new character-oriented mode as in the old byte-oriented mode.
It is impossible. Because the following time is necessary.
(1) Time of escape script for old byte-oriented perl.
(2) Time of processing regular expression by escaped script while multibyte anchoring.
Goal #4:
Perl should remain one language, rather than forking into a byte-oriented Perl and a character-oriented Perl.
JPerl remains one Perl "language" by forking to two "interpreters." However, the Perl core team did not desire fork of the "interpreter." As a result, Perl "language" forked contrary to goal #4.
A codepoint oriented perl is not necessary to make it specially, because a byte-oriented perl can already treat the binary data. This software is only an application program of byte-oriented Perl, a filter program.
And you will get support from the Perl community, when you solve the problem by the Perl script.
mb.pm modulino keeps one "language" and one "interpreter."
Goal #5:
mb.pm users will be able to maintain mb.pm by Perl.
May the mb.pm be with you, always.
Back when Programming Perl, 3rd ed. was written, UTF8 flag was not born and Perl is designed to make the easy jobs easy. This software provides programming environment like at that time.
Perl's motto
Some computer scientists (the reductionists, in particular) would
like to deny it, but people have funny-shaped minds. Mental geography
is not linear, and cannot be mapped onto a flat surface without
severe distortion. But for the last score years or so, computer
reductionists have been first bowing down at the Temple of Orthogonality,
then rising up to preach their ideas of ascetic rectitude to any who
would listen.
Their fervent but misguided desire was simply to squash your mind to
fit their mindset, to smush your patterns of thought into some sort of
Hyperdimensional Flatland. It's a joyless existence, being smushed.
--- Learning Perl on Win32 Systems
If you think this is a big headache, you're right. No one likes
this situation, but Perl does the best it can with the input and
encodings it has to deal with. If only we could reset history and
not make so many mistakes next time.
--- Learning Perl 6th Edition
The most important thing for most people to know about handling
Unicode data in Perl, however, is that if you don't ever use any Uni-
code data -- if none of your files are marked as UTF-8 and you don't
use UTF-8 locales -- then you can happily pretend that you're back in
Perl 5.005_03 land; the Unicode features will in no way interfere with
your code unless you're explicitly using them. Sometimes the twin
goals of embracing Unicode but not disturbing old-style byte-oriented
scripts has led to compromise and confusion, but it's the Perl way to
silently do the right thing, which is what Perl ends up doing.
--- Advanced Perl Programming, 2nd Edition
AUTHOR
INABA Hitoshi <ina@cpan.org>
This project was originated by INABA Hitoshi.
LICENSE AND COPYRIGHT
This software is free software; you can redistribute it and/or modify it under the same terms as Perl itself. See perlartistic.
This software is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
SEE ALSO
perlunicode, Encode, open, utf8, bytes, Arabic, Big5, Big5HKSCS, CP932::R2,
CP932IBM::R2, CP932NEC::R2, CP932X::R2, Char::Arabic, Char::Big5HKSCS,
Char::Big5Plus, Char::Cyrillic, Char::EUCJP, Char::EUCTW, Char::GB18030,
Char::GBK, Char::Greek, Char::HP15, Char::Hebrew, Char::INFORMIXV6ALS,
Char::JIS8, Char::KOI8R, Char::KOI8U, Char::KPS9566, Char::Latin1,
Char::Latin10, Char::Latin2, Char::Latin3, Char::Latin4, Char::Latin5,
Char::Latin6, Char::Latin7, Char::Latin8, Char::Latin9, Char::OldUTF8,
Char::Sjis, Char::TIS620, Char::UHC, Char::USASCII, Char::UTF2,
Char::Windows1252, Char::Windows1258, Cyrillic, GBK, Greek, IOas::CP932,
IOas::CP932IBM, IOas::CP932NEC, IOas::CP932X, IOas::SJIS2004, Jacode,
Jacode4e, Jacode4e::RoundTrip, KOI8R, KOI8U, KPS9566, KSC5601, Latin1,
Latin10, Latin2, Latin3, Latin4, Latin5, Latin6, Latin7, Latin8, Latin9,
Modern::Open, SJIS2004::R2, Sjis, UTF2, UTF8::R2, Windows1250, Windows1252,
Windows1254, Windows1257, Windows1258.
Announcing Perl 7
Jun 24, 2020 by brian d foy
https://www.perl.com/article/announcing-perl-7/
PERL PUROGURAMINGU
Larry Wall, Randal L.Schwartz, Yoshiyuki Kondo
December 1997
ISBN 4-89052-384-7
http://www.context.co.jp/~cond/books/old-books.html
Programming Perl, Second Edition
By Larry Wall, Tom Christiansen, Randal L. Schwartz
October 1996
Pages: 670
ISBN 10: 1-56592-149-6 | ISBN 13: 9781565921498
http://shop.oreilly.com/product/9781565921498.do
Programming Perl, Third Edition
By Larry Wall, Tom Christiansen, Jon Orwant
Third Edition July 2000
Pages: 1104
ISBN 10: 0-596-00027-8 | ISBN 13: 9780596000271
http://shop.oreilly.com/product/9780596000271.do
The Perl Language Reference Manual (for Perl version 5.12.1)
by Larry Wall and others
Paperback (6"x9"), 724 pages
Retail Price: $39.95 (pound 29.95 in UK)
ISBN-13: 978-1-906966-02-7
https://dl.acm.org/doi/book/10.5555/1893028
Perl Pocket Reference, 5th Edition
By Johan Vromans
Publisher: O'Reilly Media
Released: July 2011
Pages: 102
http://shop.oreilly.com/product/0636920018476.do
Programming Perl, 4th Edition
By: Tom Christiansen, brian d foy, Larry Wall, Jon Orwant
Publisher: O'Reilly Media
Formats: Print, Ebook, Safari Books Online
Released: March 2012
Pages: 1130
Print ISBN: 978-0-596-00492-7 | ISBN 10: 0-596-00492-3
Ebook ISBN: 978-1-4493-9890-3 | ISBN 10: 1-4493-9890-1
http://shop.oreilly.com/product/9780596004927.do
Perl Cookbook
By Tom Christiansen, Nathan Torkington
August 1998
Pages: 800
ISBN 10: 1-56592-243-3 | ISBN 13: 978-1-56592-243-3
http://shop.oreilly.com/product/9781565922433.do
Perl Cookbook, Second Edition
By Tom Christiansen, Nathan Torkington
Second Edition August 2003
Pages: 964
ISBN 10: 0-596-00313-7 | ISBN 13: 9780596003135
http://shop.oreilly.com/product/9780596003135.do
Perl in a Nutshell, Second Edition
By Stephen Spainhour, Ellen Siever, Nathan Patwardhan
Second Edition June 2002
Pages: 760
Series: In a Nutshell
ISBN 10: 0-596-00241-6 | ISBN 13: 9780596002411
http://shop.oreilly.com/product/9780596002411.do
Learning Perl on Win32 Systems
By Randal L. Schwartz, Erik Olson, Tom Christiansen
August 1997
Pages: 306
ISBN 10: 1-56592-324-3 | ISBN 13: 9781565923249
http://shop.oreilly.com/product/9781565923249.do
Learning Perl, Fifth Edition
By Randal L. Schwartz, Tom Phoenix, brian d foy
June 2008
Pages: 352
Print ISBN:978-0-596-52010-6 | ISBN 10: 0-596-52010-7
Ebook ISBN:978-0-596-10316-3 | ISBN 10: 0-596-10316-6
http://shop.oreilly.com/product/9780596520113.do
Learning Perl, 6th Edition
By Randal L. Schwartz, brian d foy, Tom Phoenix
June 2011
Pages: 390
ISBN-10: 1449303587 | ISBN-13: 978-1449303587
http://shop.oreilly.com/product/0636920018452.do
Advanced Perl Programming, 2nd Edition
By Simon Cozens
June 2005
Pages: 300
ISBN-10: 0-596-00456-7 | ISBN-13: 978-0-596-00456-9
http://shop.oreilly.com/product/9780596004569.do
Perl RESOURCE KIT UNIX EDITION
Futato, Irving, Jepson, Patwardhan, Siever
ISBN 10: 1-56592-370-7
http://shop.oreilly.com/product/9781565923706.do
Perl Resource Kit -- Win32 Edition
Erik Olson, Brian Jepson, David Futato, Dick Hardt
ISBN 10:1-56592-409-6
http://shop.oreilly.com/product/9781565924093.do
MODAN Perl NYUMON
By Daisuke Maki
2009/2/10
Pages: 344
ISBN 10: 4798119172 | ISBN 13: 978-4798119175
https://www.seshop.com/product/detail/10250
Understanding Japanese Information Processing
By Ken Lunde
January 1900
Pages: 470
ISBN 10: 1-56592-043-0 | ISBN 13: 9781565920439
http://shop.oreilly.com/product/9781565920439.do
CJKV Information Processing
Chinese, Japanese, Korean & Vietnamese Computing
By Ken Lunde
First Edition January 1999
Pages: 1128
ISBN 10: 1-56592-224-7 | ISBN 13: 9781565922242
http://shop.oreilly.com/product/9781565922242.do
DB2 GIJUTSU ZENSHO
By BM Japan Systems Engineering Co.,Ltd. and IBM Japan, Ltd.
2004/05
Pages: 887
ISBN-10: 4756144659 | ISBN-13: 978-4756144652
https://iss.ndl.go.jp/books/R100000002-I000007400836-00
Mastering Regular Expressions, Second Edition
By Jeffrey E. F. Friedl
Second Edition July 2002
Pages: 484
ISBN 10: 0-596-00289-0 | ISBN 13: 9780596002893
http://shop.oreilly.com/product/9780596002893.do
Mastering Regular Expressions, Third Edition
By Jeffrey E. F. Friedl
Third Edition August 2006
Pages: 542
ISBN 10: 0-596-52812-4 | ISBN 13:9780596528126
http://shop.oreilly.com/product/9780596528126.do
Regular Expressions Cookbook
By Jan Goyvaerts, Steven Levithan
May 2009
Pages: 512
ISBN 10:0-596-52068-9 | ISBN 13: 978-0-596-52068-7
http://shop.oreilly.com/product/9780596520694.do
Regular Expressions Cookbook, 2nd Edition
By Steven Levithan, Jan Goyvaerts
Released August 2012
Pages: 612
ISBN: 9781449327453
https://www.oreilly.com/library/view/regular-expressions-cookbook/9781449327453/
JIS KANJI JITEN
By Kouji Shibano
Pages: 1456
ISBN 4-542-20129-5
https://www.e-hon.ne.jp/bec/SA/Detail?refISBN=4542201295
UNIX MAGAZINE
1993 Aug
Pages: 172
T1008901080816 ZASSHI 08901-8
LINUX NIHONGO KANKYO
By YAMAGATA Hiroo, Stephen J. Turnbull, Craig Oda, Robert J. Bickel
June, 2000
Pages: 376
ISBN 4-87311-016-5
https://www.oreilly.co.jp/books/4873110165/
Windows NT Shell Scripting
By Timothy Hill
April 27, 1998
Pages: 400
ISBN 10: 1578700477 | ISBN 13: 9781578700479
https://www.abebooks.com/9781578700479/Windows-NT-Scripting-Circle-Hill-1578700477/plp
Windows(R) Command-Line Administrators Pocket Consultant, 2nd Edition
By William R. Stanek
February 2009
Pages: 594
ISBN 10: 0-7356-2262-0 | ISBN 13: 978-0-7356-2262-3
https://www.abebooks.com/9780735622623/Windows-Command-Line-Administrators-Pocket-Consultant-0735622620/plp
Kaoru Maeda, Perl's history Perl 1,2,3,4
https://www.slideshare.net/KaoruMaeda/perl-perl-1234
nurse, What is "string"
https://naruse.hateblo.jp/entries/2014/11/07#1415355181
NISHIO Hirokazu, What's meant "string as a sequence of characters"?
https://nishiohirokazu.hatenadiary.org/entry/20141107/1415286729
nurse, History of Japanese EUC 22:00
https://naruse.hateblo.jp/entries/2009/03/08
Mike Whitaker, Perl And Unicode
https://www.slideshare.net/Penfold/perl-and-unicode
About Windows and Japanese text
https://blogs.windows.com/japan/2020/02/20/about-windows-and-japanese-text/
About Windows diagnostic data
https://blogs.windows.com/japan/2019/12/05/about-windows-diagnostic-data/
Ricardo Signes, Perl 5.14 for Pragmatists
https://www.slideshare.net/rjbs/perl-514-8809465
Ricardo Signes, What's New in Perl? v5.10 - v5.16 #'
https://www.slideshare.net/rjbs/whats-new-in-perl-v510-v516
YAP(achimon)C::Asia Hachioji 2016 mid in Shinagawa
Kenichi Ishigaki (@charsbar) July 3, 2016 YAP(achimon)C::Asia Hachioji 2016mid
https://www.slideshare.net/charsbar/cpan-63708689
Causes and countermeasures for garbled Japanese characters in perl
https://prozorec.hatenablog.com/entry/2018/03/19/080000
Perl regular expression bug?
http://moriyoshi.hatenablog.com/entry/20090315/1237103809
http://moriyoshi.hatenablog.com/entry/20090320/1237562075
About Windows and Japanese text
https://blogs.windows.com/japan/2020/02/20/about-windows-and-japanese-text/
About Windows diagnostic data
https://blogs.windows.com/japan/2019/12/05/about-windows-diagnostic-data/
CPAN Directory INABA Hitoshi
https://metacpan.org/author/INA
http://backpan.cpantesters.org/authors/id/I/IN/INA/
https://metacpan.org/release/Jacode4e-RoundTrip
https://metacpan.org/release/Jacode4e
https://metacpan.org/release/Jacode
Recent Perl packages by "INABA Hitoshi"
http://code.activestate.com/ppm/author:INABA-Hitoshi/
ACKNOWLEDGEMENTS
This software was made referring to software and the document that the following hackers or persons had made. I am thankful to all persons.
Rick Yamashita, Shift_JIS
https://shino.tumblr.com/post/116166805/%E5%B1%B1%E4%B8%8B%E8%89%AF%E8%94%B5%E3%81%A8%E7%94%B3%E3%81%97%E3%81%BE%E3%81%99-%E7%A7%81%E3%81%AF1981%E5%B9%B4%E5%BD%93%E6%99%82us%E3%81%AE%E3%83%9E%E3%82%A4%E3%82%AF%E3%83%AD%E3%82%BD%E3%83%95%E3%83%88%E3%81%A7%E3%82%B7%E3%83%95%E3%83%88jis%E3%81%AE%E3%83%87%E3%82%B6%E3%82%A4%E3%83%B3%E3%82%92%E6%8B%85%E5%BD%93
http://www.wdic.org/w/WDIC/%E3%82%B7%E3%83%95%E3%83%88JIS
Larry Wall, Perl
http://www.perl.org/
Kazumasa Utashiro, jcode.pl
https://metacpan.org/author/UTASHIRO
ftp://ftp.iij.ad.jp/pub/IIJ/dist/utashiro/perl/
Jeffrey E. F. Friedl, Mastering Regular Expressions
http://regex.info/
SADAHIRO Tomoyuki, The right way of using Shift_JIS
http://nomenclator.la.coocan.jp/perl/shiftjis.htm
https://metacpan.org/author/SADAHIRO
Yukihiro "Matz" Matsumoto, YAPC::Asia2006 Ruby on Perl(s)
https://archive.org/details/YAPCAsia2006TokyoRubyonPerls
jscripter, For jperl users
http://text.world.coocan.jp/jperl.html
Bruce., Unicode in Perl
http://www.rakunet.org/tsnet/TSabc/18/546.html
Hiroaki Izumi, Shouldn't use Perl5.8 / Perl5.10 on the Windows
https://sites.google.com/site/hiroa63iz/perlwin
Yuki Kimoto, Is it true that you shouldn't use Perl on Windows?
https://philosophy.perlzemi.com/blog/20200122080040.html
chaichanPaPa, Matching Shift_JIS file name
http://chaipa.hateblo.jp/entry/20080802/1217660826
SUZUKI Norio, Jperl
http://www.dennougedougakkai-ndd.org/alte/3tte/jperl-5.005_03@ap522/homepage2.nifty.com..kipp..perl..jperl..index.html
WATANABE Hirofumi, Jperl
https://www.cpan.org/src/5.0/jperl/
https://metacpan.org/author/WATANABE
ftp://ftp.oreilly.co.jp/pcjp98/watanabe/jperlconf.ppt
Chuck Houpt, Michiko Nozu, MacJPerl
https://habilis.net/macjperl/index.j.html
Kenichi Ishigaki, Pod-PerldocJp, Welcome to modern Perl world
https://metacpan.org/release/Pod-PerldocJp
http://gihyo.jp/dev/serial/01/modern-perl/0031
http://gihyo.jp/dev/serial/01/modern-perl/0032
http://gihyo.jp/dev/serial/01/modern-perl/0033
Fuji, Goro (gfx), Perl Hackers Hub No.16
http://gihyo.jp/dev/serial/01/perl-hackers-hub/001602
Dan Kogai, Encode module
https://metacpan.org/release/Encode
https://archive.org/details/YAPCAsia2006TokyoPerl58andUnicodeMythsFactsandChanges
http://yapc.g.hatena.ne.jp/jkondo/
Takahashi Masatuyo, JPerl Wiki
https://jperl.fandom.com/ja/wiki/JPerl_Wiki
Juerd, Perl Unicode Advice
https://juerd.nl/site.plp/perluniadvice
daily dayflower, 2008-06-25 perluniadvice
https://dayflower.hatenablog.com/entry/20080625/1214374293
Unicode issues in Perl
https://www.i-programmer.info/programming/other-languages/1973-unicode-issues-in-perl.html
Jesse Vincent, Compatibility is a virtue
https://www.nntp.perl.org/group/perl.perl5.porters/2010/05/msg159825.html
Tokyo-pm archive
https://mail.pm.org/pipermail/tokyo-pm/
https://mail.pm.org/pipermail/tokyo-pm/1999-September/001844.html
https://mail.pm.org/pipermail/tokyo-pm/1999-September/001854.html
Error: Runtime exception on jperl 5.005_03
http://www.rakunet.org/tsnet/TSperl/12/374.html
http://www.rakunet.org/tsnet/TSperl/12/375.html
http://www.rakunet.org/tsnet/TSperl/12/376.html
http://www.rakunet.org/tsnet/TSperl/12/377.html
http://www.rakunet.org/tsnet/TSperl/12/378.html
http://www.rakunet.org/tsnet/TSperl/12/379.html
http://www.rakunet.org/tsnet/TSperl/12/380.html
http://www.rakunet.org/tsnet/TSperl/12/382.html
ruby-list
http://blade.nagaokaut.ac.jp/ruby/ruby-list/index.shtml
http://blade.nagaokaut.ac.jp/cgi-bin/scat.rb/ruby/ruby-list/2440
http://blade.nagaokaut.ac.jp/cgi-bin/scat.rb/ruby/ruby-list/2446
http://blade.nagaokaut.ac.jp/cgi-bin/scat.rb/ruby/ruby-list/2569
http://blade.nagaokaut.ac.jp/cgi-bin/scat.rb/ruby/ruby-list/9427
http://blade.nagaokaut.ac.jp/cgi-bin/scat.rb/ruby/ruby-list/9431
http://blade.nagaokaut.ac.jp/cgi-bin/scat.rb/ruby/ruby-list/10500
http://blade.nagaokaut.ac.jp/cgi-bin/scat.rb/ruby/ruby-list/10501
http://blade.nagaokaut.ac.jp/cgi-bin/scat.rb/ruby/ruby-list/10502
http://blade.nagaokaut.ac.jp/cgi-bin/scat.rb/ruby/ruby-list/12385
http://blade.nagaokaut.ac.jp/cgi-bin/scat.rb/ruby/ruby-list/12392
http://blade.nagaokaut.ac.jp/cgi-bin/scat.rb/ruby/ruby-list/12393
http://blade.nagaokaut.ac.jp/cgi-bin/scat.rb/ruby/ruby-list/19156
TANABATA - The Star Festival - common legend of east asia
https://ja.wikipedia.org/wiki/%E4%B8%83%E5%A4%95
https://ko.wikipedia.org/wiki/%EC%B9%A0%EC%84%9D
https://zh-classical.wikipedia.org/wiki/%E4%B8%83%E5%A4%95
https://zh-yue.wikipedia.org/wiki/%E4%B8%83%E5%A7%90%E8%AA%95
https://zh.wikipedia.org/wiki/%E4%B8%83%E5%A4%95