NAME
Data::Kanji::Kanjidic - parse the "kanjidic" kanji data file
SYNOPSIS
use Data::Kanji::Kanjidic;
FUNCTIONS
parse_kanjidic
my $kanjidic = parse_kanjidic ('kanjidic');
The input is the name of the file. The output is a hash reference. The keys of the hash reference are kanji, encoded as Unicode.
This function assumes that the kanjidic file is encoded using the EUC-JP encoding.
parse_entry
my %values = parse_entry ($line);
Parse one line of kanjidic.
The possible keys and values of the returned hash are
- B
- C
- DB
- DC
- DF
- DG
- DH
- DJ
- DK
- DM
- DO
- DR
- DS
- DT
- E
- F
- G
- H
- I
- IN
- J
- K
- L
- MN
- MP
- N
- O
- P
- Q
- S
- T
- U
- V
- W
- X
- XDR
- XH
- XI
- XJ
- XN
- XO
- Y
- ZBP
- ZPP
- ZRP
- ZSP
For detailed explanations of these codes, see "About Kanjidic".
SEE ALSO
Other Perl modules
- Lingua::JP::Kanjidic
-
Parses old version of kanjidic.
Kanjidic
To download kanjidic, visit the Monash University Nihongo FTP server maintained by Professor Jim Breen at ftp://ftp.edrdg.org/pub/Nihongo/00INDEX.html.
The description of kanjidic is in ftp://ftp.edrdg.org/pub/Nihongo/kanjidic.html.
AUTHOR
Ben Bullock, <bkb@cpan.org>
COPYRIGHT & LICENCE
This package and associated files are copyright (C) 2012 Ben Bullock.
You can use, copy, modify and redistribute this package and associated files under the Perl Artistic Licence or the GNU General Public Licence.