NAME

Lingua::JA::Numbers - Converts numeric values into their Japanese string equivalents and vice versa

VERSION

$Revision: 0.3 $ $Date: 2005/08/18 07:16:13 $

SYNOPSIS

use Lingua::JA::Numbers;

# OO Style
my $ja = Lingua::JA::Numbers->new(1234567890, {style=>'romaji'});
# JuuNiOkuSanzenYonHyakuGoJuuRokuManNanaSenHappyakuKyuuJuu
# $ja->get_string is implictly called
print "$ja\n"; 
print $ja+0, "\n";
# 1234567890
# $ja->number is implicitly called.
# 1234567890

# Functional Style
my $str = ja2num(1234567890, {style=>'romaji'});
print "$str\n";
# JuuNiOkuSanzenYonHyakuGoJuuRokuManNanaSenHappyakuKyuuJuu
print num2ja($str), "\n";
# 1234567890

INSTALLATION

To install this module type the following:

perl Makefile.PL
make
make test
make install

DEPENDENCIES

This module requires perl 5.8.1 or better. It also uses bignum internally (that comes with perl core).

DESCRIPTION

This module converts Japanese text in UTF-8 (or romaji in ascii) to number, AND vice versa. Though this pod is in English and all examples are in romaji to make http://search.cpan.org/ happy, this module does accept Japanese in UTF-8. Try the code below to see it.

perl -MLingua::JA::Numbers \
  -e '$y="\x{4e8c}\x{5343}\x{4e94}"; printf "(C) %d Dan Kogai\n", ja2num($y)'

CAVEAT

DO NOT BE CONFUSED WITH Lingua::JA::Number by Mike Schilli. This module is far more comprehensive. As of 0.03, it even does its to_string() upon request.

METHODS

This module supports the following methods. They are compliant with Lingua::En::Numbers and others.

->new($str [, {key=>var ...} ])

Constructs an object via $str. String can either be number or a string in Japanese that represents a number. Optionally take options. See "Functions" for options.

->parse($str, [, {key=>var ...} ])

Parses $str.

->opt(key => var)

Changes internal options.

->get_string =item ->stringify =item ->as_string

Stringifies the object accordingly to the options. The object auto-stringifies via overload so you don't usally need this.

->as_number =item ->numify

Numifies the object. The object auto-numifies via overload so you don't usally need this.

Functions

This module supports the funcitons below;

num2ja($num, [{key => value ... }]); =item number_to_ja()

Converts the number to Japanese accordingly to the options. number_to_ja() is just an alias to num2ja().

# \x{767e}\x{4e8c}\x{5341}\x{4e09}
num2ja(123)
# HyakuNijuuSan
num2ja(123, {style=>"romaji"})

This function supports the options as follows;

style => (kanji|romaji|hiragana|katakana)

Sets which style (well, script but the word "script" is confusing). You can choose "kanji" (default), romaji, hiragana and katakana.

daiji => (0|1|2)

When 1, daiji is used. When 2 or larger, even those that are not represented as daiji will be in daiji. See http://ja.wikipedia.org/wiki/%E5%A4%A7%E5%AD%97_%28%E6%95%B0%E5%AD%97%29 for details.

When this option is set to non-zero, style is ignored (kanji).

p_one

Forciblly prefix one even when not needed.

print num2ja(1110, {style=>"romaji"}), "\n";
# SenHyakuJuu
print num2ja(1110, {style=>"romaji", p_one=>1}), "\n";
# IchiSenIchiHyakuIchiJuu
fixed4

Just stack numbers for thousands.

print num2ja(2005, {style=>"romaji"}), "\n";
NiSenGo
print num2ja(2005, {style=>"romaji", fixed4=>1}), "\n";
NiZeroZeroGo
with_arabic

Like fixed4 but stack these numbers with arabic.

print num2ja(20050831, {style=>"romaji"}), "\n";
# NiSenGoManHappyakuSanJuuIchi
print num2ja(20050831, {style=>"romaji" with_arabic=>1}), "\n";
# 2005Man0831
manman

Depreciated. When set to non-zero, it 8-digit (4x2) denomination for 'Goku' (10**48) and above.

print num2ja(10**60, {style=>"romaji"}), "\n";
# IchiAsougi
print num2ja(10**60, {style=>"romaji" manman=>1}), "\n";
# IchiManKougasha
ja2num($str, [{key => value ... }]); =item ja_to_number()

Converts Japanese number to number. Unlike num2ja(), its counterpart, it supports only one option, manman = (0|1)> which toggles 8-digit denomination.

It is pretty liberal on what it takes. For instance they all return 20050831.

ja2num("NisenGoManHappyakuSanjuIchi")
ja2num("NiZeroZeroGoZeroHachiSanIchi")
ja2num("2005Man0831")

ja2num() hacks

ja2num() acts like a calculator -- the easiest way to support scientific notation was just that. Try

ja2num("6.0225Kakeru10No23Jou")

to_string() of Lingua::JA::Number

Though not exported by default, This module comes with to_string() that is (upper-)compatibile with Lingua::JA::Number.

my @words = Lingua::JA::Numbers::to_string(1234);
print join('-', @words), "\n";   
# "sen-ni-hyaku-san-ju-yon"

EXPORT

ja2num(), num2ja(), num2ja_ordinal(), ja_to_number(), number_to_ja(), number_to_ja_ordinal()

BUGS

bignum vs. Lingua::JA::Numbers

Because of overload, The OO approach does not go well with bignum, despite the fact this module uses it internally

Jo, or 10**24

The chacracter Jo (U+25771) which represents ten to twenty-four does not have a code point in BMP so it is represented in two letters that look like one (U+79be U+x4e88)

SEE ALSO

Lingua::En::Numbers Lingua::En::Number http://ja.wikipedia.org/wiki/%E6%BC%A2%E6%95%B0%E5%AD%97

AUTHOR

Dan Kogai, <dankogai@dan.co.jp>

COPYRIGHT AND LICENSE

Copyright (C) 2005 by Dan Kogai

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.8.7 or, at your option, any later version of Perl 5 you may have available.