NAME
Convert::Moji - objects to convert alphabets
SYNOPSIS
# Examples of rot13 transformers:
use Convert::Moji;
# Using a table
my %rot13;
@rot13{('a'..'z')} = ('n'..'z','a'..'m');
my $rot13 = Convert::Moji->new (["table", \%rot13]);
# Using tr
my $rot13_1 = Convert::Moji->new (["tr", "a-z", "n-za-m"]);
# Using a callback
sub rot_13_sub { tr/a-z/n-za-m/; return $_ }
my $rot13_2 = Convert::Moji->new (["code", \&rot_13_sub]);
Then to do the actual conversion
my $out = $rot13->convert ("secret");
and now $out contains "frperg". You can also go backwards with
my $inverted = $rot13->invert ("frperg");
and now $inverted contains "secret".
DESCRIPTION
Convert::Moji creates objects which can be used to convert between different alphabets. It was originally designed to do the work for Lingua::JA::Moji, to convert between different forms of Japanese writing. It was split out of that module as a general-purpose converter for any alphabets.
new
Create the object. Arguments are a list of array references. The first element of each array is one of the conversion types listed below, optionally preceded by the flag "oneway", which marks the conversion as noninvertible.
You can also chain the converters together, with
my $does_something = Convert::Moji->new (["table", $mytable],
["tr", $left, $right]);
- table
After this comes one more argument, a reference to the hash containing the table. For example
my $conv = Convert::Moji->new (["table", \%crazyhash]);
The hash keys and values can be any length, so you can convert single characters into words, as in
my %crazyhash = ("a" => "apple", "b" => "banana");
and vice-versa if you wish. The conversion will be performed correctly regardless of the weirdness of your table.
- file
After this comes one more argument, the name of a file containing some information to convert into a hash table. The file format is space-separated pairs, no comments or blank lines allowed. If the file does not exist or cannot be opened, the module prints an error message, and returns the undefined value.
- code
After this comes one or two references to subroutines. The first subroutine is the conversion and the second one is the inversion routine. If you omit the second routine, it is equivalent to specifying "oneway".
- tr
After this come two arguments, the left and right hand sides of a "tr" expression, for example
Convert::Moji->new (["tr", "A-Z", "a-z"])
will convert upper case to lower case. Conversion performs the "tr" as given, and inversion performs it with the left and right hand sides swapped.
Conversions, via "convert", will be performed in the order of the arguments. Inversions will be performed in reverse order of the arguments, skipping uninvertibles.
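The ordering rule can be sketched in plain Perl without the module. This is only an illustration of "forwards in order, backwards in reverse order, skipping uninvertibles", not Convert::Moji's own code; the names @steps, chain_convert and chain_invert are hypothetical.

```perl
use strict;
use warnings;

# Hypothetical stand-in for a chain of converters. Each step has a
# "forward" routine and, unless it is one-way, a "backward" routine.
my @steps = (
    # One-way step: lower-casing cannot be undone, so no "backward".
    { forward  => sub { lc $_[0] } },
    # rot13 and its inverse.
    { forward  => sub { (my $s = $_[0]) =~ tr/a-z/n-za-m/; $s },
      backward => sub { (my $s = $_[0]) =~ tr/n-za-m/a-z/; $s } },
    # Reversing the string is its own inverse.
    { forward  => sub { scalar reverse $_[0] },
      backward => sub { scalar reverse $_[0] } },
);

# Apply every step in the order given.
sub chain_convert {
    my ($text) = @_;
    for my $step (@steps) {
        $text = $step->{forward}->($text);
    }
    return $text;
}

# Apply the steps in reverse order, skipping the one-way steps.
sub chain_invert {
    my ($text) = @_;
    for my $step (reverse @steps) {
        next unless $step->{backward};
        $text = $step->{backward}->($text);
    }
    return $text;
}

my $converted = chain_convert ("Secret");
my $inverted = chain_invert ($converted);
```

Because the lower-casing step is skipped on the way back, $inverted is "secret" rather than the original "Secret".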
Uninvertible operations
If your conversion doesn't actually go backwards, you can tell the module when you create the object using a keyword "oneway":
my $uninvertible = Convert::Moji->new (["oneway", "table", $mytable]);
Then $uninvertible->invert doesn't do anything. You can also selectively choose which operations of a list are invertible and which aren't, so that only the invertible ones do something.
Load from a file
Load a character conversion table from a file using
Convert::Moji->new (["file", $filename]);
In this case, each line of the file needs to contain a space-separated pair: the text to convert from, followed by the text to convert it to.
Bugs
This doesn't handle comments or blank lines in the file.
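As a rough illustration of the file format, the following sketch reads space-separated pairs into a hash using only core Perl. It is an assumption about how such a file might be parsed, not the module's actual reader; the contents in $contents are made up, and an in-memory filehandle is used so that no real file is needed.

```perl
use strict;
use warnings;

# Hypothetical file contents: one space-separated pair per line,
# with no comments or blank lines.
my $contents = "a apple\nb banana\n";

# Open the string as an in-memory file.
open my $in, '<', \$contents or die "Cannot open: $!";
my %table;
while (my $line = <$in>) {
    chomp $line;
    # Split into the "from" text and the "to" text.
    my ($from, $to) = split ' ', $line, 2;
    $table{$from} = $to;
}
close $in or die "Cannot close: $!";
```

The resulting %table could then be passed to "new" as a "table" conversion.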
convert
The convert method takes one argument, a scalar string, and converts it using the conversions which were specified in "new".
Bugs
- no "strict conversion"
Characters which cannot be converted are ignored (passed through unchanged). There should be a "strict" option to also validate the input.
invert
Inverts the input.
Takes two arguments. The first is the string to be inverted back through the conversion process. The second is the type of inversion to perform if the inversion is ambiguous. It can take one of the following values:
- first
If the inversion is ambiguous, it picks the first one it finds.
- random
If the inversion is ambiguous, it picks one at random.
- all
In this case you get an array reference back containing either strings where the inversion was unambiguous, or array references to arrays containing all possible strings. So it's a horrible mess.
- all_joined
Like "all" but you get a scalar with all the options in square brackets instead of lots of array references.
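To see where the ambiguity comes from, here is a small sketch in plain Perl which builds the reverse of a table whose values are not all distinct. The table is hypothetical and the code does not use the module. Which candidate the module treats as "first", and the exact output format of "all" and "all_joined", are not shown here.

```perl
use strict;
use warnings;

# Hypothetical table in which two keys share the value "k".
my %table = (c => 'k', k => 'k', s => 'z');

# Build the reverse mapping: each value points to the sorted list
# of keys which produce it.
my %inverse;
for my $key (sort keys %table) {
    push @{$inverse{$table{$key}}}, $key;
}

# "k" has two possible inversions; "z" has exactly one.
my @k_candidates = @{$inverse{k}};
my @z_candidates = @{$inverse{z}};
```

An inversion of "k" has to choose between "c" and "k", which is the choice that the second argument of "invert" controls.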
Bugs
- second argument not implemented fully
The second argument part is only implemented for hash table based conversions, and is very likely to be buggy even then.
FUNCTIONS
These functions are used by the module and may be useful to outside programs.
length_one
# Returns false:
length_one ('x', 'y', 'monkey');
# Returns true:
length_one ('x', 'y', 'm');
Returns true if every element of the array has a length equal to one, and false if any of them does not have length one.
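A minimal sketch of a function with the behaviour described, for illustration only. The name length_one_sketch is hypothetical, and the module's own implementation may differ.

```perl
use strict;
use warnings;

# Return 1 if every argument has length one, 0 otherwise.
sub length_one_sketch {
    for my $element (@_) {
        return 0 if length ($element) != 1;
    }
    return 1;
}

# False, because "monkey" is longer than one character.
my $mixed = length_one_sketch ('x', 'y', 'monkey');
# True, because every element is a single character.
my $single = length_one_sketch ('x', 'y', 'm');
```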
make_regex
my $regex = make_regex (qw/a b c de fgh/);
# $regex = "fgh|de|a|b|c";
Given a list of inputs, make a regular expression which matches any of the inputs, longest match first. Each of the elements of the list is quoted using quotemeta. The regular expression does not contain capturing parentheses. For example, to convert everything in string $x from the keys of %foo2bar to its values,
my %foo2bar = (mad => 'max', dangerous => 'trombone');
my $x = 'mad, bad, and dangerous to know';
my $regex = make_regex (keys %foo2bar);
$x =~ s/($regex)/$foo2bar{$1}/g;
# Now $x = "max, bad, and trombone to know".
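The following is a sketch of how such a regular expression could be built using only core Perl. The name make_regex_sketch is hypothetical. Sorting alphabetically among inputs of equal length is an assumption made here to get a predictable order, and the module's own function may order ties differently.

```perl
use strict;
use warnings;

# Build an alternation of the quoted inputs, longest first.
sub make_regex_sketch {
    my @inputs = @_;
    # Longest first; ties broken alphabetically (an assumption).
    my @sorted = sort { length ($b) <=> length ($a) || $a cmp $b } @inputs;
    # Quote each input so that regex metacharacters match literally.
    return join '|', map { quotemeta ($_) } @sorted;
}

my $regex = make_regex_sketch (qw/a b c de fgh/);
# $regex = "fgh|de|a|b|c";

# Collect every match in a test string.
my $x = 'abcdefgh';
my @matches = $x =~ /$regex/g;
# @matches = ('a', 'b', 'c', 'de', 'fgh');
```

Putting the longest alternatives first matters when one input is a prefix of another, since Perl tries alternatives from left to right.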
unambiguous
if (unambiguous (\%table)) {
}
Returns true if all of the values in %table are distinct, and false if any two of the values in %table are the same.
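A minimal sketch of such a check in plain Perl, for illustration only. The name unambiguous_sketch is hypothetical, and the module's own implementation may differ.

```perl
use strict;
use warnings;

# Return 1 if no two keys of the table share a value, 0 otherwise.
sub unambiguous_sketch {
    my ($table) = @_;
    my %seen;
    for my $value (values %$table) {
        # A value seen before means the table cannot be inverted uniquely.
        return 0 if $seen{$value}++;
    }
    return 1;
}

# True: "x" and "y" are distinct.
my $distinct = unambiguous_sketch ({a => 'x', b => 'y'});
# False: both keys map to "x".
my $clash = unambiguous_sketch ({a => 'x', b => 'x'});
```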
AUTHOR
Ben Bullock, <bkb@cpan.org>
COPYRIGHT & LICENSE
Copyright 2008-2012 Ben Bullock, all rights reserved.
This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.