NAME

Lingua::Translate::Google 0.01 - Translation back-end for Google's beta translation service.

SYNOPSIS

use Lingua::Translate;

Lingua::Translate::config
    (
      backend => 'Google',
      api_key => 'YoUrApIkEy',
      referer => 'http://your.domain.tld/yourdir/',
      ua      => LWP::UserAgent->new(),
    );

my $xl8r = Lingua::Translate->new(src => 'de', dest => 'en');

# prints 'My hovercraft is full of eels'
print $xl8r->translate('Mein Luftkissenfahrzeug ist voller Aale');

DESCRIPTION

Lingua::Translate::Google is a translation back-end for Lingua::Translate that contacts Google (http://ajax.googleapis.com/ajax/services/language/translate/) to do the real work.

It is normally invoked by Lingua::Translate; there should be no need to call it directly. If you do call it directly, you will lose the ability to easily switch your programs over to alternate back-ends that are later produced.

If you omit the API key config option, then this module uses the fallback service at translate.google.com/ which works fine without it.

To use the Google APIs, Google asks that you obtain an API key, and that you always include a valid and accurate referer URL. If you supply an API key, then this module uses the AJAX API which Google provides specifically for third party application development.

One way or another, by using Google services (either directly or via this module) you are agreeing by their terms of service.

Please read:

http://www.google.com/accounts/TOS

To obtain your own API key:

http://code.google.com/apis/ajaxfeeds/signup.html

CONSTRUCTOR

new(src => $lang, dest => lang)

    Creates a new translation handle. Determines whether the requested language pair is available and will croak if not.

paramters

src

Source language, in RFC-3066 form. See I18N::LangTags for a discussion of RFC-3066 language tags.

dest

Destination Language

Other options that may be passed to the config() function (see below) may also be passed as arguments to this constructor.

METHODS

The following methods may be called on Lingua::Translate::Google objects.

available() : @list

    Returns a list of available language pairs, in the form of 'XX_YY', where XX is the source language and YY is the destination. If you want the english name of a language tag, call I18N::LangTags::List::name() on it. See I18N::LangTags::List.

    This method contacts Google (http://translate.google.com/translate_a/t?) to calculate available language pairs. The data about available language pairs is cached in memory.

translate($text) : $translated

    Translates the given text, or die's on any kind of error.

    It is assumed that the $text coming in is UTF-8 encoded, and that Google will be returning UTF-8 encoded text. In the case that Google returns some other encoding, then an attempt to convert the result to UTF-8 is made with Unicode::MapUTF8::to_utf8. Observation has indicated that the fallback service (at /translate_a/t) is inclined to return windows-1255 encoded text, despite the value of the 'Accept-Charset' header sent in the request. However, a non-windows user agent string seems to remedy this.

    Also, the primary service (at googleapis.com) returns JSON which assumes the client is JavaScript running with an HTML document. This being the case strings are double encoded. First special characters are converted to HTML entities, and then the ampersands are converted to unicode escape sequences. For example, the string "Harold's" is encoded as "Harold\u0027#39;s". The translate function attempts to return plain old UTF-8 encoded strings without any entities or escape sequences.

agent() : LWP::UserAgent

    Returns the LWP::UserAgent object used to contact Google.

CONFIGURATION FUNCTIONS

config( option => $value, )

This function sets defaults for use when constructing objects. Options include:

google_uri

The uri to use when contacting Google.

The default value is

"http://ajax.googleapis.com/ajax/services/language/translate?" v=1.0 &q=hello%20world &langpair=en%7Cit &key=yourapikey

For details see: http://code.google.com/apis/ajaxlanguage/documentation/#fonje

Another possibility is:

"http://www.google.com/uds/Gtranslate?" v=1.0 &q=hello%20world &langpair=en%7Czh-TW &callback=google.language.callbacks.id101 &context=22 &key=notsupplied &key=yourapikey

google_fallback_uri

The URI used when contacting Google, and no api_key is provided.

The default value is

"http://translate.google.com/translate_a/t?" client=t &text=hello%20world &sl=en &tl=zh-Tw

Note, Google states clearly that they want you to obtain and use an API key, and also include a valid and accurate referer URL.

agent

The User-Agent string to use when contacting Google.

The default value is "Lingua::Translate::Google/", plus the version number of the package.

chunk_size

The size to break chunks into before handing them off to Google. The default value is "1000" (bytes).

retries

The number of times to retry contacting Google if the first attempt fails. The default value is "2".

BUGS/TODO

The chunk_size attribute is a hold-over from the Babelfish algorithm. It is TBD as to what chunk size ought to be set for Google.

SEE ALSO

Lingua::Translate, Lingua::Translate::Babelfish, LWP::UserAgent, Unicode::MapUTF8

ACKNOWLEDGEMENTS

This is a rewritten copy of Lingua::Translate::Babelfish by Sam Vilain <enki@snowcra.sh>.

AUTHOR

Dylan Doxey, <dylan.doxey@gmail.com>

7 POD Errors

The following errors were encountered while parsing the POD:

Around line 430:

You forgot a '=back' before '=head2'

You forgot a '=back' before '=head2'

Around line 434:

=back without =over

Around line 438:

You forgot a '=back' before '=head2'

Around line 442:

=back without =over

Around line 446:

=back without =over

Around line 490:

'=item' outside of any '=over'

Around line 544:

You forgot a '=back' before '=head1'