NAME
Lingua::TR::ASCII - (De)asciify Turkish texts.
SYNOPSIS
use Lingua::TR::ASCII;
print ascii_to_turkish(
'Acimasizca acelya gorunen bir sacmaliktansa acilip sacilmak...'
);
DESCRIPTION
This document describes version 0.12
of Lingua::TR::ASCII
released on 9 July 2012
.
If you try to write Turkish with a non-Turkish keyboard (assuming you can't change the layout or can't touch-type) this'll result with the ascii-fication of the Turkish characters and this actually results with bogus text since the text you wrote is not literally Turkish anymore (although the Turkish speaking people and search engines will most likely understand it). And in some cases, ascii-fication of some sentences might result with funny words. This module tries to mitigate this problem with a wrapper around a pre-compiled decison list.
The original creator of the decision list states that it was "created based on 1 million words of Turkish news text using the GPA algorithm". See the links below for more information.
This module is based on the previous Python and Ruby implementations.
FUNCTIONS
ascii_to_turkish STRING
Converts (corrects) the supplied string into Turkish.
turkish_to_ascii STRING
Converts the supplied STRING
into an ascii equivalent. This function is a wrapper around Text::Unidecode.
SEE ALSO
Lingua::DE::ASCII, Text::Unidecode, http://ileriseviye.org/blog/?tag=turkish-deasciifier, http://www.denizyuret.com/2006/11/emacs-turkish-mode.html.
OTHER IMPLEMENTATIONS
Languages
- Java
- JavaScript
- Python
- Ruby
Tools
AUTHOR
Burak Gursoy <burak@cpan.org>.
COPYRIGHT
Copyright 2011 - 2012 Burak Gursoy. All rights reserved.
LICENSE
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.12.3 or, at your option, any later version of Perl 5 you may have available.