NAME
Lingua::TR::ASCII
VERSION
version 0.16
SYNOPSIS
use Lingua::TR::ASCII;
print ascii_to_turkish(
'Acimasizca acelya gorunen bir sacmaliktansa acilip sacilmak...'
);
DESCRIPTION
If you try to write Turkish with a non-Turkish keyboard (assuming you can't change the layout or can't touch-type) this will result with the ascii-fication
of the Turkish characters and this actually results with bogus text since the text you wrote is not literally Turkish anymore (although the Turkish speaking people and search engines will most likely understand it). And in some cases, ascii-fication
of some sentences might result with funny words. This module tries to mitigate this problem with a wrapper around a pre-compiled
decision list.
The original creator of the decision list states that it was "created based on 1 million words of Turkish news text using the GPA algorithm". See the links below for more information.
This module is based on the previous Python and Ruby implementations.
NAME
Lingua::TR::ASCII - (De)asciify Turkish texts.
FUNCTIONS
ascii_to_turkish STRING
Converts (corrects) the supplied string into Turkish.
turkish_to_ascii STRING
Converts the supplied STRING
into an ascii
equivalent. This function is a wrapper around Text::Unidecode.
SEE ALSO
Lingua::DE::ASCII, Text::Unidecode, http://ileriseviye.org/blog/?tag=turkish-deasciifier, http://www.denizyuret.com/2006/11/emacs-turkish-mode.html.
OTHER IMPLEMENTATIONS
Languages
- Java
- JavaScript
- Python
- Ruby
Tools
AUTHOR
Burak Gursoy <burak@cpan.org>
COPYRIGHT AND LICENSE
This software is copyright (c) 2011 by Burak Gursoy.
This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.