NAME

Lingua::AR::Tashkeel - Subroutines for handling Arabic vowels and vowel marks

SYNOPSIS

use Lingua::AR::Tashkeel;

# Strip all short vowels
Lingua::AR::Tashkeel->strip('مَكَرُونَة'); # => مكرونة
# Heuristic for removing short vowels without causing ambiguity
Lingua::AR::Tashkeel->prune('فَتَّة');    # => فتّة
# Heuristic for fixing mixed-up short and long vowels
Lingua::AR::Tashkeel->fix('ماحشي');    # => مَحشي

DESCRIPTION

Subroutines for working with Arabic long vowels (حروف علة) and short vowels (حركات تشكيل).

METHODS AND ARGUMENTS

new()

Constructs a new Lingua::AR::Tashkeel instance.

strip($string)

Strips all Arabic short vowel marks (Tashkeel) from $string and returns the result.
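
As a rough illustration of what stripping involves, the sketch below removes the combining marks in the Unicode range U+064B to U+0652 with a single substitution. It is not the module's implementation: the helper name strip_tashkeel is hypothetical, and the assumption that "Tashkeel" covers exactly this range (which includes Tanween, Shadda and Sukun) may not match what strip() actually removes.

```perl
use strict;
use warnings;
use utf8;    # the source file contains Arabic string literals

# Hypothetical helper, not part of Lingua::AR::Tashkeel.
# Assumption: Tashkeel means the combining marks U+064B (Fathatan)
# through U+0652 (Sukun), which includes Shadda (U+0651).
sub strip_tashkeel {
    my ($text) = @_;
    $text =~ s/[\x{064B}-\x{0652}]//g;
    return $text;
}

binmode STDOUT, ':encoding(UTF-8)';
print strip_tashkeel('مَكَرُونَة'), "\n";    # prints مكرونة
```

The input must be a decoded character string; on raw UTF-8 bytes the character class would not match.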

prune($string)

Heuristic for pruning the short vowels that a native speaker wouldn't write, because leaving them out introduces no ambiguity.

This is often preferable to strip, because marks such as a Shadda, or a Damma that indicates a passive verb, are useful clues that one might want to keep.
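
To show the difference from a plain strip, here is a deliberately simplified sketch that drops the same marks but always keeps Shadda (U+0651). It is only an approximation under stated assumptions: the helper name prune_keep_shadda is hypothetical, and the module's actual heuristic (for example, how it decides which Dammas mark a passive verb) is not reproduced here.

```perl
use strict;
use warnings;
use utf8;    # the source file contains Arabic string literals

# Hypothetical helper, not part of Lingua::AR::Tashkeel.
# Simplification: remove U+064B..U+0650 and U+0652, keep Shadda (U+0651).
# The passive-verb Damma rule mentioned above is NOT handled here.
sub prune_keep_shadda {
    my ($text) = @_;
    $text =~ s/[\x{064B}-\x{0650}\x{0652}]//g;
    return $text;
}

binmode STDOUT, ':encoding(UTF-8)';
print prune_keep_shadda('فَتَّة'), "\n";    # prints فتّة
```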

fix($string)

Transliterating from a romanized representation of Arabic to actual Arabic script often yields incorrect results where short and long vowels are concerned.

This subroutine implements a heuristic for fixing such mix-ups.

REQUIREMENTS

Requires Perl v5.18 or later.

GIT REPOSITORY

http://github.com/a3f/Lingua-AR-Vowels

SEE ALSO

Perl

AUTHOR

Ahmad Fatoum <athreef@cpan.org>, http://a3f.at

COPYRIGHT AND LICENSE

Copyright (C) 2016 Ahmad Fatoum

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.