NAME
Text::Amuse::Preprocessor::Typography - Perl extension for pre-processing of Text::Amuse files
SYNOPSIS
use Text::Amuse::Preprocessor::Typography qw/typography_filter/;
my $cleanedtext = typography_filter($lang, $text)
DESCRIPTION
Common routines to filter the input files, fixing typography and language-specific rules. All the text is assumed to be already decoded.
FUNCTIONS
linkify_filter($string)
Detect and replace the bare links with the proper markup, as [[http://domain.org/my/url/and_params?a=1&b=c][domain.org]]
It's a bit opinionated to hide the full url and show only the domain. Anyway, it's a preprocessing filter and the most important thing is not to loose pieces. And we don't, because the full url is still there. Anyway, long urls are a pain to display and to typeset, so the domain is a sensible choise. The user can anyway change this. It's just an helper to avoid boring tasks, nothing more.
Returns the adjusted string.
typography_filter($lang, $string)
Perform the smart replacement of single quotes, double quotes, dashes and, in some cases, the superscript for things like 2nd, 13th, etc.
The languages supported are en
, fi
, hr
, sr
, ru
, es
.
Returns the adjusted string.
get_typography_filter($lang, $links)
Return a sub which you can call later on a string. The sub will first call the common replacements (ugly unicode ligatures). If the first argument is set and is a valid language, will do the language specific replacements. If the second argument is set and true, will also fix the links.
The sub itself will return the adjusted string.