NAME
Text::Sentence - module for splitting text into sentences
SYNOPSIS
use Text::Sentence;
use locale;
use POSIX qw( locale_h );
setlocale( LC_CTYPE, 'iso_8859_1' );
@sentences = split_sentences( $text );
DESCRIPTION
The Text::Sentence
module contains the function split_sentences, which splits text into its constituent sentences, based on a fairly approximate regex. If you set the locale before calling it, it will deal correctly with locale dependant capitalization to identify sentence boundaries. Certain well know exceptions, such as abreviations, may cause incorrect segmentations.
SEE ALSO
AUTHOR
Ave Wrigley <wrigley@cre.canon.co.uk>
COPYRIGHT
Copyright (c) 1997 Canon Research Centre Europe (CRE). All rights reserved. This script and any associated documentation or files cannot be distributed outside of CRE without express prior permission from CRE.