NAME
Text::Darts - Perl interface to DARTS by Taku Kudoh
SYNOPSIS
use Text::Darts;
my @words = qw/ALGOL ANSI ARCO ARPA ARPANET ASCII/;
my %word = map { $_ => lc $_ } @words;
my $td = Text::Darts->new(@words);
my $newstr = $td->gsub("ARPANET is a net by ARPA", sub{ "<<$_[0]>>" });
# $newstr is now "<<ARPANET>> is a net by <<ARPA>>".
my $lstr = $td->gsub("ARPANET is a net by ARPA", \%words);
# $Lstr is now "arpanet is a net by arpa".
# or
my $td = Text::Darts->open("words.darts"); # make one with mkdarts
my $newstr = $td->gsub($str, sub{
qq(<a href="http://dictionary.com/browse/$_[0]">$_[0]</a>)
}); # link'em all!
DESCRIPTION
Darts, or Double-ARray Trie System is a C++ Template Library by Taku Kudoh. This module makes use of Darts to implement global replace like below;
$str = s{ (foo|bar|baz) }{ "<<$1>>" }msgex;
The problem with regexp is that it is slow with alterations. Suppose you want to anchor all words that appear in /usr/share/dict/words with regexp. It would be impractical with regexp. This module makes it practical.
Even if you are using perl 5.10 or better which does TRIE optimization internally, you still have to compile the regexp everytime you run the script. So it is still more practical to use this module if your match is exact -- containing no quantifier.
Since Version 0.05, Text::Darts also accepts a hash reference instead of a code reference. In such cases gsub behaves as follows.
$str = s{ (foo|bar|baz) }{$replacement{$1}}msgx;
like s///ge
vs s///g
, this is less flexible but faster.
REQUIREMENT
Since Version 0.07, you no longer need to have Darts installed. This module now bundles darts.h which is needed to build this module. To get the most out of Darts, you still need to install Darts 0.32 or above.
http://chasen.org/~taku/software/darts/index.html (Japanese)
fetch http://chasen.org/~taku/software/darts/src/darts-0.32.tar.gz
tar zxvf darts-0.32.tar.gz
cd darts-0.32
configure
make
make check
sudo make install
EXPORT
None.
Functions
SEE ALSO
http://chasen.org/~taku/software/darts/index.html (Japanese)
AUTHOR
Dan Kogai, <dankogai@dan.co.jp>
COPYRIGHT AND LICENSE
- darts.h
-
Copyright (c) 2001-2008, Taku Kudo All rights reserved.
Redistribution and use in source and binary forms, with or without modification, are permitted provided that the following conditions are met:
* Redistributions of source code must retain the above copyright notice, this list of conditions and the following disclaimer. * Redistributions in binary form must reproduce the above copyright notice, this list of conditions and the following disclaimer in the documentation and/or other materials provided with the distribution. * Neither the name of the Nippon Telegraph and Telegraph Corporation nor the names of its contributors may be used to endorse or promote products derived from this software without specific prior written permission.
THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABI LITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT OWNER OR CONT RIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES ( INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR P ROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT , STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF TH IS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
- anything else
-
Copyright (C) 2007-2009 by Dan Kogai
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.10.0 or, at your option, any later version of Perl 5 you may have available.