NAME

Text::Darts - Perl interface to DARTS by Taku Kudoh

SYNOPSIS

use Text::Darts;
my @words = qw/ALGOL ANSI ARCO ARPA ARPANET ASCII/;
my %word   = map { $_ => lc $_ } @words;
my $td     = Text::Darts->new(@words);
my $newstr = $td->gsub("ARPANET is a net by ARPA", sub{ "<<$_[0]>>" });
# $newstr is now "<<ARPANET>> is a net by <<ARPA>>".
my $lstr   = $td->gsub("ARPANET is a net by ARPA", \%words);
# $Lstr is now "arpanet is a net by arpa".

# or
my $td     = Text::Darts->open("words.darts"); # make one with mkdarts
my $newstr = $td->gsub($str, sub{ 
   qq(<a href="http://dictionary.com/browse/$_[0]">$_[0]</a>)
}); # link'em all!

DESCRIPTION

Darts, or Double-ARray Trie System is a C++ Template Library by Taku Kudoh. This module makes use of Darts to implement global replace like below;

$str = s{ (foo|bar|baz) }{ "<<$1>>" }msgex;

The problem with regexp is that it is slow with alterations. Suppose you want to anchor all words that appear in /usr/share/dict/words with regexp. It would be impractical with regexp. This module makes it practical.

Even if you are using perl 5.10 or better which does TRIE optimization internally, you still have to compile the regexp everytime you run the script. So it is still more practical to use this module if your match is exact -- containing no quantifier.

Since Version 0.05, Text::Darts also accepts a hash reference instead of a code reference. In such cases gsub behaves as follows.

$str = s{ (foo|bar|baz) }{$replacement{$1}}msgx;

like s///ge vs s///g, this is less flexible but faster.

REQUIREMENT

Since Version 0.07, you no longer need to have Darts installed. This module now bundles darts.h which is needed to build this module. To get the most out of Darts, you still need to install Darts 0.32 or above.

http://chasen.org/~taku/software/darts/index.html (Japanese)

fetch http://chasen.org/~taku/software/darts/src/darts-0.32.tar.gz
tar zxvf darts-0.32.tar.gz
cd darts-0.32
configure
make
make check
sudo make install

EXPORT

None.

Functions

Text::Darts::DARTS_VERSION

returns DARTS version. Currently 0.32

SEE ALSO

http://chasen.org/~taku/software/darts/index.html (Japanese)

Regexp::Assemble

AUTHOR

Dan Kogai, <dankogai@dan.co.jp>

COPYRIGHT AND LICENSE

darts.h

Copyright (c) 2001-2008, Taku Kudo All rights reserved.

Redistribution and use in source and binary forms, with or without modification, are permitted provided that the following conditions are met:

* Redistributions of source code must retain the above
  copyright notice, this list of conditions and the
  following disclaimer.

* Redistributions in binary form must reproduce the above
  copyright notice, this list of conditions and the
  following disclaimer in the documentation and/or other
  materials provided with the distribution.

* Neither the name of the Nippon Telegraph and Telegraph Corporation
  nor the names of its contributors may be used to endorse or
  promote products derived from this software without specific
  prior written permission.

THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABI LITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT OWNER OR CONT RIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES ( INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR P ROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT , STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF TH IS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

anything else

Copyright (C) 2007-2009 by Dan Kogai

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.10.0 or, at your option, any later version of Perl 5 you may have available.