Changes for version 0.33

  • switch default Snipper type to 'offset' to optimize for large target texts.
  • add Tokenizer->get_offsets() method in C/XS.
  • fix Snipper->show feature to work as the author expected it to. Do not return anything if no match.
  • refactor is_ascii C code and is_sentence_start() to return false if match on UPPER as opposed to Upper.

Modules

high-performance tools for building search applications
locate the best matches in a snippet extract
highlight terms in text
(**DEPRECATED**) extract keywords from a search query
Class::Accessor::Fast-compatible accessors
base class for Search::Tools objects
objectified string for highlighting, snipping, etc.
convert string queries into objects
regular expressions for terms
(**DEPRECATED**) build regular expressions from search queries
(**DEPRECATED**) access regular expressions for a keyword
(**DEPRECATED**) access regular expressions for keywords
extract terms in context
offer spelling suggestions
a token object returned from a TokenList
a bunch of tokens from a Tokenizer
a bunch of tokens from a Tokenizer
mixin methods for TokenList and TokenListPP
a token object returned from a TokenList
split a string into meaningful tokens
transliterations of UTF-8 chars
UTF8 string wrangling
methods for playing nice with XML and HTML