Changes for version 0.38

  • add support for wildcard at start of term in addition to end of term.
  • added Windows-1252 (cp1252) encoding helpers.
  • added Encoding::FixLatin as a dependency.
  • fix off-by-one errors in find_bad_*_report and find_bad_* UTF8 functions.
  • add debug_bytes() to UTF8 class.

Modules

high-performance tools for building search applications
locate the best matches in a snippet extract
highlight terms in text
(**DEPRECATED**) extract keywords from a search query
Class::Accessor::Fast-compatible accessors
base class for Search::Tools objects
objectified string for highlighting, snipping, etc.
convert string queries into objects
regular expressions for terms
(**DEPRECATED**) build regular expressions from search queries
(**DEPRECATED**) access regular expressions for a keyword
(**DEPRECATED**) access regular expressions for keywords
extract terms in context
offer spelling suggestions
a token object returned from a TokenList
a bunch of tokens from a Tokenizer
a bunch of tokens from a Tokenizer
mixin methods for TokenList and TokenListPP
a token object returned from a TokenList
split a string into meaningful tokens
transliterations of UTF-8 chars
UTF8 string wrangling
methods for playing nice with XML and HTML