Changes for version 0.3 - 2002-04-17

  • specification of ids and column names in matches has been extended and simplified. id and cols now may take arrayrefs of things to match. colre is deprecated.
  • matches can be made using literals, subroutines and qr//-created regexps. the latter two are automatically recognized.
  • explicit exclusion and skipping of tables are now possible
  • improved docs (a bit)
  • decode_entities was being called too late in fix_texts, so Trim was less than effective.
  • added DecodeNBSP attribute to fix nbsp decoding issues.
  • callbacks for methods and classes can be turned off by assigning an undef value to the appropriate key in the table request.
  • the warn callback is also passed the line number now. this is an incompatible change in the API.
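
The automatic recognition of subroutines and qr// regexps mentioned above can be illustrated with Perl's built-in ref function. The following is only a minimal sketch of that general technique, not the module's actual code: the helper names matches and matches_any are hypothetical, and the sample specs and strings are invented for illustration.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of HTML::TableParser): apply a single
# match specification to a candidate string, choosing the comparison
# by the kind of value that was supplied.
sub matches {
    my ( $spec, $candidate ) = @_;
    my $type = ref $spec;

    # a code reference is called with the candidate as its argument
    return $spec->( $candidate ) ? 1 : 0 if $type eq 'CODE';

    # a qr// object reports its reference type as 'Regexp'
    return $candidate =~ $spec ? 1 : 0 if $type eq 'Regexp';

    # anything else is treated as a literal and compared exactly
    return $candidate eq $spec ? 1 : 0;
}

# Hypothetical helper: accept either one spec or an arrayref of specs;
# the candidate matches if any one of them matches.
sub matches_any {
    my ( $specs, $candidate ) = @_;
    my @specs = ref $specs eq 'ARRAY' ? @$specs : ( $specs );

    for my $spec ( @specs ) {
        return 1 if matches( $spec, $candidate );
    }
    return 0;
}

# Invented sample data: a literal, a regexp and a subroutine.
my @specs = ( 'Total', qr/^price/i, sub { length( $_[0] ) > 20 } );

print matches_any( \@specs, 'Price (USD)' ) ? "match\n" : "no match\n";
```

Because the kind of each spec is discovered at run time, a caller can mix literals, regexps and subroutines in one arrayref without flagging which is which.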

Modules

Extract data from an HTML table

Provides

in TableParser.pm