NAME

HTML::TreeBuilder - Parser that builds an HTML syntax tree

SYNOPSIS

use HTML::TreeBuilder;

$h = HTML::TreeBuilder->new;
$h->parse($document);
#...

print $h->as_HTML;  # or any other HTML::Element method

DESCRIPTION

This is a parser that builds an HTML syntax tree. The parser object is itself the root of that tree.

Objects of this class inherit the methods of both HTML::Parser and HTML::Element. Once parsing has taken place, the object can be regarded as the syntax tree itself.
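The dual inheritance can be illustrated with a minimal sketch. The document string is made up for the example, and the call to eof() is an assumption taken from later releases of HTML::Parser, where it signals the end of the input; it may be unnecessary with the release described here.

```perl
use strict;
use warnings;
use HTML::TreeBuilder;

# Hypothetical input, used only for illustration.
my $tree = HTML::TreeBuilder->new;
$tree->parse('<html><head><title>Demo</title></head><body><p>Hello</p></body></html>');
$tree->eof;    # HTML::Parser method (assumed: needed in later releases)

# The same object answers to both parent classes.
print "is an HTML::Parser\n"  if $tree->isa('HTML::Parser');
print "is an HTML::Element\n" if $tree->isa('HTML::Element');

# HTML::Element methods work directly on the parser object,
# because the object is the root element of the tree.
print "root tag: ", $tree->tag, "\n";
```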

The following methods all control how parsing takes place. Each one sets an attribute; pass a true or false value as the argument to set it.

$p->implicit_tags

Setting this attribute to true instructs the parser to try to deduce implicit elements and implicit end tags. If it is false, you get a parse tree that just reflects the text as it stands, which might be useful for quick and dirty parsing. Default is true.

Implicit elements have the implicit() attribute set.
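As a rough sketch, the two settings can be compared by parsing the same fragment twice. The input string and variable names are invented for the example, the eof() call is assumed from later HTML::Parser releases, and the exact trees produced may differ between releases, so no output is shown.

```perl
use strict;
use warnings;
use HTML::TreeBuilder;

# Hypothetical fragment with no <html>, <head> or <body> tags
# and no </p> end tags.
my $html = '<title>Demo</title><p>First<p>Second';

my $literal = HTML::TreeBuilder->new;
$literal->implicit_tags(0);       # tree reflects the text as it stands
$literal->parse($html);
$literal->eof;                    # assumed: needed in later releases

my $lenient = HTML::TreeBuilder->new;   # implicit_tags defaults to true
$lenient->parse($html);
$lenient->eof;

print "implicit_tags off: ", $literal->as_HTML, "\n";
print "implicit_tags on:  ", $lenient->as_HTML, "\n";

# The root element was never written in the source, so with
# implicit_tags on it should carry the implicit() attribute.
print "root is implicit\n" if $lenient->implicit;
```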

$p->ignore_unknown

This attribute controls whether unknown tags are represented as elements in the parse tree or ignored. When it is true, unknown tags are ignored. Default is true.
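A small sketch of the difference follows. The tag name "blorp" is deliberately made up so that the parser cannot know it, and eof() is assumed from later HTML::Parser releases.

```perl
use strict;
use warnings;
use HTML::TreeBuilder;

# "blorp" is a hypothetical tag that no HTML version defines.
my $html = '<html><body><p>Text <blorp>inside</blorp> more</p></body></html>';

my $drop = HTML::TreeBuilder->new;      # ignore_unknown defaults to true
$drop->parse($html);
$drop->eof;                             # assumed: needed in later releases

my $keep = HTML::TreeBuilder->new;
$keep->ignore_unknown(0);               # keep unknown tags as elements
$keep->parse($html);
$keep->eof;

print "ignored: ", $drop->as_HTML, "\n";
print "kept:    ", $keep->as_HTML, "\n";
```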

$p->ignore_text

When this attribute is true, the text content of elements is not represented in the tree. This saves space if all you want is to examine the structure of the document. Default is false.
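For instance, a structure-only tree might be built as in the sketch below. The input is invented for the example, and eof() is assumed from later HTML::Parser releases.

```perl
use strict;
use warnings;
use HTML::TreeBuilder;

# Hypothetical input, used only for illustration.
my $html = '<html><body><h1>Heading</h1><p>Some long paragraph</p></body></html>';

my $outline = HTML::TreeBuilder->new;
$outline->ignore_text(1);     # keep the elements, drop their text
$outline->parse($html);
$outline->eof;                # assumed: needed in later releases

# Only the element structure remains in the serialized tree.
print $outline->as_HTML, "\n";
```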

$p->warn

When this attribute is true, the parser calls warn() with an appropriate message for each syntax error it detects. Default is false.
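One way to collect these messages, rather than letting them go to STDERR, is to install a __WARN__ handler, as in the sketch below. The malformed input is invented for the example, eof() is assumed from later HTML::Parser releases, and which constructs actually produce a message depends on the release, so the number of messages is not predicted here.

```perl
use strict;
use warnings;
use HTML::TreeBuilder;

# Capture everything passed to warn() in an array.
my @messages;
$SIG{__WARN__} = sub { push @messages, $_[0] };

my $tree = HTML::TreeBuilder->new;
$tree->warn(1);               # report syntax errors through warn()

# Hypothetical input with a stray </b> end tag.
$tree->parse('<html><body><p>Text</b></p></body></html>');
$tree->eof;                   # assumed: needed in later releases

print scalar(@messages), " message(s) captured\n";
print "  $_" for @messages;
```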

SEE ALSO

HTML::Parser, HTML::Element

COPYRIGHT

Copyright 1995-1996 Gisle Aas. All rights reserved.

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.

AUTHOR

Gisle Aas <aas@sn.no>