NAME

XML::Rabbit - Consume XML with Moose and xpath queries

VERSION

version 0.0.4

SYNOPSIS

my $xhtml = W3C::XHTML->new( file => 'index.xhtml' );
print "Title: " . $xhtml->title . "\n";
print "First image source: " . $xhtml->body->images->[0]->src . "\n";

exit;

package W3C::XHTML;
use Moose;
with 'XML::Rabbit::RootNode';

has '+namespace_map' => (
    default => sub { {
        "xhtml" => "http://www.w3.org/1999/xhtml"
    } },
);

has 'title' => (
    isa         => 'Str',
    traits      => [qw(XPathValue)],
    xpath_query => '/xhtml:html/xhtml:head/xhtml:title',
);

has 'body' => (
    isa         => 'W3C::XHTML::Body',
    traits      => [qw(XPathObject)],
    xpath_query => '/xhtml:html/xhtml:body',
);

has 'all_anchors_and_images' => (
    traits      => ['XPathObjectList'],
    xpath_query => '//xhtml:a|//xhtml:img',
    isa_map     => {
        'xhtml:a'   => 'W3C::XHTML::Anchor',
        'xhtml:img' => 'W3C::XHTML::Image',
    },
);

no Moose;
__PACKAGE__->meta->make_immutable();

package W3C::XHTML::Body;
use Moose;
with 'XML::Rabbit::Node';

has 'images' => (
    isa         => 'ArrayRef[W3C::XHTML::Image]',
    traits      => [qw(XPathObjectList)],
    xpath_query => './/xhtml:img',
);

no Moose;
__PACKAGE__->meta->make_immutable();

package W3C::XHTML::Image;
use Moose;
with 'XML::Rabbit::Node';

has 'src' => (
    isa         => 'Str',
    traits      => [qw(XPathValue)],
    xpath_query => './@src',
);

has 'alt' => (
    isa         => 'Str',
    traits      => [qw(XPathValue)],
    xpath_query => './@alt',
);

has 'title' => (
    isa         => 'Str',
    traits      => [qw(XPathValue)],
    xpath_query => './@title',
);

no Moose;
__PACKAGE__->meta->make_immutable();

package W3C::XHTML::Anchor;
use Moose;
with 'XML::Rabbit::Node';

has 'href' => (
    isa         => 'Str',
    traits      => [qw(XPathValue)],
    xpath_query => './@href',
);

has 'title' => (
    isa         => 'Str',
    traits      => [qw(XPathValue)],
    xpath_query => './@title',
);

no Moose;
__PACKAGE__->meta->make_immutable();

1;

DESCRIPTION

XML::Rabbit is a Moose-based base class for building simple XPath-based XML extractors. Each attribute in your class is linked to an XPath query, which is executed against your XML document when you request the attribute's value.

Note that if you specify an xpath_query that can return multiple types, you must specify isa_map rather than listing the types as a union type constraint in isa. If you specify isa_map, do not specify isa as well, because the trait will override it. The trait wraps the type constraint union in an ArrayRef if the trait name is XPathObjectList, and in a HashRef if the trait name is XPathObjectMap. Because all traits whose names end in List return array references, their isa must be an ArrayRef. The same applies to the *Map traits, except that they return a HashRef instead of an ArrayRef.

The namespace prefix used in isa_map MUST be specified in the namespace_map. If a prefix is used in isa_map without a corresponding entry in namespace_map an exception will be thrown.
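As a rough sketch of how a mixed list built with isa_map might be consumed, the following loop walks the all_anchors_and_images attribute from the SYNOPSIS and branches on the class of each object. It assumes the W3C::XHTML classes from the SYNOPSIS are already loaded and that a file named index.xhtml exists; the output format is purely illustrative.

```perl
use strict;
use warnings;

# Assumption: the W3C::XHTML, W3C::XHTML::Anchor and W3C::XHTML::Image
# classes from the SYNOPSIS are loaded, and index.xhtml is readable.
my $xhtml = W3C::XHTML->new( file => 'index.xhtml' );

# Each element is an instance of the class that isa_map associates
# with the element name matched by the XPath query.
foreach my $node ( @{ $xhtml->all_anchors_and_images } ) {
    if ( $node->isa('W3C::XHTML::Anchor') ) {
        print "Link: " . $node->href . "\n";
    }
    elsif ( $node->isa('W3C::XHTML::Image') ) {
        print "Image: " . $node->src . "\n";
    }
}
```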

CAVEATS

Be aware of the syntax of XPath when used with namespaces. You should almost always define namespace_map when dealing with XML that uses namespaces. Namespaces explicitly declared in the XML are usable with the prefix specified in the XML (except if you use isa_map). Be aware that a prefix must usually be declared for the default namespace (xmlns=...) to be able to use it in XPath queries. See the example above (on XHTML) for details. See "findnodes" in XML::LibXML::Node for more information.
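The default-namespace pitfall can be reproduced with XML::LibXML alone. The sketch below is not XML::Rabbit's internal code; it only illustrates why a prefix has to be bound before elements in a default namespace can be matched. The inline document and the variable names are made up for the example.

```perl
use strict;
use warnings;
use XML::LibXML;
use XML::LibXML::XPathContext;

# A tiny XHTML-like document that declares only a default namespace.
my $xml = <<'XML';
<html xmlns="http://www.w3.org/1999/xhtml">
  <head><title>Hello</title></head>
  <body/>
</html>
XML

my $doc = XML::LibXML->load_xml( string => $xml );

# In XPath 1.0 an unprefixed name matches only elements in no namespace,
# so this query finds nothing in a namespaced document.
my @unprefixed = $doc->findnodes('/html/head/title');

# Bind a prefix of our own choosing to the namespace URI, then query with it.
my $xpc = XML::LibXML::XPathContext->new($doc);
$xpc->registerNs( xhtml => 'http://www.w3.org/1999/xhtml' );
my $title = $xpc->findvalue('/xhtml:html/xhtml:head/xhtml:title');

printf "unprefixed matches: %d\n", scalar @unprefixed;
printf "prefixed title: %s\n", $title;
```

The prefix used in the query only has to match the one registered in the XPath context; it does not have to match anything in the XML itself.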

Because XML::Rabbit uses XML::LibXML's DOM parser it is limited to handling XML documents that can fit in available memory. Unfortunately there is no easy way around this, because XPath queries need to work on a tree model, and I am not aware of any way of doing that without keeping the document in memory. Luckily XML::LibXML's DOM implementation is written in C, so it should use much less memory than a pure Perl DOM parser.

SEMANTIC VERSIONING

This module uses semantic versioning concepts from http://semver.org/.

ACKNOWLEDGEMENTS

The following people have helped to review or otherwise encourage me to work on this module.

Chris Prather (perigrin)

Matt S. Trout (mst)

Stevan Little (stevan)

SUPPORT

You can find documentation for this module with the perldoc command.

perldoc XML::Rabbit

Websites

Search CPAN

http://search.cpan.org/dist/XML-Rabbit

AnnoCPAN: Annotated CPAN documentation

http://annocpan.org/dist/XML-Rabbit

CPAN Ratings

http://cpanratings.perl.org/d/XML-Rabbit

CPAN Forum

http://cpanforum.com/dist/XML-Rabbit

RT: CPAN's Bug Tracker

http://rt.cpan.org/NoAuth/Bugs.html?Dist=XML-Rabbit

CPANTS Kwalitee

http://cpants.perl.org/dist/overview/XML-Rabbit

CPAN Testers Results

http://cpantesters.org/distro/X/XML-Rabbit.html

CPAN Testers Matrix

http://matrix.cpantesters.org/?dist=XML-Rabbit

Source Code Repository

The code is open to the world, and available for you to hack on. Please feel free to browse it and play with it, or whatever. If you want to contribute patches, please send me a diff or prod me to pull from your repository :)

git://github.com/robinsmidsrod/XML-Rabbit.git

Bugs

Please report any bugs or feature requests to bug-xml-rabbit at rt.cpan.org, or through the web interface at http://rt.cpan.org/NoAuth/ReportBug.html?Queue=XML-Rabbit. I will be notified, and then you'll automatically be notified of progress on your bug as I make changes.

AUTHOR

Robin Smidsrød robin@smidsrod.no

COPYRIGHT AND LICENSE

This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.
