NAME
Perl6::Pod::Parser::AddIds - generate attribute id for elements
SYNOPSIS
#make filter with namespace for generated id
my $add_ids_ns = new Perl6::Pod::Parser::AddIds::
ns=>"namespace";
my $add_ids_ns_file = new Perl6::Pod::Parser::AddIds::
ns=>"test.pod";
#leave empty namespace
my $add_ids = new Perl6::Pod::Parser::AddIds::;
my $out = '';
my $to_mem = new Perl6::Pod::To::XML:: out_put => \$out;
#make pipe for process pod
my $p = create_pipe( 'Perl6::Pod::Parser', $add_ids_ns , $to_mem);
$p->parse( \$pod_text );
print $out
DESCRIPTION
Perl6::Pod::Parser::AddIds - add id attribute to processed pods elements.
my $add_ids = new Perl6::Pod::Parser::AddIds:: ns=>"namespace";
For Pod:
=begin pod
=head1 test
tst2
=end pod
XML is:
<pod pod:type='block'
xmlns:pod='http://perlcabal.org/syn/S26.html'>
<head1 pod:type='block' pod:id='namespace:test_tst2'>test
tst2
</head1>
</pod>
Added atribute pod:id :
pod:id='namespace:test_tst2'
_make_id($text[, $base_id])
Function will construct an element id string. Id string is composed of join (':', $base_id || $parser->{base_id} , $text)
, where $text
in most cases is the pod heading text.
The xml id string has strict format. Checkout "cleanup_id" function for specification.
_make_uniq_id($text)
Calls $parser->make_id($text)
and checks if such id was already generated. If so, generates new one by adding _i1 (or _i2, i3, ...) to the id string. Return value is new uniq id string.
_cleanup_id($id_string)
This function is used internally to remove/change any illegal characters from the elements id string. (see http://www.w3.org/TR/2000/REC-xml-20001006#NT-Name for the id string specification)
$id_string =~ s/<!\[CDATA\[(.+?)\]\]>/$1/g; # keep just inside of CDATA
$id_string =~ s/<.+?>//g; # remove tags
$id_string =~ s/^\s*//; # ltrim spaces
$id_string =~ s/\s*$//; # rtrim spaces
$id_string =~ tr{/ }{._}; # replace / with . and spaces with _
$id_string =~ s/[^\-_a-zA-Z0-9\.: ]//g; # closed set of characters allowed in id string
In the worst case when the $id_string
after clean up will not conform with the specification, warning will be printed out and random number with leading colon will be used.