NAME

dtatw-cids2local.perl - convert //c/@xml:id attributes to page-local encoding

SYNOPSIS

dtatw-cids2local.perl [OPTIONS] [XMLFILE(s)...]

Options:
 -help                  # this help message
 -output FILE           # specify output file (default='-' (STDOUT))
 -trace  TRACEFILE      # send trace output to file (default=none)
 -xmlns , -noxmlns      # do/don't prepend 'xml:' to output id attributes (default=don't)

OPTIONS AND ARGUMENTS

Not yet written.

DESCRIPTION

Converts //c/@xml:id attributes to page-local encoding.

New IDs are computed page-locally, where the page element associated with each //c is given by the XPath preceding::pb[1], abbreviated hereafter as $pb. The associated $pb supplies a (unique) prefix $pbid to all //c elements on the given page. The prefix is determined according to the following rules:

  1. If $pb has a @facs attribute, it is used to define $pbid by removing any prefix matching the regex /#?f?0*/ and prefixing a "p", e.g. the following //pb elements all map to $pbid="p42":

    <pb facs="42"/>
    <pb facs="#42"/>
    <pb facs="#f0042"/>
    <pb facs="f00042"/>
    <pb facs="000042"/>
  2. Otherwise, a global counter over all //pb elements is used (whose value is initialized to "0" (zero) before the initial //pb), prefixed by "pz".

                    <!-- before first page: $pbid="pz0" -->
    <pb />          <!-- first page, no @facs: $pbid="pz1" -->
    <pb />          <!-- second page, no @facs: $pbid="pz2" -->
    <pb facs="42"/> <!-- third page, with @facs: $pbid="p42" -->
    <pb />          <!-- fourth page: $pbid="pz4" -->

Finally, //c/@xml:id attributes are computed by a page-local counter $ci, to be of the form ${pbid}.c${ci}, e.g.:

<!-- before first page -->
<c xml:id="pz0.c1"/>
<c xml:id="pz0.c2"/>
<!-- ... -->

<!-- first page, with @facs -->
<pb facs="#f0042"/>
<c xml:id="p42.c1"/>
<c xml:id="p42.c2"/>
<!-- ... -->

<!-- second page, no @facs -->
<pb/>
<c xml:id="pz2.c1"/>
<c xml:id="pz2.c2"/>
<!-- ... -->

SEE ALSO

dta-tokwrap.perl(1), dtatw-add-c.perl(1), dtatw-rm-c.perl(1), ...

AUTHOR

Bryan Jurish <moocow@cpan.org>