NAME
DTA::TokWrap::Processor::standoff - DTA tokenizer wrappers: t.xml -> (s.xml, w.xml, a.xml) via external filter programs
SYNOPSIS
use DTA::TokWrap::Processor::standoff;
$so = DTA::TokWrap::Processor::standoff->new(%opts);
$doc_or_undef = $CLASS_OR_OBJECT->sosxml($doc);
$doc_or_undef = $CLASS_OR_OBJECT->sowxml($doc);
$doc_or_undef = $CLASS_OR_OBJECT->soaxml($doc);
$doc_or_undef = $CLASS_OR_OBJECT->standoff($doc);
##-- backwards-compatibility
undef = $so->dump_t2s_stylesheet($filename_or_fh);
undef = $so->dump_t2w_stylesheet($filename_or_fh);
undef = $so->dump_t2a_stylesheet($filename_or_fh);
DESCRIPTION
Constants
- Variable: @ISA
-
DTA::TokWrap::Processor::standoff inherits from DTA::TokWrap::Processor.
Constructors etc.
- new
-
$so = $CLASS_OR_OBJECT->new(%args);
Constructor.
%args, %$so:
t2w => $path_to_dtatw_txml2wxml, ##-- default: search t2s => $path_to_dtatw_txml2sxml, ##-- default: search t2a => $path_to_dtatw_txml2axml, ##-- default: search inplace => $bool, ##-- prefer in-place programs for search?
- defaults
-
%defaults = CLASS->defaults();
Static class-dependent defaults.
- init
-
$so = $so->init();
Dynamic object-dependent defaults.
Methods: Backwards-compatibility
- _xsl
-
$so_xsl = $so->_xsl();
Return a DTA::TokWrap::Processor::standoff::xsl object which may or may not be logically equivalent to
$so
. - dump_t2s_stylesheet
-
undef = $so->dump_t2s_stylesheet($filename_or_fh);
See DTA::TokWrap::Processor::standoff::xsl::dump_t2s_stylesheet().
- dump_t2w_stylesheet
-
undef = $so->dump_t2w_stylesheet($filename_or_fh);
See DTA::TokWrap::Processor::standoff::xsl::dump_t2w_stylesheet().
- dump_t2a_stylesheet
-
undef = $so->dump_t2a_stylesheet($filename_or_fh);
See DTA::TokWrap::Processor::standoff::xsl::dump_t2a_stylesheet().
Methods: document processing
- soxml
-
$doc_or_undef = $CLASS_OR_OBJECT->soxml($doc,$X,$xmlbase);
Low-level generic standoff formatting method. Generate
$X
-level standoff for the DTA::TokWrap::Document object $doc.Relevant %$doc keys:
xtokdata => $xtokdata, ##-- (input) XML-ified tokenizer output data (string) so${X}file => $sosfile, ##-- (output) standoff file, refers to $xml_base ## so${X}xml_stamp0 => $f, ##-- (output) timestamp of operation begin so${X}xml_stamp => $f, ##-- (output) timestamp of operation end so${X}file_stamp => $f, ##-- (output) timestamp of operation end
- sosxml
-
$doc_or_undef = $CLASS_OR_OBJECT->sosxml($doc);
Just a wrapper for:
$so->soxml($doc,'s',basename($doc->{sowfile}));
- sowxml
-
$doc_or_undef = $CLASS_OR_OBJECT->sowxml($doc);
Just a wrapper for:
$so->soxml($doc,'w',$doc->{xmlbase});
- soaxml
-
$doc_or_undef = $CLASS_OR_OBJECT->soaxml($doc);
Just a wrapper for:
$so->soxml($doc,'a',basename($doc->{sowfile}));
SEE ALSO
DTA::TokWrap::Intro(3pm), dta-tokwrap.perl(1), ...
SEE ALSO
DTA::TokWrap::Intro(3pm), dta-tokwrap.perl(1), ...
AUTHOR
Bryan Jurish <moocow@cpan.org>
COPYRIGHT AND LICENSE
Copyright (C) 2009-2018 by Bryan Jurish
This package is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.14.2 or, at your option, any later version of Perl 5 you may have available.