NAME

DTA::TokWrap::Processor::standoff - DTA tokenizer wrappers: t.xml -> (s.xml, w.xml, a.xml) via external filter programs

SYNOPSIS

use DTA::TokWrap::Processor::standoff;

$so = DTA::TokWrap::Processor::standoff->new(%opts);
$doc_or_undef = $CLASS_OR_OBJECT->sosxml($doc);
$doc_or_undef = $CLASS_OR_OBJECT->sowxml($doc);
$doc_or_undef = $CLASS_OR_OBJECT->soaxml($doc);
$doc_or_undef = $CLASS_OR_OBJECT->standoff($doc);

##-- backwards-compatibility
undef = $so->dump_t2s_stylesheet($filename_or_fh);
undef = $so->dump_t2w_stylesheet($filename_or_fh);
undef = $so->dump_t2a_stylesheet($filename_or_fh);

DESCRIPTION

Constants

Variable: @ISA

DTA::TokWrap::Processor::standoff inherits from DTA::TokWrap::Processor.

Constructors etc.

new
$so = $CLASS_OR_OBJECT->new(%args);

Constructor.

%args, %$so:

t2w => $path_to_dtatw_txml2wxml, ##-- default: search
t2s => $path_to_dtatw_txml2sxml, ##-- default: search
t2a => $path_to_dtatw_txml2axml, ##-- default: search
inplace => $bool,                ##-- prefer in-place programs for search?
defaults
%defaults = CLASS->defaults();

Static class-dependent defaults.

init
$so = $so->init();

Dynamic object-dependent defaults.

Methods: Backwards-compatibility

_xsl
$so_xsl = $so->_xsl();

Return a DTA::TokWrap::Processor::standoff::xsl object which may or may not be logically equivalent to $so.

dump_t2s_stylesheet
undef = $so->dump_t2s_stylesheet($filename_or_fh);

See DTA::TokWrap::Processor::standoff::xsl::dump_t2s_stylesheet().

dump_t2w_stylesheet
undef = $so->dump_t2w_stylesheet($filename_or_fh);

See DTA::TokWrap::Processor::standoff::xsl::dump_t2w_stylesheet().

dump_t2a_stylesheet
undef = $so->dump_t2a_stylesheet($filename_or_fh);

See DTA::TokWrap::Processor::standoff::xsl::dump_t2a_stylesheet().

Methods: document processing

soxml
$doc_or_undef = $CLASS_OR_OBJECT->soxml($doc,$X,$xmlbase);

Low-level generic standoff formatting method. Generate $X-level standoff for the DTA::TokWrap::Document object $doc.

Relevant %$doc keys:

xtokdata    => $xtokdata, ##-- (input) XML-ified tokenizer output data (string)
so${X}file  => $sosfile,  ##-- (output) standoff file, refers to $xml_base
##
so${X}xml_stamp0 => $f,   ##-- (output) timestamp of operation begin
so${X}xml_stamp  => $f,   ##-- (output) timestamp of operation end
so${X}file_stamp => $f,   ##-- (output) timestamp of operation end
sosxml
$doc_or_undef = $CLASS_OR_OBJECT->sosxml($doc);

Just a wrapper for:

$so->soxml($doc,'s',basename($doc->{sowfile}));
sowxml
$doc_or_undef = $CLASS_OR_OBJECT->sowxml($doc);

Just a wrapper for:

$so->soxml($doc,'w',$doc->{xmlbase});
soaxml
$doc_or_undef = $CLASS_OR_OBJECT->soaxml($doc);

Just a wrapper for:

$so->soxml($doc,'a',basename($doc->{sowfile}));

SEE ALSO

DTA::TokWrap::Intro(3pm), dta-tokwrap.perl(1), ...

SEE ALSO

DTA::TokWrap::Intro(3pm), dta-tokwrap.perl(1), ...

AUTHOR

Bryan Jurish <moocow@cpan.org>

COPYRIGHT AND LICENSE

Copyright (C) 2009-2018 by Bryan Jurish

This package is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.14.2 or, at your option, any later version of Perl 5 you may have available.