NAME

DTA::CAB::Analyzer::DTAClean - Chain::DTA cleanup (prune sensitive and redundant data from document)

SYNOPSIS

##========================================================================
## PRELIMINARIES

use DTA::CAB::Analyzer::DTAClean;

##========================================================================
## Constructors etc.

$obj = CLASS_OR_OBJ->new(%args);

##========================================================================
## Methods: Analysis

$bool = $anl->doAnalyze(\%opts, $name);
$doc = $ach->analyzeClean($doc,\%opts);

DESCRIPTION

DTA::CAB::Analyzer::DTAClean provides a DTA::CAB::Analyzer class for removing temporary internal data from documents processed with a DTA::CAB::Chain::DTA analyzer.

Constructors etc.

new
$obj = CLASS_OR_OBJ->new(%args);

%$obj, %args:

label => $label,     ##-- default='clean'
forceClean => $bool, ##-- always run analyzeClean() regardless of user options? (also checked in analyzeClean() itself)

Methods: Analysis

doAnalyze
$bool = $anl->doAnalyze(\%opts, $name);

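Alias for $anl->can("analyze${name}") && (!exists($opts{"doAnalyze${name}"}) || $opts{"doAnalyze${name}"}). The override in this class also checks the $anl->{forceClean} flag, so that analyzeClean() runs regardless of user options when the flag is set.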
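
The decision logic described above can be sketched in plain Perl. This is a minimal illustration, not the module's actual source: the package name My::CleanSketch is hypothetical, and the exact point at which the real override consults forceClean is an assumption (here the flag only bypasses the user-option check, after the can() test).

```perl
use strict;
use warnings;

## Hypothetical stand-in class for illustration; NOT the real DTA::CAB::Analyzer::DTAClean.
package My::CleanSketch;

sub new {
  my ($that, %args) = @_;
  ##-- defaults mirror the documented keys: label, forceClean
  return bless({ label => 'clean', forceClean => 0, %args }, ref($that) || $that);
}

##-- dummy method so that can("analyzeClean") succeeds
sub analyzeClean { return $_[1]; }

sub doAnalyze {
  my ($anl, $opts, $name) = @_;
  return 0 if (!$anl->can("analyze${name}"));   ##-- no such analysis method
  return 1 if ($anl->{forceClean});             ##-- assumed: flag bypasses user options
  return (!exists($opts->{"doAnalyze${name}"}) || $opts->{"doAnalyze${name}"}) ? 1 : 0;
}

package main;

my $anl    = My::CleanSketch->new();
my $forced = My::CleanSketch->new(forceClean => 1);

print $anl->doAnalyze({}, 'Clean'), "\n";                       ##-- prints 1 (no user option given)
print $anl->doAnalyze({doAnalyzeClean => 0}, 'Clean'), "\n";    ##-- prints 0 (user disabled it)
print $forced->doAnalyze({doAnalyzeClean => 0}, 'Clean'), "\n"; ##-- prints 1 (forceClean wins)
```

With forceClean set, a user-supplied doAnalyzeClean => 0 no longer suppresses the cleanup step, which matches the intent of the forceClean option documented under new.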

analyzeClean
$doc = $ach->analyzeClean($doc,\%opts);

Clean up any temporary data associated with $doc. The override in this class removes all but the following keys from each token in $doc:

text
xlit
mlatin
eqpho
eqrw
eqlemma
moot

Additionally, the 'analyses' key of the 'moot' field is removed if present.
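
The pruning rule above can be sketched on a single token as follows. This is an illustration under stated assumptions rather than the module's implementation: prune_token() is a hypothetical helper, the token is assumed to be a plain hash reference, and the sample field values (including the morph and tmp keys) are invented for the example.

```perl
use strict;
use warnings;

## Keys to keep, copied from the list above.
my %KEEP = map { ($_ => 1) } qw(text xlit mlatin eqpho eqrw eqlemma moot);

## prune_token(\%tok): hypothetical helper, not part of DTA::CAB.
## Deletes every key not in %KEEP, then drops moot.analyses if present.
sub prune_token {
  my $tok = shift;
  delete @$tok{ grep { !$KEEP{$_} } keys %$tok };
  delete $tok->{moot}{analyses} if (ref($tok->{moot}) eq 'HASH');
  return $tok;
}

## Invented sample token; real tokens may carry different fields.
my $tok = {
  text  => 'Theil',
  xlit  => { latin1Text => 'Theil' },
  morph => [ { hi => 'Teil[_NN]' } ],   ##-- temporary data: removed
  tmp   => 1,                           ##-- temporary data: removed
  moot  => { word => 'Teil', tag => 'NN', lemma => 'Teil', analyses => [] },
};

prune_token($tok);
print join(' ', sort keys %$tok), "\n";          ##-- prints: moot text xlit
print join(' ', sort keys %{$tok->{moot}}), "\n"; ##-- prints: lemma tag word
```

A whitelist is used instead of a list of keys to delete, so any field added by an upstream analyzer is dropped by default unless it is explicitly kept.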

AUTHOR

Bryan Jurish <moocow@cpan.org>

COPYRIGHT AND LICENSE

Copyright (C) 2011-2019 by Bryan Jurish

This package is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.24.1 or, at your option, any later version of Perl 5 you may have available.

SEE ALSO

dta-cab-analyze.perl(1), DTA::CAB::Chain::DTA(3pm), DTA::CAB::Analyzer(3pm), DTA::CAB::Chain(3pm), DTA::CAB(3pm), perl(1), ...
