NAME

DTA::CAB::Analyzer::Common - common analyzers for DTA::CAB suite

SYNOPSIS

use DTA::CAB::Analyzer::Common;

$anl = $ANALYZER_CLASS->new(%args);
$anl->analyzeDocument($doc,%analyzeOptions);
# ... etc.

DESCRIPTION

The DTA::CAB::Analyzer::Common package just includes some default analyzer classes used by the rest of the DTA::CAB suite, namely:

DTA::CAB::Analyzer

Abstract base class for analyzer objects.

DTA::CAB::Analyzer::Automaton

Generic API for finite-state automaton analyzers.

DTA::CAB::Analyzer::Automaton::Gfsm

Finite-state analyzer base class using Gfsm for low-level automaton operations (lookup).

DTA::CAB::Analyzer::Automaton::Gfsm::XL

Finite-state analyzer base class using Gfsm::XL for low-level automaton operations (k-best cascade lookup).

DTA::CAB::Analyzer::Dict

Full-form dictionary-based analyzer (aka "cache") using a flat hash.

DTA::CAB::Analyzer::Dict::BDB

Full-form dictionary-based analyzer (aka "cache") using Berkeley DB.

DTA::CAB::Analyzer::Dict::CDB

Full-form dictionary-based analyzer (aka "cache") using CDB.

DTA::CAB::Analyzer::EqLemma

Lemma-equivalence expander (wrapper).

DTA::CAB::Analyzer::EqPho

Phonetic equivalence-class expander (wrapper).

DTA::CAB::Analyzer::EqRW

Rewrite equivalence class expander (wrapper).

DTA::CAB::Analyzer::Lemmatizer

Lemma extractor for TAGH morphological analyses.

DTA::CAB::Analyzer::LTS

Letter-To-Sound (phonetic) analysis via Gfsm automaton lookup.

DTA::CAB::Analyzer::Moot

Hidden Markov Model Viterbi decoder using libmoot.

DTA::CAB::Analyzer::Moot::DynLex

Dynamic-Lexicon Hidden Markov Model Viterbi decoder using libmoot.

DTA::CAB::Analyzer::Morph

Morphological analysis via Gfsm automaton lookup.

DTA::CAB::Analyzer::Morph::Latin

Latin pesudo-morphological analyzer (wrapper).

DTA::CAB::Analyzer::MorphSafe

Safety heuristics for analyses output by TAGH via DTA::CAB::Analyzer::Morph.

DTA::CAB::Analyzer::Null

Null analyzer, for testing purposes.

DTA::CAB::Analyzer::Rewrite

Error-correction (rewrite) analyzer using a Gfsm::XL cascade.

DTA::CAB::Analyzer::RewriteSub

Sub-analyzer for rewrite output.

DTA::CAB::Analyzer::TokPP

Type-level heuristic token preprocessor (for punctuation etc)

DTA::CAB::Analyzer::Unicruft

Transliterator for latin-1 approximation using libunicruft.

DTA::CAB::Chain

Analyzer chains (aka "pipelines").

DTA::CAB::Chain::Multi

Analyzer multi-chains (collection of named pipelines).

AUTHOR

Bryan Jurish <moocow@cpan.org>

COPYRIGHT AND LICENSE

Copyright (C) 2008-2019 by Bryan Jurish

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.24.1 or, at your option, any later version of Perl 5 you may have available.

SEE ALSO

dta-cab-analyze.perl(1), DTA::CAB::Analyzer(3pm), DTA::CAB::Chain(3pm), DTA::CAB(3pm), perl(1), ...