NAME
DTA::CAB::Analyzer::Common - common analyzers for DTA::CAB suite
SYNOPSIS
use DTA::CAB::Analyzer::Common;
$anl = $ANALYZER_CLASS->new(%args);
$anl->analyzeDocument($doc,%analyzeOptions);
# ... etc.
DESCRIPTION
The DTA::CAB::Analyzer::Common package simply includes a set of default analyzer classes used by the rest of the DTA::CAB suite, namely:
- DTA::CAB::Analyzer
-
Abstract base class for analyzer objects.
- DTA::CAB::Analyzer::Automaton
-
Generic API for finite-state automaton analyzers.
- DTA::CAB::Analyzer::Automaton::Gfsm
-
Finite-state analyzer base class using Gfsm for low-level automaton operations (lookup).
- DTA::CAB::Analyzer::Automaton::Gfsm::XL
-
Finite-state analyzer base class using Gfsm::XL for low-level automaton operations (k-best cascade lookup).
- DTA::CAB::Analyzer::Dict
-
Full-form dictionary-based analyzer (aka "cache") using a flat hash.
- DTA::CAB::Analyzer::Dict::BDB
-
Full-form dictionary-based analyzer (aka "cache") using Berkeley DB.
- DTA::CAB::Analyzer::Dict::CDB
-
Full-form dictionary-based analyzer (aka "cache") using CDB.
- DTA::CAB::Analyzer::EqLemma
-
Lemma-equivalence expander (wrapper).
- DTA::CAB::Analyzer::EqPho
-
Phonetic equivalence-class expander (wrapper).
- DTA::CAB::Analyzer::EqRW
-
Rewrite equivalence-class expander (wrapper).
- DTA::CAB::Analyzer::Lemmatizer
-
Lemma extractor for TAGH morphological analyses.
- DTA::CAB::Analyzer::LTS
-
Letter-To-Sound (phonetic) analysis via Gfsm automaton lookup.
- DTA::CAB::Analyzer::Moot
-
Hidden Markov Model Viterbi decoder using libmoot.
- DTA::CAB::Analyzer::Moot::DynLex
-
Dynamic-Lexicon Hidden Markov Model Viterbi decoder using libmoot.
- DTA::CAB::Analyzer::Morph
-
Morphological analysis via Gfsm automaton lookup.
- DTA::CAB::Analyzer::Morph::Latin
-
Latin pseudo-morphological analyzer (wrapper).
- DTA::CAB::Analyzer::MorphSafe
-
Safety heuristics for analyses output by TAGH via DTA::CAB::Analyzer::Morph.
- DTA::CAB::Analyzer::Null
-
Null analyzer, for testing purposes.
- DTA::CAB::Analyzer::Rewrite
-
Error-correction (rewrite) analyzer using a Gfsm::XL cascade.
- DTA::CAB::Analyzer::RewriteSub
-
Sub-analyzer for rewrite output.
- DTA::CAB::Analyzer::TokPP
-
Type-level heuristic token preprocessor (for punctuation etc.).
- DTA::CAB::Analyzer::Unicruft
-
Transliterator for Latin-1 approximation using libunicruft.
- DTA::CAB::Chain
-
Analyzer chains (aka "pipelines").
- DTA::CAB::Chain::Multi
-
Analyzer multi-chains (collection of named pipelines).
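The chain ("pipeline") idea listed above can be illustrated with a small self-contained sketch. It deliberately does not use the DTA::CAB API: the Toy::* packages, the token hash keys (text, lower, dict), and the dictionary entry are all hypothetical stand-ins, chosen only to show how a chain might pass one document through several analyzers in order, with a full-form dictionary acting as a lookup stage.

```perl
use strict;
use warnings;

# NOTE: hypothetical stand-ins only; these are NOT DTA::CAB classes.

package Toy::Analyzer::Lower {
    sub new { my ($class, %args) = @_; return bless { %args }, $class; }

    # Adds a lower-cased form to every token of the document.
    sub analyzeDocument {
        my ($self, $doc) = @_;
        $_->{lower} = lc $_->{text} for @$doc;
        return $doc;
    }
}

package Toy::Analyzer::Dict {
    sub new { my ($class, %args) = @_; return bless { dict => {}, %args }, $class; }

    # Full-form lookup in a flat hash, keyed on the lower-cased form if present.
    sub analyzeDocument {
        my ($self, $doc) = @_;
        for my $tok (@$doc) {
            my $key = $tok->{lower} // $tok->{text};
            $tok->{dict} = $self->{dict}{$key} if exists $self->{dict}{$key};
        }
        return $doc;
    }
}

package Toy::Chain {
    sub new { my ($class, %args) = @_; return bless { chain => [], %args }, $class; }

    # Runs each analyzer in order, feeding its output to the next one.
    sub analyzeDocument {
        my ($self, $doc, %opts) = @_;
        $doc = $_->analyzeDocument($doc, %opts) for @{ $self->{chain} };
        return $doc;
    }
}

package main;

my $chain = Toy::Chain->new(chain => [
    Toy::Analyzer::Lower->new,
    Toy::Analyzer::Dict->new(dict => { theil => 'Teil' }),
]);

# A "document" here is just an array of token hashes (an assumption of this sketch).
my $doc = $chain->analyzeDocument([ { text => 'Theil' }, { text => 'Haus' } ]);
```

Because the chain exposes the same analyzeDocument() method as its members, it can itself be placed inside another chain; whether the real classes behave this way is best checked against their own documentation.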
AUTHOR
Bryan Jurish <moocow@cpan.org>
COPYRIGHT AND LICENSE
Copyright (C) 2008-2019 by Bryan Jurish
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.24.1 or, at your option, any later version of Perl 5 you may have available.
SEE ALSO
dta-cab-analyze.perl(1), DTA::CAB::Analyzer(3pm), DTA::CAB::Chain(3pm), DTA::CAB(3pm), perl(1), ...