The Perl and Raku Conference 2025: Greenville, South Carolina - June 27-29 Learn more

NAME

DTA::CAB::Format::ExpandList - Datum I/O: expansion list for use with DDC

SYNOPSIS

##========================================================================
## Methods: Constructors etc.
$fmt = CLASS_OR_OBJ->new(%args)
##========================================================================
## Methods: Output
$type = $fmt->mimeType();
$ext = $fmt->defaultExtension();
$fmt = $fmt->putToken($tok);

DESCRIPTION

DTA::CAB::Format::ExpandList is a DTA::CAB::Format subclass intended for use in a CAB HTTP server as a CAB-class term expander for the DDC corpus query engine. As for DTA::CAB::Format::TT (from which this class inherits), each token is represented by a single line and sentence boundaries are represented by blank lines. Token lines have the format:

ORIG_TEXT EQUIVALENT(s)...

Where EQUIVALENT(s) is a list of TAB-separated equivalent forms as determined by the analysis phase.

Methods: Constructors etc.

new
$fmt = CLASS_OR_OBJECT->new(%args);

Recognized %args:

##---- Input
doc => $doc, ##-- buffered input document
##---- Output
level => $formatLevel, ##-- output formatting level
## 0: TAB-separated (default)
## 1: sorted, NEWLINE-separated
## 2: sorted, NEWLINE+TAB-separated
keys => \@expandKeys, ##-- keys to include (default: [qw(text xlit eqpho eqrw eqlemma eqtagh gn-syn gn-isa gn-asi)])
##---- Common
utf8 => $bool, ##-- default: 1

Methods: Output

mimeType
$type = $fmt->mimeType();

Default returns text/plain.

defaultExtension
$ext = $fmt->defaultExtension();

Deturns default filename extension for this format. Override returns '.xl'.

putToken
$fmt = $fmt->putToken($tok);

Appends $tok to output buffer.

AUTHOR

Bryan Jurish <moocow@cpan.org>

COPYRIGHT AND LICENSE

Copyright (C) 2011-2019 by Bryan Jurish

This package is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.24.1 or, at your option, any later version of Perl 5 you may have available.

SEE ALSO

dta-cab-analyze.perl(1), dta-cab-convert.perl(1), DTA::CAB::Format::TT(3pm), DTA::CAB::Format(3pm), DTA::CAB(3pm), ddc_opt(5), ddc_proto(5), perl(1), ...