NAME

Treex::Block::Read::BaseAlignedTextReader - abstract ancestor for parallel-corpora document readers

VERSION

version 0.07191

DESCRIPTION

This class serves as a common ancestor for document readers that take a parameter from, a space- or comma-separated list of filenames to be loaded, and that load the documents from plain text files. It is designed to implement the Treex::Core::DocumentReader interface.
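As a rough illustration of what a "space or comma separated list of filenames" looks like once parsed, here is a minimal sketch in plain Perl. The helper name split_filenames and the sample filenames are hypothetical, and the exact separator handling inside Treex is an assumption and may differ.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of Treex): split a "from"-style value
# on runs of whitespace and/or commas, dropping empty fields.
sub split_filenames {
    my ($from) = @_;
    return grep { length } split /[\s,]+/, $from;
}

# Mixed separators are accepted in this sketch.
my @files = split_filenames('en1.txt, en2.txt en3.txt');
print "$_\n" for @files;
```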

In derived classes you need to define the next_document method; to implement it, you can use the next_document_texts and new_document methods.

METHODS

next_document_texts

Returns a hashref whose keys are zone labels and whose values are strings holding the contents of the corresponding files.
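The shape of such a hashref can be sketched without Treex itself. The example below is only an approximation under stated assumptions: slurp_zone_texts and write_temp are hypothetical helpers, the zone labels en and cs are made up, and the real method obtains its filenames from the reader's own parameters rather than from arguments.

```perl
use strict;
use warnings;
use File::Temp qw(tempfile);

# Hypothetical helper (not part of Treex): read one file per zone label
# and return a hashref of zone label => file contents.
sub slurp_zone_texts {
    my (%file_for_zone) = @_;
    my %text_for_zone;
    for my $zone_label ( sort keys %file_for_zone ) {
        open my $fh, '<:encoding(UTF-8)', $file_for_zone{$zone_label}
            or die "Cannot open $file_for_zone{$zone_label}: $!";
        local $/;    # slurp mode: read the whole file at once
        $text_for_zone{$zone_label} = <$fh>;
        close $fh;
    }
    return \%text_for_zone;
}

# Hypothetical helper: write a small temporary file standing in for
# one side of a parallel corpus.
sub write_temp {
    my ($content) = @_;
    my ( $fh, $filename ) = tempfile( UNLINK => 1 );
    binmode $fh, ':encoding(UTF-8)';
    print {$fh} $content;
    close $fh;
    return $filename;
}

my $texts = slurp_zone_texts(
    en => write_temp("Hello world.\nGood night.\n"),
    cs => write_temp("Ahoj svete.\nDobrou noc.\n"),
);

# A derived reader might then split each zone's text into units
# (here: one sentence per line) before building a document.
my %sentences = map { $_ => [ split /\n/, $texts->{$_} ] } keys %{$texts};
printf "%s: %d sentences\n", $_, scalar @{ $sentences{$_} }
    for sort keys %sentences;
```

Keeping the texts keyed by zone label lets a next_document implementation iterate over all zones uniformly, regardless of how many languages the parallel corpus contains.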

SEE

Treex::Block::Read::BaseAlignedReader, Treex::Block::Read::AlignedSentences

AUTHOR

Martin Popel

COPYRIGHT AND LICENSE

Copyright © 2011 by Institute of Formal and Applied Linguistics, Charles University in Prague

This module is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
