NAME

Treex::Block::Read::CoNLLX

VERSION

version 0.13095

DESCRIPTION

Document reader for CoNLL format. Each token is on separated line in the following format: ord<tab>form<tab>lemma<tab>cpos<tab>pos<tab>features<tab>head<tab>deprel Sentences are separated with blank line. The sentences are stored into bundles in the document.

ATTRIBUTES

from

space or comma separated list of filenames

lines_per_doc

number of sentences (!) per document

METHODS

next_document

Loads a document.

SEE

Treex::Block::Read::BaseTextReader Treex::Core::Document Treex::Core::Bundle

AUTHOR

David Mareček

COPYRIGHT AND LICENSE

Copyright © 2011-2013 by Institute of Formal and Applied Linguistics, Charles University in Prague

This module is free software; you can redistribute it and/or modify it under the same terms as Perl itself.

1 POD Error

The following errors were encountered while parsing the POD:

Around line 106:

Non-ASCII character seen before =encoding in 'Mareček'. Assuming UTF-8