NAME
nat-codify - Command line tool to codify corpora
SYNOPSIS
nat-codify <file1.nat> <file2.nat>
nat-codify -tmx <file.tmx>
DESCRIPTION
The -tokenize
flag can be used to force NATools to tokenize the texts. Note that at the moment a Portuguese tokenizer is used for all languages. This might change in the future.
The -id=name
flag can be used to force NATools Corpora name. By default the name is read interactively.
The -q
flag can be used to force quite mode. In thic case, the name is extracted from the file-names.
The -lang=PT..EN
flag can be used to force languages.
SEE ALSO
NATools documentation, perl(1), nat-create
AUTHOR
Alberto Manuel Brandão Simões, <ambs@cpan.org>
COPYRIGHT AND LICENSE
Copyright (C) 2002-2012 by Alberto Manuel Brandão Simões
1 POD Error
The following errors were encountered while parsing the POD:
- Around line 130:
Non-ASCII character seen before =encoding in 'Brandão'. Assuming UTF-8