NAME

Treex::Tool::Lexicon::CS::Reflexivity

VERSION

version 0.13100

SYNOPSIS

use utf8;    # this file contains non-ASCII string literals
use Treex::Tool::Lexicon::CS::Reflexivity;

# Print each lemma with its reflexive suffix added where applicable
foreach my $lemma (qw(chodit zeptat troufat)) {
    print Treex::Tool::Lexicon::CS::Reflexivity::fix_reflexivity($lemma) . "\n";
}

if ( !Treex::Tool::Lexicon::CS::Reflexivity::is_possible_tantum('zvýhodnit_se') ) {
    # do something about it
}

DESCRIPTION

my $corrected_tlemma = Treex::Tool::Lexicon::CS::Reflexivity::fix_reflexivity($tlemma);

If the given Czech verb lemma is a reflexivum tantum (a verb that occurs only with a reflexive particle), the reflexive suffix "_si" or "_se" is appended to the lemma. The lookup is based on a list of reflexive verbs extracted from VALLEX 2.5.

my $bool = Treex::Tool::Lexicon::CS::Reflexivity::is_possible_tantum($tlemma);

Returns true only if the given lemma (such as "ptát_se", "jistý_si", "kolísat_se") is on the list of possible reflexiva tantum; of these three examples, the last one is not on the list. The list is based on VALLEX 2.5, PDT-VALLEX 2.0, and the PDT 2.0 data.
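The contract of the two functions can be illustrated with a small self-contained sketch. It does not load the module and is not its implementation: the hash %TOY_TANTUM and the subs toy_fix_reflexivity and toy_is_possible_tantum are hypothetical stand-ins, and the two entries are chosen only for illustration (the real lists come from VALLEX and the PDT data). The sketch also assumes that a lemma not on the list is returned unchanged, which the text above does not state explicitly.

```perl
use strict;
use warnings;

# Hypothetical stand-in for the VALLEX-derived list:
# bare verb lemma => reflexive particle it requires.
my %TOY_TANTUM = (
    zeptat  => 'se',
    troufat => 'si',
);

# Toy counterpart of fix_reflexivity(): append "_se" / "_si" when the
# lemma is listed; otherwise return it unchanged (an assumption).
sub toy_fix_reflexivity {
    my ($lemma) = @_;
    return $lemma if !exists $TOY_TANTUM{$lemma};
    return $lemma . '_' . $TOY_TANTUM{$lemma};
}

# Toy counterpart of is_possible_tantum(): expects a lemma that already
# carries the suffix and checks it against the list.
sub toy_is_possible_tantum {
    my ($lemma) = @_;
    my ( $base, $particle ) = $lemma =~ /^(.+)_(se|si)$/;
    return 0 if !defined $base;    # no reflexive suffix at all
    return ( exists $TOY_TANTUM{$base} && $TOY_TANTUM{$base} eq $particle ) ? 1 : 0;
}

for my $lemma (qw(chodit zeptat troufat)) {
    print toy_fix_reflexivity($lemma), "\n";
}
# prints:
# chodit
# zeptat_se
# troufat_si
```

Note the asymmetry in the sketch: the first function takes a bare lemma and may add a suffix, while the second takes a lemma that already includes "_se" or "_si", mirroring the example arguments given above.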

TODO

The list of possible reflexives should probably be narrowed down, since it also contains verbs for which the reflexive tantum variant from the dictionary is either highly unlikely to occur (e.g. "dělat_se" as in "Dělá se hezky.", "The weather is turning nice.") or not entirely justifiable (e.g. "seznámit se" or "zabít se").

AUTHORS

Zdeněk Žabokrtský <zabokrtsky@ufal.mff.cuni.cz>

Ondřej Dušek <odusek@ufal.mff.cuni.cz>

COPYRIGHT AND LICENSE

Copyright © 2011 by Institute of Formal and Applied Linguistics, Charles University in Prague

This module is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
