NAME

Lingua::Word::Parser - Parse a word into known and unknown parts

VERSION

version 0.03

SYNOPSIS

use Data::Dumper;
use Lingua::Word::Parser;
my $p = Lingua::Word::Parser->new(
   word => 'abioticaly',
   file => 'eg/lexicon.dat',
);
# Or with a localhost database source:
$p = Lingua::Word::Parser->new(
   word   => 'abioticaly',
   dbname => 'fragments',
   dbuser => 'akbar',
   dbpass => '0p3n53454m3',
);
my ($known) = $p->knowns;
my $combos  = $p->power;
my $scored  = $p->score;
# The best guess is the last sorted score-set:
warn Dumper $scored->{ [ sort keys %$scored ]->[-1] };

DESCRIPTION

A Lingua::Word::Parser breaks a word into known affixes.

METHODS

new()

$x = Lingua::Word::Parser->new(%arguments);

Create a new Lingua::Word::Parser object.

Arguments and defaults:

word: undef
lex:  undef

fetch_lex()

Populate the lexicon of word-part regular expressions and their definitions from the lexicon file.

Each line of this file holds a regular expression followed by its definition:

a(?=\w) opposite
ab(?=\w) away
(?<=\w)o(?=\w) combining
(?<=\w)tic possessing

db_fetch()

Populate the lexicon from a database source called `fragments`.

This database table has records of the form:

       affix     definition
-----------------------------
       a(?=\w)   opposite
       ab(?=\w)  away
(?<=\w)o(?=\w)   combining
(?<=\w)tic       possessing

knowns()

Fingerprint the known word parts.
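
One way to picture a fingerprint is as a string of 0s and 1s, as long as the word, with 1s at the positions a known part covers. The sketch below works under that assumption. The helper name fingerprint and the returned hash layout are hypothetical, and the module's internal representation may differ.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of Lingua::Word::Parser): build one 0/1
# mask per regex match, each mask the same length as the word.
sub fingerprint {
    my ($word, @patterns) = @_;
    my %known;
    for my $pat (@patterns) {
        while ($word =~ /$pat/g) {
            # @- and @+ hold the start and end offsets of the last match.
            my ($start, $end) = ($-[0], $+[0]);
            next if $end == $start;     # ignore zero-length matches
            my $mask = ('0' x $start)
                     . ('1' x ($end - $start))
                     . ('0' x (length($word) - $end));
            $known{$mask} = substr($word, $start, $end - $start);
        }
    }
    return \%known;
}

my $known = fingerprint(
    'abioticaly',
    'a(?=\w)', 'ab(?=\w)', '(?<=\w)o(?=\w)', '(?<=\w)tic',
);
print "$_ => $known->{$_}\n" for sort keys %$known;
```

Because a pattern can match more than once, a single lexicon entry may yield several masks. Here a(?=\w) matches both the leading "a" and the "a" in "aly".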

power()

Find the "non-overlapping powerset."
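
Read literally, a "non-overlapping powerset" is every subset of the known parts in which no two parts cover the same character. The sketch below enumerates those subsets for 0/1 position masks. The helper names overlaps and non_overlapping_subsets are hypothetical, and this is an illustration of the idea rather than the module's own algorithm.

```perl
use strict;
use warnings;

# Hypothetical helper: true if two equal-length 0/1 masks share a '1' position.
sub overlaps {
    my ($x, $y) = @_;
    for my $i (0 .. length($x) - 1) {
        return 1 if substr($x, $i, 1) eq '1' && substr($y, $i, 1) eq '1';
    }
    return 0;
}

# Hypothetical helper: return every non-empty subset of masks (as array refs)
# whose members are pairwise non-overlapping.
sub non_overlapping_subsets {
    my ($masks, $from, @chosen) = @_;
    $from //= 0;
    my @found = @chosen ? ([@chosen]) : ();
    for my $i ($from .. $#$masks) {
        # Skip any mask that collides with one already chosen.
        next if grep { overlaps($_, $masks->[$i]) } @chosen;
        push @found, non_overlapping_subsets($masks, $i + 1, @chosen, $masks->[$i]);
    }
    return @found;
}

# Masks for "a", "ab", "o" and "tic" in the 10-letter word "abioticaly".
my @masks = qw(1000000000 1100000000 0001000000 0000111000);
my @sets  = non_overlapping_subsets(\@masks);
print join(' + ', @$_), "\n" for @sets;
```

Only "a" and "ab" collide in this example, so the subsets are any one of {a, ab, neither} combined with or without "o" and with or without "tic", minus the empty set.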

score()

$score = $p->score( $open_separator, $close_separator, $line_terminator );

Score the known vs. unknown word-part combinations as ratios of characters and of chunks (parts, or "spans of adjacent characters").

If not given, $open_separator, $close_separator and $line_terminator default to '<', '>' and '', respectively.
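
As an illustration of what character and chunk ratios could look like, the sketch below takes a word and a combined 0/1 mask of known positions, wraps each known span in the open and close separators, and computes the two ratios. The helper name mark_and_score, the returned keys and the exact ratio definitions are all assumptions made for this example; the module's actual output structure is not documented here. The line terminator is left out for brevity.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of Lingua::Word::Parser): mark the known
# spans of a word and compute known/total ratios for characters and chunks.
sub mark_and_score {
    my ($word, $mask, $open, $close) = @_;
    $open  //= '<';
    $close //= '>';
    my @chunks;
    my $marked = '';
    my $pos    = 0;
    # Each run of identical digits in the mask is one chunk of the word.
    while ($mask =~ /(0+|1+)/g) {
        my $len   = length $1;
        my $known = substr($1, 0, 1) eq '1';
        my $part  = substr($word, $pos, $len);
        push @chunks, $known;
        $marked .= $known ? "$open$part$close" : $part;
        $pos += $len;
    }
    my $known_chars  = ($mask =~ tr/1//);      # count of '1' positions
    my $known_chunks = grep { $_ } @chunks;    # count of known chunks
    return {
        marked      => $marked,
        char_ratio  => $known_chars / length($word),
        chunk_ratio => $known_chunks / @chunks,
    };
}

# "a" and "tic" known, "bio" and "aly" unknown.
my $s = mark_and_score('abioticaly', '1000111000');
print "$s->{marked}\n";    # prints <a>bio<tic>aly
printf "chars: %.2f, chunks: %.2f\n", $s->{char_ratio}, $s->{chunk_ratio};
```

With this mask, 4 of 10 characters and 2 of 4 chunks are known, so a combination that explains more of the word in fewer unknown spans would rate higher on both ratios.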

AUTHOR

Gene Boggs <gene@cpan.org>

COPYRIGHT AND LICENSE

This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.
