Lingua-YaTeA-0.6

Changes for version 0.6

Add of installation tests
Add of examples for French
Integration of the MIG corrections (Node, Testified terms)
Correction in the forbidden structure management (split action)
Add NEXT.pm in the pre-required module list
UTF-8 is used as charset for both configuration files and texts to process
Integration of attested terms (in a bracketed format, can be produced by the bootstrap option)
Option "bootstrap" to generate an output in a bracketed format. This output can be used as a attested resource on a other corpus
Add a defined test in the function appendInclude (but it seems there is still a bug on tree building).
Monolexical term occurrences are identified as MNP when the monolexical oprion is set
Addition of a config set for French based on the POSTagger Flemm (which uses the Multex tagset) and modification of the read method in Corpus.pm (addition opf the parameter language) to normalisze the input (correction of the output of Flemm)
Correction in the input normalization for Flemm
Each printing function can now print results on stdout
Replace ':' as tag by 'COLUMN'
Add of some Chunking Frontiers as '('
Only the tag is taken into account for sentence_boundary detection
Add Weights section to the DTD and in the XML rendering
Correction: TF-IDF based ranking method is a DDW ranking
Add ROOT as a reference to the term in which the current term is nested
Addition of several term weighting and selection measures (C-Value and variations, iLong, ilnc, iLong, and term autonomy)
Add the status of the term candidate (0 : not a term, 1 : term). By default, term candidate are terms
Add a option for term output style to print only term candidate having the status of term
colors of the terms in the HTML output can be parametrized (option file, options PARSED_COLOR and UNPARSED_COLOR)
new function for printing only list of candidate terms without XML header
Addition of the option XML-corpus-raw, rendering the corpus in XML format with terms. Documents and sentences are identified in the XML files.

Documentation

Perl script for extracting terms from a corpus of texts and providing a syntactic analysis in a head-modifier representation.

Modules

Lingua::YaTeA

Perl extension for extracting terms from a corpus and providing a syntactic analysis in a head-modifier format.

Lingua::YaTeA::AnnotationMark

Perl extension for annotation marks

Lingua::YaTeA::ChunkingDataSet

Perl extension for the set of chuncking data

Lingua::YaTeA::ChunkingDataSubset

Perl extension for subset of chuncking data.

Lingua::YaTeA::Corpus

Perl extension for ???

Lingua::YaTeA::Document

Perl extension for words of input document

Lingua::YaTeA::DocumentSet

Perl extension for document set

Lingua::YaTeA::Edge

Perl extension for edge between nodes

Lingua::YaTeA::File

Perl extension for managing information related to a configuration file.

Lingua::YaTeA::FileSet

Perl extension for managing the directory containing the configuration file set given a language.

Lingua::YaTeA::ForbiddenStructure

Perl extension for the forbidden structures.

Lingua::YaTeA::ForbiddenStructureAny

Perl extension for forbidden structures in any position of a chunk.

Lingua::YaTeA::ForbiddenStructureMark

Perl extension for mananging the annotation marks for the forbidden structures

Lingua::YaTeA::ForbiddenStructureSet

Perl extension for managing the forbiddent structures.

Lingua::YaTeA::ForbiddenStructureStartOrEnd

Perl extension for forbidden structures in at the start or end position of a chunk.

Lingua::YaTeA::IndexSet

Perl extension for ???

Lingua::YaTeA::InternalNode

Perl extension for internal nodes

Lingua::YaTeA::Island

Perl extension for island of reliability

Lingua::YaTeA::IslandSet

Perl extension for set of reliability islands

Lingua::YaTeA::Lexicon

Perl extension for lexicon of the corpus.

Lingua::YaTeA::LexiconItem

Perl extension for representing word

Lingua::YaTeA::LinguisticItem

Perl extension for the linguistic item of the forbiddent structures

Lingua::YaTeA::Message

Perl extension for managing a message in the term extractor

Lingua::YaTeA::MessageSet

Perl extension for message set

Lingua::YaTeA::MonolexicalPhrase

Perl extension for monoloexical phrases

Lingua::YaTeA::MonolexicalTermCandidate

Perl extension for the monolexical term candidate

Lingua::YaTeA::MonolexicalTestifiedTerm

Perl extension for monolexical testified terms

Lingua::YaTeA::MonolexicalUnit

Perl extension for monolexical word

Lingua::YaTeA::MultiWordPhrase

Perl extension for ???

Lingua::YaTeA::MultiWordTermCandidate

Perl extension for ???

Lingua::YaTeA::MultiWordTestifiedTerm

Perl extension for multi-word testified terms

Lingua::YaTeA::MultiWordUnit

Perl extension for ???

Lingua::YaTeA::Node

Perl extension for ???

Lingua::YaTeA::NodeSet

Perl extension for ???

Lingua::YaTeA::Occurrence

Perl extension for the phrase occurrences

Lingua::YaTeA::Option

Perl extension for option of the term extraction process

Lingua::YaTeA::OptionSet

Perl extension for handling option set in YaTeA

Lingua::YaTeA::ParsingPattern

Perl extension for parsing pattern

Lingua::YaTeA::ParsingPatternParser

Perl extension for parsing the file containing the parsing patterns (based on Parse::Yapp)

Lingua::YaTeA::ParsingPatternRecord

Perl extension for recording parsing patterns

Lingua::YaTeA::ParsingPatternRecordSet

Perl extension for managing the set of the parsing patterns

Lingua::YaTeA::PatternLeaf

Perl extension for the leaf node of a syntactic pattern tree

Lingua::YaTeA::Phrase

Perl extension for phrases corresponding to the parsed terms

Lingua::YaTeA::PhraseSet

Perl extension for ???

Lingua::YaTeA::RootNode

Perl extension for the root node of the syntactic tree of a term

Lingua::YaTeA::Sentence

Perl extension for sentence

Lingua::YaTeA::SentenceSet

Perl extension for the sentence set

Lingua::YaTeA::TagSet

Perl extension for managing the set of Part-of-Speech tags and inflected that can be accepted in the terms.

Lingua::YaTeA::TermCandidate

Perl extension for Term Candidate

Lingua::YaTeA::TermLeaf

Perl extension for leaf node of term tree

Lingua::YaTeA::TestifiedTerm

Perl extension for Testified Term

Lingua::YaTeA::TestifiedTermMark

Perl extension for marks of testified terms

Lingua::YaTeA::TestifiedTermParser

Perl extension for the parser of testified term file (based on Parse::Yapp)

Lingua::YaTeA::TestifiedTermSet

Perl extension for ???

Lingua::YaTeA::Tree

Perl extension for ???

Lingua::YaTeA::Trigger

Perl extension for a trigger.

Lingua::YaTeA::TriggerSet

Perl extension for managing the trigger set

Lingua::YaTeA::WordFromCorpus

Perl extension for managing word of the corpus and related information

Lingua::YaTeA::WordOccurrence

Perl extension for managing word occurrence

Lingua::YaTeA::XMLEntities

Perl extension for managing characters which can not be used in a XML document

Examples

Other files

To install Lingua::YaTeA, copy and paste the appropriate command in to your terminal.

cpanm

cpanm Lingua::YaTeA

CPAN shell

perl -MCPAN -e shell
install Lingua::YaTeA

For more information on module installation, please visit the detailed CPAN module installation guide.

	Global
`s`	Focus search bar
`?`	Bring up this help dialog

	GitHub
`g` `p`	Go to pull requests
`g` `i`	go to github issues (only if github is preferred repository)

	POD
`g` `a`	Go to author
`g` `c`	Go to changes
`g` `i`	Go to issues
`g` `d`	Go to dist
`g` `r`	Go to repository/SCM
`g` `s`	Go to source
`g` `b`	Go to file browse

	Search terms
module: (e.g. module:Plugin)
distribution: (e.g. distribution:Dancer auth)
author: (e.g. author:SONGMU Redis)
version: (e.g. version:1.00)

Changes for version 0.6

Documentation

Modules

Examples

Other files

Module Install Instructions