NAME
NanoB2B::NER - turns labeled text lines into ARFF files based on specified features that are extracted using MetaMap and runs through WEKA to average the results
DESCRIPTION
This package turns labeled text lines into ARFF files based on specified features that are extracted using MetaMap and runs through WEKA to average the results
For more information please see the NanoB2B::NER.pm documentation.
SYNOPSIS
add synopsis
ABSTRACT
There is a critical need to automatically extract and synthesize knowledge and trends in nanotechnology research from an exponentially increasing body of literature. Engineered nanomaterials (ENMs), such as nanomedicines, are continuously being discovered and Natural Language Processing approaches can semi‐automate the cataloging of ENMs and their unique physico‐chemical properties; automatically aggregate studies on their exposure and hazards; and link the physicochemical properties to the measured effects. The goal of this project is to develop a nanomedicine entity extraction system to automatically identify nanomedicine physico-characteristics, exposure and biological effects.
INSTALL
To install the module, run the following magic commands:
perl Makefile.PL
make
make test
make install
This will install the module in the standard location. You will, most probably, require root privileges to install in standard system directories. To install in a non-standard directory, specify a prefix during the 'perl Makefile.PL' stage as:
perl Makefile.PL PREFIX=/home/milk
It is possible to modify other parameters during installation. The details of these can be found in the ExtUtils::MakeMaker documentation. However, it is highly recommended not messing around with other parameters, unless you know what you're doing.
FUNCTION DESCRIPTIONS
NAME
NanoB2B-NNER-PM::NER - The main file that runs all of the processes for NER
DESCRIPTION
This package turns nanoparticle texts into ARFF files and WEKA accuracy files based on the nanoparticle characteristics found from pre-annotated articles
VERSION
Version 1.01
INITIALIZING THE MODULE
To create an instance of the ner module, using default values for all configuration options:
use NanoB2B::NER;
my %params = ();
$params{'dir'} = "my_directory";
$params{'features'} = "ortho morph text pos cui sem";
my $nner = new NanoB2B::NER(\%params);
nerByFile
description:
Runs the files specified in the parameters program_dir metamaps all the files, arffs all the files, wekas all the files, and averages all the files
This NER method doesn't move on to the next file until all the methods have been used
input:
None
output:
Metamap files, ARFF file sets, Weka file sets, and Averaged Accuracy files
example:
use NanoB2B::NER;
my %params = ();
$params{'dir'} = "my_directory";
$params{'features'} = "ortho morph text pos cui sem";
my $nner = new NanoB2B::NER(\%params);
$nner->nerByFile();
nerByMethod
description:
Runs the files specified in the parameters program_dir metamaps all the files, arffs all the files, wekas all the files, and averages all the files
This NER method doesn't move on to the next method until all the files have been processed
input:
None
output:
Metamap files, ARFF file sets, Weka file sets, and Averaged Accuracy files
example:
use NanoB2B::NER;
my %params = ();
$params{'dir'} = "my_directory";
$params{'features'} = "ortho morph text pos cui sem";
my $nner = new NanoB2B::NER(\%params);
$nner->nerByMethod();
metaSet
description:
Runs a set of files through metamap
input:
None
output:
Metamap files for every file found in the directory specified in the constructor parameters
example:
use NanoB2B::NER;
my %params = ();
$params{'dir'} = "my_directory";
$params{'features'} = "ortho morph text pos cui sem";
my $nner = new NanoB2B::NER(\%params);
$nner->metaSet();
arffSet
description:
Turns a set of files into ARFF files based on the features specificied in the constructor parameters
input:
None
output:
ARFF file sets for every file found in the directory specified in the constructor parameters
example:
use NanoB2B::NER;
my %params = ();
$params{'dir'} = "my_directory";
$params{'features'} = "ortho morph text pos cui sem";
my $nner = new NanoB2B::NER(\%params);
$nner->arffSet();
wekaSet
description:
Runs a set of ARFF files through WEKA
input:
None
output:
WEKA files for every file found in the directory specified in the constructor parameters
example:
use NanoB2B::NER;
my %params = ();
$params{'dir'} = "my_directory";
$params{'features'} = "ortho morph text pos cui sem";
my $nner = new NanoB2B::NER(\%params);
$nner->wekaSet();
modelSet
description:
Creates WEKA models from the training ARFF files
input:
None
output:
WEKA model files for every file with training ARFF files
example:
use NanoB2B::NER;
my %params = ();
$params{'dir'} = "my_directory";
$params{'features'} = "ortho morph text pos cui sem";
my $nner = new NanoB2B::NER(\%params);
$nner->modelSet();
avgSet
description:
Averages together a set of WEKA files
input:
None
output:
Average accuracy files for every file found in the directory specified in the constructor parameters
example:
use NanoB2B::NER;
my %params = ();
$params{'dir'} = "my_directory";
$params{'features'} = "ortho morph text pos cui sem";
my $nner = new NanoB2B::NER(\%params);
$nner->avgSet();
SEE ALSO
AUTHOR
Megan Charity <charityml@vcu.edu> Bridget T McInnes <btmcinnes@vcu.edu>
COPYRIGHT
Copyright (c) 2017
Megan Charity, Virginia Commonwealth University
charityml at vcu.edu
Bridget T. McInnes, Virginia Commonwealth University
btmcinnes at vcu.edu
This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
You should have received a copy of the GNU General Public License along with this program; if not, write to
The Free Software Foundation, Inc.,
59 Temple Place - Suite 330,
Boston, MA 02111-1307, USA.
1 POD Error
The following errors were encountered while parsing the POD:
- Around line 60:
Non-ASCII character seen before =encoding in 'semi‐automate'. Assuming UTF-8