NAME
calculate-corpus-stats.pl - Perl script that reports corpus statistics for a semcor-reformatted corpus file.
SYNOPSIS
calculate-corpus-stats.pl --file FILE
DESCRIPTION
This script reports how the instances in the corpus are distributed across parts of speech. It also reports how they are distributed by the number of senses available for each instance. For example, the instance winter#n has only one sense associated with it, so it is counted among the instances with exactly one sense.
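The two distributions described above can be sketched as follows. This is a minimal illustration in Python, not the script's own Perl code. The function name corpus_stats, the list of word#pos instances, and the sense inventory SENSE_COUNTS are all assumptions made for the example; apart from winter#n, the sense counts are toy values, and how the script itself obtains sense counts is not covered here.

```python
from collections import Counter

# Hypothetical sense inventory: number of senses for each word#pos key.
# Only winter#n (1 sense) comes from the description above; the other
# values are toy numbers chosen for illustration.
SENSE_COUNTS = {"winter#n": 1, "run#v": 3, "cold#a": 2}


def corpus_stats(instances, sense_counts):
    """Return (instances per part of speech, instances per sense count)."""
    pos_dist = Counter()
    sense_dist = Counter()
    for inst in instances:
        # Split "winter#n" into the word and its part-of-speech tag.
        _word, pos = inst.rsplit("#", 1)
        pos_dist[pos] += 1
        # Bucket the instance by how many senses it has.
        sense_dist[sense_counts[inst]] += 1
    return pos_dist, sense_dist


# Hypothetical instances; a real corpus would be read from the input file.
instances = ["winter#n", "run#v", "cold#a", "winter#n"]
pos_dist, sense_dist = corpus_stats(instances, SENSE_COUNTS)

for pos, count in sorted(pos_dist.items()):
    print(f"pos {pos}: {count} instance(s)")
for n_senses, count in sorted(sense_dist.items()):
    print(f"{n_senses} sense(s): {count} instance(s)")
```

In this sketch both occurrences of winter#n fall into the bucket of instances with one sense, which mirrors the example given in the description.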
AUTHORS
Varada Kolhatkar, University of Minnesota, Duluth
<kolha002 at d.umn.edu>
Ted Pedersen, University of Minnesota, Duluth
<tpederse at d.umn.edu>
This document last modified by: $Id: calculate-corpus-stats.pl,v 1.3 2009/04/30 22:08:49 kvarada Exp $
SEE ALSO
semcor-reformat.pl
COPYRIGHT
Copyright (C) 2005-2008 by Jason Michelizzi and Ted Pedersen
This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.