NAME
Lingua::EN::GivenNames::Database::Download - An SQLite database of derivations of English given names
Synopsis
See "Synopsis" in Lingua::EN::GivenNames for a long synopsis.
See also "How do the scripts and modules interact to produce the data?" in Lingua::EN::GivenNames.
Description
This module documents the methods used to download the web pages whose contents are imported into lingua.en.givennames.sqlite (which ships with this distro).
Specifically, downloads these pages (for sex in ['female', 'male']):
Input: http://www.20000-names.com/${sex}_english_names*.htm.
Output: data/${sex}_english_names*.htm.
The * means there is a set of pages for each sex.
See scripts/get.name.pages.pl.
Note: These pages have been downloaded, and are shipped with the distro.
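The input and output patterns above can be sketched as a small helper. This is an illustration only: page_paths() is a hypothetical function, not part of this module, and the '_02' suffix is a made-up stand-in for the '*' in the patterns, whose real values are not given here.

```perl
use strict;
use warnings;

# Hypothetical helper: build the input URL and the output path for one page.
# $suffix stands in for the '*' in the patterns above.
sub page_paths
{
	my($sex, $suffix) = @_;
	my($page_name)    = "${sex}_english_names${suffix}.htm";

	return ("http://www.20000-names.com/$page_name", "data/$page_name");
}

for my $sex (qw/female male/)
{
	# '_02' is an assumed suffix, used only for demonstration.
	my($url, $data_file) = page_paths($sex, '_02');

	print "$url => $data_file\n";
}
```

The same page name is used on both sides, so a downloaded file can always be traced back to its source URL.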
Constructor and initialization
new(...) returns an object of type Lingua::EN::GivenNames::Database::Download.
This is the class's constructor.
Usage: Lingua::EN::GivenNames::Database::Download -> new().
Distributions
This module is available as a Unix-style distro (*.tgz).
See http://savage.net.au/Perl-modules.html for details.
See http://savage.net.au/Perl-modules/html/installing-a-module.html for help on unpacking and installing.
Methods
This module is a sub-class of Lingua::EN::GivenNames::Database and consequently inherits its methods.
get_1_page($url, $data_file)
Called by get_name_pages().
Download $url and save it in $data_file. $data_file takes the form 'data/${sex}_english_names*.htm'.
Returns 0 to indicate success.
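A minimal sketch of a download step with the same calling and return convention follows. It is not the module's own code: fetch_page() is a hypothetical name, HTTP::Tiny is an assumed choice of user agent, the optional third parameter exists only so a different agent can be supplied, and dying on failure is an assumption, since failure handling is not documented here.

```perl
use strict;
use warnings;

use HTTP::Tiny;

# Hypothetical stand-in for get_1_page(): fetch $url and save the body
# in $data_file. Returns 0 on success; dies on any failure (assumption).
sub fetch_page
{
	my($url, $data_file, $http) = @_;

	# Default to HTTP::Tiny unless the caller supplies another agent.
	$http ||= HTTP::Tiny -> new(timeout => 30);

	my($response) = $http -> get($url);

	die "Can't get $url: $response->{status} $response->{reason}\n" if (! $response->{success});

	open(my $fh, '>:raw', $data_file) || die "Can't open $data_file: $!\n";
	print $fh $response->{content};
	close($fh) || die "Can't close $data_file: $!\n";

	return 0;
}
```

A call then takes the form fetch_page($url, $data_file), with $data_file under data/ as described above.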
get_name_pages()
Downloads 20 pages of female given names and 17 pages of male given names.
See scripts/get.name.pages.pl.
Returns the result of the last call to "get_1_page($url, $data_file)", which will be 0, to indicate success.
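The loop and its return convention might look like the sketch below. The page counts (20 female, 17 male) come from the text above; everything else is assumed for illustration: fetch_all_pages() is a hypothetical name, the '_NN' page suffix is invented, and the fetcher is passed in as a code ref so the sketch runs without network access.

```perl
use strict;
use warnings;

# Page counts per sex, as stated above.
my(%page_count) = (female => 20, male => 17);

# Hypothetical driver: $fetch is any code ref which takes ($url, $data_file)
# and returns 0 on success. The '_NN' suffix is an assumption.
sub fetch_all_pages
{
	my($fetch)  = @_;
	my($result) = 0;

	for my $sex (sort keys %page_count)
	{
		for my $page (1 .. $page_count{$sex})
		{
			my($name) = sprintf('%s_english_names_%02d.htm', $sex, $page);
			$result   = $fetch -> ("http://www.20000-names.com/$name", "data/$name");
		}
	}

	# Only the result of the last call is returned.
	return $result;
}

# Demonstration with a stub fetcher which records each URL and reports success.
my(@seen);
my($status) = fetch_all_pages(sub { push @seen, $_[0]; return 0 });

print scalar(@seen), " pages requested, status $status\n";
```

Because only the last result is returned, an earlier failure would go unnoticed unless the fetcher dies or the caller accumulates results itself.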
new()
See "Constructor and initialization".
url()
Returns the string 'http://www.20000-names.com/'.
FAQ
For the database schema, etc, see "FAQ" in Lingua::EN::GivenNames.
References
See "References" in Lingua::EN::GivenNames.
Support
Email the author, or log a bug on RT:
https://rt.cpan.org/Public/Dist/Display.html?Name=Lingua::EN::GivenNames.
Author
Lingua::EN::GivenNames was written by Ron Savage <ron@savage.net.au> in 2012.
Home page: http://savage.net.au/index.html.
Copyright
Australian copyright (c) 2012 Ron Savage.
All Programs of mine are 'OSI Certified Open Source Software';
you can redistribute them and/or modify them under the terms of
The Artistic License, a copy of which is available at:
http://www.opensource.org/licenses/index.html