NAME

Lingua::EN::GivenNames::Database::Download - An SQLite database of derivations of English given names

Synopsis

See "Synopsis" in Lingua::EN::GivenNames for a long synopsis.

See also "How do the scripts and modules interact to produce the data?" in Lingua::EN::GivenNames.

Description

Documents the methods used to download web pages which will be imported into lingua.en.givennames.sqlite (which ships with this distro).

Specifically, downloads these pages (for sex in ['female', 'male']):

Input: http://www.20000-names.com/${sex}_english_names*.htm.

Output: data/${sex}_english_names*.htm.

The * means there is a set of pages for each sex.

See scripts/get.name.pages.pl.

Note: These pages have been downloaded, and are shipped with the distro.
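
As a quick check that the shipped pages are present, the sketch below counts the saved files for each sex. It is only an illustration: saved_pages() is a hypothetical helper, not part of this module, and it assumes the script runs from the distro's root directory, so that data/ is the output directory described above.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of the module): return the sorted list of
# saved pages for one sex ('female' or 'male') found in directory $dir.
sub saved_pages
{
    my($dir, $sex) = @_;

    # The * in the documented pattern is treated as a shell-style wildcard.
    my(@file) = glob("$dir/${sex}_english_names*.htm");

    return sort @file;
}

for my $sex (qw/female male/)
{
    my(@page) = saved_pages('data', $sex);

    print "$sex: ", scalar(@page), " page(s)\n";
}
```

If data/ is missing or empty, the script reports 0 pages for each sex.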

Constructor and initialization

new(...) returns an object of type Lingua::EN::GivenNames::Database::Download.

This is the class's constructor.

Usage: Lingua::EN::GivenNames::Database::Download -> new().

Distributions

This module is available as a Unix-style distro (*.tgz).

See http://savage.net.au/Perl-modules.html for details.

See http://savage.net.au/Perl-modules/html/installing-a-module.html for help on unpacking and installing.

Methods

This module is a sub-class of Lingua::EN::GivenNames::Database and consequently inherits its methods.

get_1_page($url, $data_file)

Called by get_name_pages().

Download $url and save it in $data_file. $data_file takes the form 'data/${sex}_english_names*.htm'.

Returns 0 to indicate success.
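
The download-and-save step might look something like the sketch below. It is not the module's actual code: save_1_page() and fetch_with_http_tiny() are hypothetical names, and the use of HTTP::Tiny is an assumption, since this document does not say which HTTP client the module uses. The optional third parameter exists only so the network call can be replaced during testing.

```perl
use strict;
use warnings;

use HTTP::Tiny;

# Assumed default fetcher: return the page body, or die on an HTTP failure.
sub fetch_with_http_tiny
{
    my($url)      = @_;
    my($response) = HTTP::Tiny -> new -> get($url);

    die "Unable to get $url: $$response{status} $$response{reason}\n" if (! $$response{success});

    return $$response{content};
}

# Hypothetical stand-in for get_1_page(): save the page at $url in $data_file.
# $fetch is an optional code ref which takes a URL and returns the page body.
sub save_1_page
{
    my($url, $data_file, $fetch) = @_;
    $fetch ||= \&fetch_with_http_tiny;

    my($content) = $fetch -> ($url);

    open(my $fh, '>', $data_file) or die "Unable to open $data_file: $!\n";
    binmode $fh; # The body is written as raw bytes.
    print $fh $content;
    close($fh) or die "Unable to close $data_file: $!\n";

    return 0; # Follows the documented convention: 0 means success.
}
```

Failures are reported by dying, not by a non-zero return value. That is a design choice of this sketch and not necessarily how the module behaves.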

get_name_pages()

Downloads 20 pages of female given names and 17 pages of male given names.

See scripts/get.name.pages.pl.

Returns the result of the last call to "get_1_page($url, $data_file)" (which will be 0) to indicate success.
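
The overall loop can be sketched as follows, using the page counts given above. This is a guess at the control flow only: get_all_pages() is a hypothetical name, and because this document does not spell out how the individual page file names are formed, the caller must supply a naming function ($page_name) as well as the function which downloads one page.

```perl
use strict;
use warnings;

# Page counts taken from the description of get_name_pages().
my(%page_count) = (female => 20, male => 17);

# Hypothetical stand-in for get_name_pages().
# $page_name -> ($sex, $n) returns the file name of page $n for $sex.
# $get_1_page -> ($url, $data_file) downloads one page, returning 0 on success.
sub get_all_pages
{
    my($base_url, $page_name, $get_1_page) = @_;

    my($result);

    for my $sex (qw/female male/)
    {
        for my $n (1 .. $page_count{$sex})
        {
            my($name) = $page_name -> ($sex, $n);
            $result   = $get_1_page -> ("$base_url$name", "data/$name");
        }
    }

    return $result; # The result of the last call only.
}
```

Since only the last result is returned, a failure on an earlier page would go unnoticed unless the per-page function dies or logs the error itself.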

new()

See "Constructor and initialization".

url()

Returns the string 'http://www.20000-names.com/'.

FAQ

For the database schema, etc, see "FAQ" in Lingua::EN::GivenNames.

References

See "References" in Lingua::EN::GivenNames.

Support

Email the author, or log a bug on RT:

https://rt.cpan.org/Public/Dist/Display.html?Name=Lingua::EN::GivenNames.

Author

Lingua::EN::GivenNames was written by Ron Savage <ron@savage.net.au> in 2012.

Home page: http://savage.net.au/index.html.

Copyright

Australian copyright (c) 2012 Ron Savage.

All Programs of mine are 'OSI Certified Open Source Software';
you can redistribute them and/or modify them under the terms of
The Artistic License, a copy of which is available at:
http://www.opensource.org/licenses/index.html