NAME
Lingua::EN::NameCase - Correctly case a person's name from UPERCASE or lowcase
SYNOPSIS
# Working with scalars; complementing lc and uc.
use Lingua::EN::NameCase qw( nc );
$FixedCasedName = nc( $OriginalName );
$FixedCasedName = nc( \$OriginalName );
# Working with arrays or array references.
use Lingua::EN::NameCase 'NameCase';
$FixedCasedName = NameCase( $OriginalName );
@FixedCasedNames = NameCase( @OriginalNames );
$FixedCasedName = NameCase( \$OriginalName );
@FixedCasedNames = NameCase( \@OriginalNames );
NameCase( \@OriginalNames ) ; # In-place.
# NameCase will not change a scalar in-place, i.e.
NameCase( \$OriginalName ) ; # WRONG: null operation.
$Lingua::EN::NameCase::SPANISH = 1;
# Now 'El' => 'El' instead of (default) Greek 'El' => 'el'.
# Now 'La' => 'La' instead of (default) French 'La' => 'la'.
$Lingua::EN::NameCase::HEBREW = 0;
# Now 'Aharon BEN Amram Ha-Kohein' => 'Aharon Ben Amram Ha-Kohein'
# instead of (default) => 'Aharon ben Amram Ha-Kohein'.
$Lingua::EN::NameCase::ROMAN = 0;
# Now 'Li' => 'Li' instead of (default) 'Li' => 'LI'.
$Lingua::EN::NameCase::POSTNOMINAL = 0;
# Now 'PHD' => 'PhD' instead of (default) 'PHD' => 'Phd'.
DESCRIPTION
Forenames and surnames are often stored either wholly in UPPERCASE or wholly in lowercase. This module allows you to convert names into the correct case where possible.
Although forenames and surnames are normally stored separately if they do appear in a single string, whitespace separated, NameCase and nc deal correctly with them.
NameCase currently correctly name cases names which include any of the following:
Mc, Mac, al, el, ap, da, de, delle, della, di, du, del, der,
la, le, lo, van and von.
It correctly deals with names which contain apostrophes and hyphens too.
EXAMPLE FIXES
Original Name Case
-------- ---------
KEITH Keith
LEIGH-WILLIAMS Leigh-Williams
MCCARTHY McCarthy
O'CALLAGHAN O'Callaghan
ST. JOHN St. John
plus "son (daughter) of" etc. in various languages, e.g.:
VON STREIT von Streit
VAN DYKE van Dyke
AP LLWYD DAFYDD ap Llwyd Dafydd
etc.
plus names with roman numerals (up to 89, LXXXIX), e.g.:
henry viii Henry VIII
louis xiv Louis XIV
METHODS
NameCase
Takes a scalar, scalarref, array or arrayref, and changes the case of the contents, as appropriate. Essentially a wrapper around nc().
nc
Takes a scalar or scalarref, and change the case of the name in the corresponding string appropriately.
BUGS
The module covers the rules that I know of. There are probably a lot more rules, exceptions etc. for "Western"-style languages which could be incorporated.
There are probably lots of exceptions and problems - but as a general data 'cleaner' it may be all you need.
Use Kim Ryan's NameParse.pm for any really sophisticated name parsing.
AUTHOR
1998-2014 Mark Summerfield <summer@qtrac.eu>
2014-present Barbie <barbie@cpan.org>
ACKNOWLEDGEMENTS
Thanks to Kim Ryan <kimaryan@ozemail.com.au> for his Mc/Mac solution.
COPYRIGHT
Copyright (c) Mark Summerfield 1998-2014. All Rights Reserved. Copyright (c) Barbie 2014. All Rights Reserved.
This distribution is free software; you can redistribute it and/or modify it under the Artistic Licence v2.